Working Paper: CEPR ID: DP3094
Authors: Bronwyn H. Hall; Adam B. Jaffe; Manuel Trajtenberg
Abstract: This Paper describes the database on US patents that we have developed over the past decade, with the goal of making it widely accessible for research. We present main trends in US patenting over the last 30 years, including a variety of original measures constructed with citation data, such as backward and forward citation lags, indices of ?originality? and ?generality?, self-citations, etc. Many of these measures exhibit interesting differences across the six main technological categories that we have developed (comprising Computers and Communications, Drugs and Medical, Electrical and Electronics, Chemical, Mechanical and Others), differences that call for further research. To stimulate such research, the entire database ? about 3 million patents and 16 million citations ? is now available on the NBER website. We discuss key issues that arise in the use of patent citations data, and suggest ways of addressing them. In particular, significant changes over time in the rate of patenting and in the number of citations made, as well as the inevitable truncation of the data, make it very hard to use the raw number of citations received by different patents directly in a meaningful way. To remedy this problem we suggest two alternative approaches: the fixed-effects approach involves scaling citations by the average citation count for a group of patents to which the patent of interest belongs; the quasi-structural approach attempts to distinguish the multiple effects on citation rates via econometric estimation.
Keywords: citations; data construction; patents; truncation
JEL Codes: O30
Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.
Cause | Effect |
---|---|
number of citations received by patents varies significantly across different technological categories (O38) | impact of innovations is heterogeneous (F61) |
innovations in the fields of computers and communications, as well as drugs and medical technologies tend to receive more citations (O31) | innovations in traditional fields such as chemical and mechanical technologies receive fewer citations (O39) |
mean number of citations received by patents in emerging fields is higher (O36) | these fields are more impactful in terms of technological significance (O30) |
citation practices have evolved over time (B25) | increase in the number of citations made per patent (O34) |
increase in the number of citations made per patent (O34) | potential artifact of changing patent office practices rather than a reflection of increased innovation value (O34) |
citation counts are not straightforward measures of patent significance (O34) | need to control for citation timing and truncation effects when analyzing citation data (C41) |