Historical Newspaper Data: A Researcher's Guide and Toolkit

Working Paper: NBER ID: w30135

Authors: Brian Beach; W. Walker Hanlon

Abstract: Digitized historical newspaper databases offer a valuable research tool. A rapidly expanding set of studies use these databases to address a wide range of topics. We review this literature and provide a toolkit for researchers interested in working with historical newspaper data. We provide a brief description of the evolution of historical newspapers, focusing on aspects that are likely to have implications for the design of empirical studies. We then review the main databases in use. We also discuss some key challenges in using these data, most importantly the fact that even the most extensive datasets contain only a selected sample of the universe of historical newspaper articles. We offer tools for evaluating the comprehensiveness of available newspaper datasets, show how to assess potential identification concerns, and suggest some solutions.

Keywords: historical newspaper data; research toolkit; empirical studies; digitized newspapers

JEL Codes: N0


Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.


Causal Claims

CauseEffect
newspaper openings (M13)local exposure to trial news (K41)
newspaper openings (M13)dissemination of information regarding significant events (G14)
newspaper openings (M13)measurement of outcomes (C52)
newspaper openings (M13)influence on racial and ethnic discrimination (J15)
newspaper openings (M13)public support for movements (D72)
newspaper openings (M13)treatment mechanism for flow of information (C45)

Back to index