Working Paper: NBER ID: w27391
Authors: Christopher R. Knittel; Bora Ozaltun
Abstract: We correlate county-level COVID-19 death rates with key variables using both linear regression and negative binomial mixed models, although we focus on linear regression models. We include four sets of variables: socio-economic variables, county-level health variables, modes of commuting, and climate and pollution patterns. Our analysis studies daily death rates from April 4, 2020 to May 27, 2020. We estimate correlation patterns both across states, as well as within states. For both models, we find higher shares of African American residents in the county are correlated with higher death rates. However, when we restrict ourselves to correlation patterns within a given state, the statistical significance of the correlation of death rates with the share of African Americans, while remaining positive, wanes. We find similar results for the share of elderly in the county. We find that higher amounts of commuting via public transportation, relative to telecommuting, is correlated with higher death rates. The correlation between driving into work, relative to telecommuting, and death rates is also positive across both models, but statistically significant only when we look across states and counties. We also find that a higher share of people not working, and thus not commuting either because they are elderly, children or unemployed, is correlated with higher death rates. Counties with higher home values, higher summer temperatures, and lower winter temperatures have higher death rates. Contrary to past work, we do not find a correlation between pollution and death rates. Also importantly, we do not find that death rates are correlated with obesity rates, ICU beds per capita, or poverty rates. Finally, our model that looks within states yields estimates of how a given state's death rate compares to other states after controlling for the variables included in our model; this may be interpreted as a measure of how states are doing relative to others. We find that death rates in the Northeast are substantially higher compared to other states, even when we control for the four sets of variables above. Death rates are also statistically significantly higher in Michigan, Louisiana, Iowa, Indiana, and Colorado. California's death rate is the lowest across all states.
Keywords: COVID-19; death rates; socioeconomic variables; commuting modes; climate
JEL Codes: I1; Q5; R4
Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.
Cause | Effect |
---|---|
Higher share of African American residents in a county (R23) | Higher COVID-19 death rates (I14) |
Higher share of elderly residents (J14) | Higher COVID-19 death rates (I14) |
Higher public transit usage relative to telecommuting (L91) | Higher COVID-19 death rates (I14) |
Higher median home values (R21) | Higher COVID-19 death rates (I14) |
Higher summer temperatures (Q54) | Higher COVID-19 death rates (I14) |
Lower winter temperatures (Q54) | Higher COVID-19 death rates (I14) |