Diagnosing Physician Error: A Machine Learning Approach to Low-Value Health Care

Working Paper: NBER ID: w26168

Authors: Sendhil Mullainathan; Ziad Obermeyer

Abstract: We use machine learning as a tool to study decision making, focusing specifically on how physicians diagnose heart attack. An algorithmic model of a patient’s probability of heart attack allows us to identify cases where physician testing decisions deviate from predicted risk. We then use actual health outcomes to evaluate whether those deviations represent mistakes or physicians’ superior knowledge. This approach reveals two inefficiencies. Physicians over-test: predictably low-risk patients are tested, but do not benefit. At the same time, physicians undertest: predictably high-risk patients are left untested, and then go on to suffer adverse health events including death. A natural experiment using shift-to-shift testing variation confirms these findings. Simultaneous over- and under-testing cannot easily be explained by incentives alone, and instead point to systematic errors in judgment. We provide suggestive evidence on the psychology underlying these errors. First, physicians use too simple a model of risk. Second, they overweight factors that are salient or representative of heart attack, such as chest pain. We argue health care models must incorporate physician error, and illustrate how policies focused solely on incentive problems can produce large inefficiencies.

Keywords: Machine Learning; Healthcare Efficiency; Physician Decision-Making; Heart Attack Diagnosis

JEL Codes: C55; D81; I13

Causal Claims Network Graph

Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.

Causal Claims

Cause	Effect
systematic errors in judgment (D91)	overtesting and undertesting (C52)
physician testing decisions (I11)	adverse health outcomes (I14)
timing of patient visits (C41)	testing rates (C12)
testing rates (C12)	health outcomes (I14)
physician testing decisions (I11)	unnecessary testing (C52)
unnecessary testing (C52)	high costs without corresponding health benefits (H51)

Back to index