Working Paper: CEPR ID: DP14450
Authors: Elisabet Viladecans-Marsal; Miquel Angel Garcia Lopez; Daniel Arribas-Bel
Abstract: This paper proposes a novel methodology for delineating urban areas based on a machine learning algorithm that groups build-ings within portions of space of sufficient density. To do so, we use the precise geolocation of all 12 million buildings in Spain. We exploit building heights to create a new dimension for urban areas, namely, the vertical land, which provides a more accurate measure of their size. To better understand their internal structure and to illustrate an additional use for our algorithm, we also identify employment centers within the delineated urban areas. We test the robustness of our method and compare our urban areas to other delineations obtained using admin-istrative borders and commuting-based patterns. We show that: 1) our urban areas are more similar to the commuting-based delineations than the administrative boundaries but that they are more precisely measured; 2) when analyzing the urban areas’ size distribution, Zipf’s law appears to hold for their population, surface and vertical land; and 3) the impact of transportation improvements on the size of the urban areas is not underestimated.
Keywords: buildings; urban areas; city size; transportation; machine learning
JEL Codes: R12; R14; R2; R40
Edges that are evidenced by causal inference methods are in orange, and the rest are in light blue.
Cause | Effect |
---|---|
delineation method (C80) | accuracy of urban area definitions (R11) |
urban area characteristics (R11) | size distribution (D39) |
transportation improvements (R42) | size of urban areas (R12) |