Reducing uncertainty in the american community survey through data-driven regionalization

PLoS One. 2015 Feb 27;10(2):e0115626. doi: 10.1371/journal.pone.0115626. eCollection 2015.

Abstract

The American Community Survey (ACS) is the largest survey of US households and is the principal source for neighborhood scale information about the US population and economy. The ACS is used to allocate billions in federal spending and is a critical input to social scientific research in the US. However, estimates from the ACS can be highly unreliable. For example, in over 72% of census tracts, the estimated number of children under 5 in poverty has a margin of error greater than the estimate. Uncertainty of this magnitude complicates the use of social data in policy making, research, and governance. This article presents a heuristic spatial optimization algorithm that is capable of reducing the margins of error in survey data via the creation of new composite geographies, a process called regionalization. Regionalization is a complex combinatorial problem. Here rather than focusing on the technical aspects of regionalization we demonstrate how to use a purpose built open source regionalization algorithm to process survey data in order to reduce the margins of error to a user-specified threshold.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Censuses
  • Cities
  • Data Collection / methods*
  • Data Collection / standards*
  • Family Characteristics*
  • Geography
  • Humans
  • Residence Characteristics*
  • Spatial Analysis
  • Uncertainty*
  • United States

Grants and funding

This work was supported by the National Science Foundation (award number 113238). This work utilized the Janus supercomputer, which is supported by the National Science Foundation (award number CNS-0821794) and the University of Colorado Boulder. The Janus supercomputer is a joint effort of the University of Colorado Boulder, the University of Colorado Denver and the National Center for Atmospheric Research. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.