5 research outputs found

    Reducing Uncertainty in the American Community Survey through Data-Driven Regionalization

    Full text link
    The American Community Survey (ACS) is the largest US survey of households and is the principal source for neighborhood scale information about the US population and economy. The ACS is used to allocate billions in federal spending and is a critical input to social scientific research in the US. However, estimates from the ACS can be highly unreliable. For example, in over 72% of census tracts the estimated number of children under 5 in poverty has a margin of error greater than the estimate. Uncertainty of this magnitude complicates the use of social data in policy making, research, and governance. This article develops a spatial optimization algorithm that is capable of reducing the margins of error in survey data via the creation of new composite geographies, a process called regionalization. Regionalization is a complex combinatorial problem. Here rather than focusing on the technical aspects of regionalization we demonstrate how to use a purpose built open source regionalization algorithm to post-process survey data in order to reduce the margins of error to some user-specified threshold

    Effects of a Government-Academic Partnership: Has the NSF-Census Bureau Research Network Helped Improve the U.S. Statistical System?

    Get PDF
    The National Science Foundation-Census Bureau Research Network (NCRN) was established in 2011 to create interdisciplinary research nodes on methodological questions of interest and significance to the broader research community and to the Federal Statistical System (FSS), particularly to the Census Bureau. The activities to date have covered both fundamental and applied statistical research and have focused at least in part on the training of current and future generations of researchers in skills of relevance to surveys and alternative measurement of economic units, households, and persons. This article focuses on some of the key research findings of the eight nodes, organized into six topics: (1) improving census and survey data-quality and data collection methods; (2) using alternative sources of data; (3) protecting privacy and confidentiality by improving disclosure avoidance; (4) using spatial and spatio-temporal statistical modeling to improve estimates; (5) assessing data cost and data-quality tradeoffs; and (6) combining information from multiple sources. The article concludes with an evaluation of the ability of the FSS to apply the NCRN’s research outcomes, suggests some next steps, and discusses the implications of this research-network model for future federal government research initiatives

    Social Space and Social Media: Analyzing Urban Space with Big Data

    Get PDF
    This dissertation focuses on the key role that big data can play in minimizing the perceived disconnect between social theory and quantitative methods in the discipline of geography. It takes as its starting point the geographic concept of space, which is conceptualized very differently in social theory versus quantitative methodology. Contrary to this disparity, an examination of the disciplinary history reveals a number of historic precedents and potential pathways for a rapprochement, especially when combined with some of the new possibilities of big data. This dissertation also proposes solutions to two common barriers to the adoption of big data in the social sciences: accessing and collecting such data and, subsequently, meaningful analysis. These methods and the theoretical foundation are combined in three case studies that show the successful integration of a quantitative research methodology with social theories on space. The case studies demonstrate how such an approach can create new and alternative understandings of urban space. In doing so it answers three specific research questions: (1) How can big data facilitate the integration of social theory on space with quantitative research methodology? (2) What are the practical challenges and solutions to moving “beyond the geotag” when utilizing big data in geographical research? (3) How can the quantitative analysis of big data provide new and useful insight in the complex character of social space? More specifically, what insights does such an analysis of relational social space provide about urban mobility and cognitive neighborhoods
    corecore