460 research outputs found

    A Primer on the Data Cleaning Pipeline

    Full text link
    The availability of both structured and unstructured databases, such as electronic health data, social media data, patent data, and surveys that are often updated in real time, among others, has grown rapidly over the past decade. With this expansion, the statistical and methodological questions around data integration, or rather merging multiple data sources, has also grown. Specifically, the science of the ``data cleaning pipeline'' contains four stages that allow an analyst to perform downstream tasks, predictive analyses, or statistical analyses on ``cleaned data.'' This article provides a review of this emerging field, introducing technical terminology and commonly used methods

    Towards Bayesian Model-Based Demography

    Get PDF
    This open access book presents a ground-breaking approach to developing micro-foundations for demography and migration studies. It offers a unique and novel methodology for creating empirically grounded agent-based models of international migration – one of the most uncertain population processes and a top-priority policy area. The book discusses in detail the process of building a simulation model of migration, based on a population of intelligent, cognitive agents, their networks and institutions, all interacting with one another. The proposed model-based approach integrates behavioural and social theory with formal modelling, by embedding the interdisciplinary modelling process within a wider inductive framework based on the Bayesian statistical reasoning. Principles of uncertainty quantification are used to devise innovative computer-based simulations, and to learn about modelling the simulated individuals and the way they make decisions. The identified knowledge gaps are subsequently filled with information from dedicated laboratory experiments on cognitive aspects of human decision-making under uncertainty. In this way, the models are built iteratively, from the bottom up, filling an important epistemological gap in migration studies, and social sciences more broadly

    Towards Bayesian Model-Based Demography

    Get PDF
    This open access book presents a ground-breaking approach to developing micro-foundations for demography and migration studies. It offers a unique and novel methodology for creating empirically grounded agent-based models of international migration – one of the most uncertain population processes and a top-priority policy area. The book discusses in detail the process of building a simulation model of migration, based on a population of intelligent, cognitive agents, their networks and institutions, all interacting with one another. The proposed model-based approach integrates behavioural and social theory with formal modelling, by embedding the interdisciplinary modelling process within a wider inductive framework based on the Bayesian statistical reasoning. Principles of uncertainty quantification are used to devise innovative computer-based simulations, and to learn about modelling the simulated individuals and the way they make decisions. The identified knowledge gaps are subsequently filled with information from dedicated laboratory experiments on cognitive aspects of human decision-making under uncertainty. In this way, the models are built iteratively, from the bottom up, filling an important epistemological gap in migration studies, and social sciences more broadly

    When Does Internet Denial Trigger the Right of Armed Self-Defense?

    Get PDF
    Amid a 2007 dispute with Russia, Estonia suffered a series of distributed denial-of-service (DDoS) cyber-attacks that disabled the websites of government ministries, political parties, news outlets, banks, and other firms for several weeks. The attacks employed digital bots to overload Estonia\u27s Internet infrastructure with an overwhelming stream of data packets, which caused serious service and communications disruptions before abruptly coming to a halt. During the initial stages, Estonia\u27s Computer Emergency Response Team (E-CERT) traced the attacks to I.P. addresses belonging to Russian nationalist groups, but was unable to establish direct participation by Moscow. Subsequent evidence suggested, however, that the attacks were tied to the Kremlin
    • …
    corecore