304 research outputs found

    Reconstruction of Age Distributions from Differentially Private Census data

    Get PDF
    The age distribution of a population is important for understanding the demand and provision of labor and services, and as a denominator for calculating key agespecific rates such as fertility and mortality. In the US, the most important source of information on age distributions is the decennial census, but a new disclosure avoidance system (DAS) based on differential privacy will inject noise into the data, potentially compromising its utility for small areas and minority populations. In this paper, we explore the question whether there are statistical methods that can be applied to noisy age distributions to enhance the research uses of census data without compromising privacy. We apply a non-parametric method for smoothing with naive or informative priors to age distributions from the 2010 Census via demonstration data which have had the US Census Bureau’s implementation of differential privacy applied. We find that smoothing age distributions can increase the fidelity of the demonstration data to previously published population counts by age. We discuss implications for uses of data from the 2020 US Census and potential consequences for the measurement of population dynamics, health, and disparities

    Impact of traffic, poverty and facility ownership on travel time to emergency care in Nairobi, Kenya

    Get PDF
    Background: In many low and middle-income countries (LMICs), timely access to emergency healthcare services is limited. In urban settings, traffic can have a significant impact on travel time, leading to life-threatening delays for time-sensitive injuries and medical emergencies. In this study, we examined travel times to hospitals in Nairobi, Kenya, one of the largest and most congested cities in the developing world. Methods: We used a network approach to estimate average minimum travel times to different types of hospitals (e.g. ownership and level of care) in Nairobi under both congested and uncongested traffic conditions. We also examined the correlation between travel time and socioeconomic status. Results: We estimate the average minimum travel time during uncongested traffic conditions to any level 4 health facility (primary hospitals) or above in Nairobi to be 4.5 min (IQR 2.5–6.1). Traffic added an average of 9.0 min (a 200% increase). In uncongested conditions, we estimate an average travel time of 7.9 min (IQR 5.1–10.4) to level 5 facilities (secondary hospitals) and 11.6 min (IQR 8.5–14.2) to Kenyatta National Hospital, the only level 6 facility (tertiary hospital) in the country. Traffic congestion added an average of 13.1 and 16.0 min (166% and 138% increase) to travel times to level 5 and level 6 facilities, respectively. For individuals living below the poverty line, we estimate that preferential use of public or faith-based facilities could increase travel time by as much as 65%. Conclusion: Average travel times to health facilities capable of providing emergency care in Nairobi are quite low, but traffic congestion double or triple estimated travel times. Furthermore, we estimate significant disparities in timely access to care for those individuals living under the poverty line who preferentially seek care in public or faith-based facilities

    Performance of the Tariff Method: validation of a simple additive algorithm for analysis of verbal autopsies

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Verbal autopsies provide valuable information for studying mortality patterns in populations that lack reliable vital registration data. Methods for transforming verbal autopsy results into meaningful information for health workers and policymakers, however, are often costly or complicated to use. We present a simple additive algorithm, the Tariff Method (termed Tariff), which can be used for assigning individual cause of death and for determining cause-specific mortality fractions (CSMFs) from verbal autopsy data.</p> <p>Methods</p> <p>Tariff calculates a score, or "tariff," for each cause, for each sign/symptom, across a pool of validated verbal autopsy data. The tariffs are summed for a given response pattern in a verbal autopsy, and this sum (score) provides the basis for predicting the cause of death in a dataset. We implemented this algorithm and evaluated the method's predictive ability, both in terms of chance-corrected concordance at the individual cause assignment level and in terms of CSMF accuracy at the population level. The analysis was conducted separately for adult, child, and neonatal verbal autopsies across 500 pairs of train-test validation verbal autopsy data.</p> <p>Results</p> <p>Tariff is capable of outperforming physician-certified verbal autopsy in most cases. In terms of chance-corrected concordance, the method achieves 44.5% in adults, 39% in children, and 23.9% in neonates. CSMF accuracy was 0.745 in adults, 0.709 in children, and 0.679 in neonates.</p> <p>Conclusions</p> <p>Verbal autopsies can be an efficient means of obtaining cause of death data, and Tariff provides an intuitive, reliable method for generating individual cause assignment and CSMFs. The method is transparent and flexible and can be readily implemented by users without training in statistics or computer science.</p

    A Two-Stage Cluster Sampling Method Using Gridded Population Data, A GIS, And Google Earthtm Imagery in a Population-Based Mortality Survey in Iraq

    Get PDF
    Background Mortality estimates can measure and monitor the impacts of conflict on a population, guide humanitarian efforts, and help to better understand the public health impacts of conflict. Vital statistics registration and surveillance systems are rarely functional in conflict settings, posing a challenge of estimating mortality using retrospective population-based surveys. Results We present a two-stage cluster sampling method for application in population-based mortality surveys. The sampling method utilizes gridded population data and a geographic information system (GIS) to select clusters in the first sampling stage and Google Earth TM imagery and sampling grids to select households in the second sampling stage. The sampling method is implemented in a household mortality study in Iraq in 2011. Factors affecting feasibility and methodological quality are described. Conclusion Sampling is a challenge in retrospective population-based mortality studies and alternatives that improve on the conventional approaches are needed. The sampling strategy presented here was designed to generate a representative sample of the Iraqi population while reducing the potential for bias and considering the context specific challenges of the study setting. This sampling strategy, or variations on it, are adaptable and should be considered and tested in other conflict settings

    Trust-based recommendation systems: an axiomatic approach

    Get PDF
    ABSTRACT High-quality, personalized recommendations are a key feature in many online systems. Since these systems often have explicit knowledge of social network structures, the recommendations may incorporate this information. This paper focuses on networks which represent trust and recommendations which incorporate trust relationships. The goal of a trust-based recommendation system is to generate personalized recommendations from known opinions and trust relationships. In analogy to prior work on voting and ranking systems, we use the axiomatic approach from the theory of social choice. We develop an natural set of five axioms which we desire any recommendation system exhibit. Then we show that no system can simultaneously satisfy all these axioms. We also exhibit systems which satisfy any four of the five axioms. Next we consider ways of weakening the axioms, which can lead to a unique recommendation system based on random walks. We consider other recommendation systems (personal page rank, majority of majorities, and min cut) and search for alternative axiomatizations which uniquely characterize these systems. Finally, we determine which of these systems are incentive compatible. This is an important property for systems deployed in a monetized environment: groups of agents interested in manipulating recommendations to make others share their opinion have nothing to gain from lying about their votes or their trust links

    A Two-Stage Cluster Sampling Method Using Gridded Population Data, a GIS, and Google Earth(TM) Imagery in a Population-Based Mortality Survey in Iraq

    Get PDF
    BACKGROUND: Mortality estimates can measure and monitor the impacts of conflict on a population, guide humanitarian efforts, and help to better understand the public health impacts of conflict. Vital statistics registration and surveillance systems are rarely functional in conflict settings, posing a challenge of estimating mortality using retrospective population-based surveys. RESULTS: We present a two-stage cluster sampling method for application in population-based mortality surveys. The sampling method utilizes gridded population data and a geographic information system (GIS) to select clusters in the first sampling stage and Google Earth TM imagery and sampling grids to select households in the second sampling stage. The sampling method is implemented in a household mortality study in Iraq in 2011. Factors affecting feasibility and methodological quality are described. CONCLUSION: Sampling is a challenge in retrospective population-based mortality studies and alternatives that improve on the conventional approaches are needed. The sampling strategy presented here was designed to generate a representative sample of the Iraqi population while reducing the potential for bias and considering the context specific challenges of the study setting. This sampling strategy, or variations on it, are adaptable and should be considered and tested in other conflict settings

    Direct estimation of cause-specific mortality fractions from verbal autopsies: multisite validation study using clinical diagnostic gold standards

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Verbal autopsy (VA) is used to estimate the causes of death in areas with incomplete vital registration systems. The King and Lu method (KL) for direct estimation of cause-specific mortality fractions (CSMFs) from VA studies is an analysis technique that estimates CSMFs in a population without predicting individual-level cause of death as an intermediate step. In previous studies, KL has shown promise as an alternative to physician-certified verbal autopsy (PCVA). However, it has previously been impossible to validate KL with a large dataset of VAs for which the underlying cause of death is known to meet rigorous clinical diagnostic criteria.</p> <p>Methods</p> <p>We applied the KL method to adult, child, and neonatal VA datasets from the Population Health Metrics Research Consortium gold standard verbal autopsy validation study, a multisite sample of 12,542 VAs where gold standard cause of death was established using strict clinical diagnostic criteria. To emulate real-world populations with varying CSMFs, we evaluated the KL estimations for 500 different test datasets of varying cause distribution. We assessed the quality of these estimates in terms of CSMF accuracy as well as linear regression and compared this with the results of PCVA.</p> <p>Results</p> <p>KL performance is similar to PCVA in terms of CSMF accuracy, attaining values of 0.669, 0.698, and 0.795 for adult, child, and neonatal age groups, respectively, when health care experience (HCE) items were included. We found that the length of the cause list has a dramatic effect on KL estimation quality, with CSMF accuracy decreasing substantially as the length of the cause list increases. We found that KL is not reliant on HCE the way PCVA is, and without HCE, KL outperforms PCVA for all age groups.</p> <p>Conclusions</p> <p>Like all computer methods for VA analysis, KL is faster and cheaper than PCVA. Since it is a direct estimation technique, though, it does not produce individual-level predictions. KL estimates are of similar quality to PCVA and slightly better in most cases. Compared to other recently developed methods, however, KL would only be the preferred technique when the cause list is short and individual-level predictions are not needed.</p

    The paradox of verbal autopsy in cause of death assignment: symptom question unreliability but predictive accuracy

    Get PDF
    Background: We believe that it is important that governments understand the reliability of the mortality data which they have at their disposable to guide policy debates. In many instances, verbal autopsy (VA) will be the only source of mortality data for populations, yet little is known about how the accuracy of VA diagnoses is affected by the reliability of the symptom responses. We previously described the effect of the duration of time between death and VA administration on VA validity. In this paper, using the same dataset, we assess the relationship between the reliability and completeness of symptom responses and the reliability and accuracy of cause of death (COD) prediction. Methods: The study was based on VAs in the Population Health Metrics Research Consortium (PHMRC) VA Validation Dataset from study sites in Bohol and Manila, Philippines and Andhra Pradesh, India. The initial interview was repeated within 3-52 months of death. Question responses were assessed for reliability and completeness between the two survey rounds. COD was predicted by Tariff Method. Results: A sample of 4226 VAs was collected for 2113 decedents, including 1394 adults, 349 children, and 370 neonates. Mean question reliability was unexpectedly low (kappa = 0.447): 42.5 % of responses positive at the first interview were negative at the second, and 47.9 % of responses positive at the second had been negative at the first. Question reliability was greater for the short form of the PHMRC instrument (kappa = 0.497) and when analyzed at the level of the individual decedent (kappa = 0.610). Reliability at the level of the individual decedent was associated with COD predictive reliability and predictive accuracy. Conclusions: Families give coherent accounts of events leading to death but the details vary from interview to interview for the same case. Accounts are accurate but inconsistent; different subsets of symptoms are identified on each occasion. However, there are sufficient accurate and consistent subsets of symptoms to enable the Tariff Method to assign a COD. Questions which contributed most to COD prediction were also the most reliable and consistent across repeat interviews; these have been included in the short form VA questionnaire. Accuracy and reliability of diagnosis for an individual death depend on the quality of interview. This has considerable implications for the progressive roll out of VAs into civil registration and vital statistics (CRVS) systems
    corecore