314 research outputs found

    Calibration estimation in dual-frame surveys

    Survey statisticians make use of auxiliary information to improve estimates. One important example is calibration estimation, which constructs new weights that match benchmark constraints on auxiliary variables while remaining “close” to the design weights. Multiple-frame surveys are increasingly used by statistical agencies and private organizations to reduce sampling costs and/or avoid frame undercoverage errors. Several ways of combining estimates derived from such frames have been proposed elsewhere; in this paper, we extend the calibration paradigm, previously used for single-frame surveys, to estimate the total of a variable of interest in a dual-frame survey. Calibration is a general tool that allows auxiliary information from the two frames to be included. It also incorporates, as special cases, certain dual-frame estimators that have been proposed previously. The theoretical properties of our class of estimators are derived and discussed, and simulation studies are conducted to compare the efficiency of the procedure using different sets of auxiliary variables. Finally, the proposed methodology is applied to real data obtained from the Barometer of Culture of Andalusia survey. Funding: Ministerio de Educación y Ciencia; Consejería de Economía, Innovación, Ciencia y Empleo; PRIN-SURWE.
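
To make the idea of calibration weights concrete, here is a minimal sketch of linear (chi-square distance) calibration to a single known auxiliary total. It is the textbook single-frame case, not the dual-frame estimator proposed in the paper, and the function name, the toy sample and the benchmark total of 110 are all hypothetical.

```python
def linear_calibration_weights(design_weights, x, x_total):
    """Return weights w_i = d_i * (1 + lam * x_i) whose weighted x-total equals x_total.

    This is single-frame linear calibration with one scalar auxiliary variable;
    it is an illustration only, not the dual-frame method from the abstract.
    """
    # Design-weighted (Horvitz-Thompson) estimate of the auxiliary total.
    ht_total = sum(d * xi for d, xi in zip(design_weights, x))
    # Design-weighted sum of squares; it scales the correction factor.
    denom = sum(d * xi * xi for d, xi in zip(design_weights, x))
    if denom == 0:
        raise ValueError("auxiliary variable is zero for every sampled unit")
    lam = (x_total - ht_total) / denom
    return [d * (1.0 + lam * xi) for d, xi in zip(design_weights, x)]


# Hypothetical toy sample: four units, each with design weight 10.
d = [10.0, 10.0, 10.0, 10.0]
x = [1.0, 2.0, 3.0, 4.0]   # auxiliary variable, population total assumed known
y = [2.0, 4.1, 5.9, 8.2]   # variable of interest

w = linear_calibration_weights(d, x, x_total=110.0)
calibrated_x_total = sum(wi * xi for wi, xi in zip(w, x))
calibrated_y_total = sum(wi * yi for wi, yi in zip(w, y))
```

The design weights alone give an auxiliary total of 100, so the calibrated weights are nudged upward until the benchmark of 110 is reproduced exactly; the same weights are then applied to the variable of interest.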

    Kernel-based methods for combining information of several frame surveys

    A sample selected from a single sampling frame may not adequately represent the entire population. Multiple frame surveys are increasingly popular among statistical agencies and private organizations, in particular in situations where several sampling frames may provide better coverage or can reduce sampling costs for estimating population quantities of interest. Auxiliary information available at the population level is often categorical in nature, so incorporating both categorical and continuous information can improve the efficiency of the estimation method. Nonparametric regression methods represent a widely used and flexible estimation approach in the survey context. We propose a kernel regression estimator for dual frame surveys that can handle both continuous and categorical data. This methodology is extended to multiple frame surveys. We derive theoretical properties of the proposed methods, and numerical experiments indicate that the proposed estimator performs well in practical settings under different scenarios. Funding: Ministerio de Economía y Competitividad; Consejería de Economía, Innovación, Ciencia y Emple

    Inter-rater reliability of data elements from a prototype of the Paul Coverdell National Acute Stroke Registry

    Background: The Paul Coverdell National Acute Stroke Registry (PCNASR) is a U.S. based national registry designed to monitor and improve the quality of acute stroke care delivered by hospitals. The registry monitors care through specific performance measures, the accuracy of which depends in part on the reliability of the individual data elements used to construct them. This study describes the inter-rater reliability of data elements collected in Michigan's state-based prototype of the PCNASR. Methods: Over a 6-month period, 15 hospitals participating in the Michigan PCNASR prototype submitted data on 2566 acute stroke admissions. Trained hospital staff prospectively identified acute stroke admissions, abstracted chart information, and submitted data to the registry. At each hospital 8 randomly selected cases were re-abstracted by an experienced research nurse. Inter-rater reliability was estimated by the kappa statistic for nominal variables, and intraclass correlation coefficient (ICC) for ordinal and continuous variables. Factors that can negatively impact the kappa statistic (i.e., trait prevalence and rater bias) were also evaluated. Results: A total of 104 charts were available for re-abstraction. Excellent reliability (kappa or ICC > 0.75) was observed for many registry variables including age, gender, black race, hemorrhagic stroke, discharge medications, and modified Rankin Score. Agreement was at least moderate (i.e., 0.75 > kappa ≥ 0.40) for ischemic stroke, TIA, white race, non-ambulance arrival, hospital transfer and direct admit. However, several variables had poor reliability (kappa < 0.40) including stroke onset time, stroke team consultation, time of initial brain imaging, and discharge destination. There were marked systematic differences between hospital abstractors and the audit abstractor (i.e., rater bias) for many of the data elements recorded in the emergency department. Conclusion: The excellent reliability of many of the data elements supports the use of the PCNASR to monitor and improve care. However, the poor reliability for several variables, particularly time-related events in the emergency department, indicates the need for concerted efforts to improve the quality of data collection. Specific recommendations include improvements to data definitions, abstractor training, and the development of ED-based real-time data collection systems.
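
As a rough illustration of the kappa statistic used for the nominal variables, the sketch below computes Cohen's kappa for two raters from first principles. The function name and the eight paired ratings are hypothetical and are not taken from the registry data; the study's own software and any adjustments for prevalence or rater bias are not reproduced here.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters assigning nominal categories to the same cases."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must rate the same non-empty set of cases")
    n = len(rater_a)
    # Observed proportion of cases on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's marginal category frequencies.
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical example: one binary data element abstracted twice for 8 charts.
hospital = ["yes", "yes", "no", "no", "yes", "no", "yes", "no"]
audit = ["yes", "no", "no", "no", "yes", "no", "yes", "yes"]
kappa = cohens_kappa(hospital, audit)
```

In this toy case the raters agree on 6 of 8 charts (0.75) while chance agreement is 0.5, so kappa is 0.5, which would fall in the "at least moderate" band (0.75 > kappa ≥ 0.40) used in the abstract.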

    Discovery and saturation analysis of cancer genes across 21 tumour types

    Although a few cancer genes are mutated in a high proportion of tumours of a given type (>20%), most are mutated at intermediate frequencies (2–20%). To explore the feasibility of creating a comprehensive catalogue of cancer genes, we analysed somatic point mutations in exome sequences from 4,742 human cancers and their matched normal-tissue samples across 21 cancer types. We found that large-scale genomic analysis can identify nearly all known cancer genes in these tumour types. Our analysis also identified 33 genes that were not previously known to be significantly mutated in cancer, including genes related to proliferation, apoptosis, genome stability, chromatin regulation, immune evasion, RNA processing and protein homeostasis. Down-sampling analysis indicates that larger sample sizes will reveal many more genes mutated at clinically important frequencies. We estimate that near-saturation may be achieved with 600–5,000 samples per tumour type, depending on background mutation frequency. The results may help to guide the next stage of cancer genomics.

    Need-based resource allocation: different need indicators, different results?

    Background: A key policy objective in most publicly financed health care systems is to allocate resources according to need. Many jurisdictions implement this policy objective through need-based allocation models. To date, no gold standard exists for selecting need indicators. In the absence of a gold standard, the sensitivity of results to the choice of need indicators is of concern. The primary objective of this study was to assess the consistency and plausibility of estimates of per capita relative need for health services across Canadian provinces based on different need indicators. Methods: Using the 2000/2001 Canadian Community Health Survey, we estimated relative per capita need for general practitioner, specialist, and hospital services by province using two approaches that incorporated different sets of need indicators: (1) demographics (age and sex), and (2) demographics, socioeconomic status, and health status. For both approaches, we first fitted regression models to estimate standard utilization of each of the three types of health services by indicators of need. We defined the standard as average levels of utilization by need indicators in the national sample. Subsequently, we estimated expected per capita utilization of each type of health service in each province. We compared these estimates of per capita relative need with premature mortality in each province to check their face validity. Results: Both approaches suggested that expected relative per capita need for the three services varies across provinces. Different approaches, however, yielded different and inconsistent results. Moreover, provincial per capita relative need for the three health services did not always indicate the same direction of need suggested by premature mortality in each province. In particular, the two approaches suggested Newfoundland had less need than the Canadian average for all three services, but it had the highest premature mortality in Canada. Conclusion: Substantial differences in need for health care may exist across Canadian provinces, but the direction and magnitude of differences depend on the need indicators used. Allocations from models using survey data lacked face validity for some provinces. These results point to the need to better understand the biases that may result from the use of survey data for resource allocation.

    Changes in pulmonary tuberculosis prevalence: Evidence from the 2010 population survey in a populous province of China

    Background: This paper reports findings from the prevalence survey conducted in 2010 in Shandong, China, a province with a population of 94 million. This study aimed to estimate the TB prevalence of the province in 2010 in comparison with the 2000 survey, and to compare yields of TB cases from different case-finding approaches. Methods: A population-based, cross-sectional survey was conducted using multi-stage random cluster sampling. 54,279 adults participated in the survey, with a response rate of 96%. Doctors interviewed participants and classified them as suspected TB cases if they presented with persistent cough, abnormal chest X-ray (CXRAY), or both. Three sputum specimens from each suspected case were collected and sent for smear microscopy and culture. Results: The adjusted prevalence rate of bacteriologically confirmed cases was 34 per 100,000 for adults in Shandong in 2010. Compared to the 2000 survey, TB prevalence has declined by 80%. 53% of bacteriologically confirmed cases did not present with persistent cough. The yield of bacteriologically confirmed cases was 47% by symptom screening and 95% by CXRAY. Over 50% of TB cases were among people aged over 65. Conclusions: The prevalence rate of bacteriologically confirmed cases was significantly reduced compared with 2000. The survey highlighted the challenge of identifying TB cases without clear symptoms.

    How to survey displaced workers in Switzerland? Sources of bias and ways around them

    Studying career outcomes after job loss is challenging because individually displaced workers form a self-selected group. Indeed, the same factors that cause workers to lose their jobs, such as lack of motivation, may also reduce their re-employment prospects. Using data from plant closures, where all workers were displaced irrespective of their individual characteristics, offers a way around this selection bias. There is no systematic data collection on workers displaced by plant closure in Switzerland. Accordingly, we conducted our own survey of 1200 manufacturing workers who had lost their jobs 2 years earlier. The analysis of observational data gives rise to a set of methodological challenges, in particular nonresponse bias. Our survey addressed this issue by mixing data collection modes and repeating contact attempts. In addition, we combined the survey data with data from the public unemployment register to examine the extent of nonresponse bias. Our analysis suggests that some of our adjustments helped to reduce bias. Repeated contact attempts increased the response rate, but did not reduce nonresponse bias. In contrast, using telephone interviews in addition to paper questionnaires helped to substantially improve the participation of typically underrepresented subgroups. However, the survey respondents still differ from nonrespondents in terms of age, education and occupation. Interestingly, these differences have no significant impact on the substantive conclusion about displaced workers' re-employment prospects.