987 research outputs found
Robust Estimation for Linear Panel Data Models
In different fields of applications including, but not limited to,
behavioral, environmental, medical sciences and econometrics, the use of panel
data regression models has become increasingly popular as a general framework
for making meaningful statistical inferences. However, when the ordinary least
squares (OLS) method is used to estimate the model parameters, presence of
outliers may significantly alter the adequacy of such models by producing
biased and inefficient estimates. In this work we propose a new, weighted
likelihood based robust estimation procedure for linear panel data models with
fixed and random effects. The finite sample performances of the proposed
estimators have been illustrated through an extensive simulation study as well
as with an application to blood pressure data set. Our thorough study
demonstrates that the proposed estimators show significantly better
performances over the traditional methods in the presence of outliers and
produce competitive results to the OLS based estimates when no outliers are
present in the data set
Disease prevention versus data privacy : using landcover maps to inform spatial epidemic models
The availability of epidemiological data in the early stages of an outbreak of an infectious disease is vital for modelers to make accurate predictions regarding the likely spread of disease and preferred intervention strategies. However, in some countries, the necessary demographic data are only available at an aggregate scale. We investigated the ability of models of livestock infectious diseases to predict epidemic spread and obtain optimal control policies in the event of imperfect, aggregated data. Taking a geographic information approach, we used land cover data to predict UK farm locations and investigated the influence of using these synthetic location data sets upon epidemiological predictions in the event of an outbreak of foot-and-mouth disease. When broadly classified land cover data were used to create synthetic farm locations, model predictions deviated significantly from those simulated on true data. However, when more resolved subclass land use data were used, moderate to highly accurate predictions of epidemic size, duration and optimal vaccination and ring culling strategies were obtained. This suggests that a geographic information approach may be useful where individual farm-level data are not available, to allow predictive analyses to be carried out regarding the likely spread of disease. This method can also be used for contingency planning in collaboration with policy makers to determine preferred control strategies in the event of a future outbreak of infectious disease in livestock
Application of longitudinal data analysis allows to detect differences in pre‐breeding growing curves of 24‐month calving Angus heifers under two pasture‐based system with differential puberty onset
Background. Longitudinal data analysis contributes to detect differences in the growing curve by exploiting all the information involved in repeated measurements, allowing to distinguish changes over time within individuals, from differences in the baseline levels among groups. In this research longitudinal and cross-sectional analysis were compared to evaluate differences in growth in Angus heifers under two different grazing conditions, ad libitum (AG) and controlled (CG) to gain 0.5 kg/day. Results. Longitudinal mixed models show differences in growing curves parameters between grazing conditions, that were not detected by cross sectional analysis. Differences (P < 0.05) in first derivative of growth curves (daily gain) until 289 days were observed between treatments, being AG higher than CG. Correspondingly, pubertal heifer proportion was also higher in AG at the end of rearing (AG 0.94; CG 0.67). Conclusion. In longitudinal studies, the power to detect differences between groups increases by exploiting the whole information of repeated measures, modelling the relation between measurements performed on the same individual. Under a proper analysis valid conclusion can be drawn with less animals in the trial, improving animal welfare and reducing investigation costs.Fil: Bonamy, Martin. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico CONICET- La Plata. Instituto de Genética Veterinaria "Ing. Fernando Noel Dulout". Universidad Nacional de La Plata. Facultad de Ciencias Veterinarias. Instituto de Genética Veterinaria; ArgentinaFil: de Iraola, Julieta Josefina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico CONICET- La Plata. Instituto de Genética Veterinaria "Ing. Fernando Noel Dulout". Universidad Nacional de La Plata. Facultad de Ciencias Veterinarias. Instituto de Genética Veterinaria; ArgentinaFil: Prando, Alberto José. Universidad Nacional de La Plata. Facultad de Ciencias Veterinarias; ArgentinaFil: Baldo, Andres. Universidad Nacional de La Plata. Facultad de Ciencias Veterinarias; ArgentinaFil: Giovambattista, Guillermo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico CONICET- La Plata. Instituto de Genética Veterinaria "Ing. Fernando Noel Dulout". Universidad Nacional de La Plata. Facultad de Ciencias Veterinarias. Instituto de Genética Veterinaria; ArgentinaFil: Rogberg Muñoz, Andres. Universidad de Buenos Aires. Facultad de Agronomía. Departamento de Producción Animal; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico CONICET- La Plata. Instituto de Genética Veterinaria "Ing. Fernando Noel Dulout". Universidad Nacional de La Plata. Facultad de Ciencias Veterinarias. Instituto de Genética Veterinaria; Argentin
Asthma exacerbation and proximity of residence to major roads: a population-based matched case-control study among the pediatric Medicaid population in Detroit, Michigan
<p>Abstract</p> <p>Background</p> <p>The relationship between asthma and traffic-related pollutants has received considerable attention. The use of individual-level exposure measures, such as residence location or proximity to emission sources, may avoid ecological biases.</p> <p>Method</p> <p>This study focused on the pediatric Medicaid population in Detroit, MI, a high-risk population for asthma-related events. A population-based matched case-control analysis was used to investigate associations between acute asthma outcomes and proximity of residence to major roads, including freeways. Asthma cases were identified as all children who made at least one asthma claim, including inpatient and emergency department visits, during the three-year study period, 2004-06. Individually matched controls were randomly selected from the rest of the Medicaid population on the basis of non-respiratory related illness. We used conditional logistic regression with distance as both categorical and continuous variables, and examined non-linear relationships with distance using polynomial splines. The conditional logistic regression models were then extended by considering multiple asthma states (based on the frequency of acute asthma outcomes) using polychotomous conditional logistic regression.</p> <p>Results</p> <p>Asthma events were associated with proximity to primary roads with an odds ratio of 0.97 (95% CI: 0.94, 0.99) for a 1 km increase in distance using conditional logistic regression, implying that asthma events are less likely as the distance between the residence and a primary road increases. Similar relationships and effect sizes were found using polychotomous conditional logistic regression. Another plausible exposure metric, a reduced form response surface model that represents atmospheric dispersion of pollutants from roads, was not associated under that exposure model.</p> <p>Conclusions</p> <p>There is moderately strong evidence of elevated risk of asthma close to major roads based on the results obtained in this population-based matched case-control study.</p
Robust automatic mapping algorithms in a network monitoring scenario
Automatically generating maps of a measured variable of interest can be problematic. In this work we focus on the monitoring network context where observations are collected and reported by a network of sensors, and are then transformed into interpolated maps for use in decision making. Using traditional geostatistical methods, estimating the covariance structure of data collected in an emergency situation can be difficult. Variogram determination, whether by method-of-moment estimators or by maximum likelihood, is very sensitive to extreme values. Even when a monitoring network is in a routine mode of operation, sensors can sporadically malfunction and report extreme values. If this extreme data destabilises the model, causing the covariance structure of the observed data to be incorrectly estimated, the generated maps will be of little value, and the uncertainty estimates in particular will be misleading. Marchant and Lark [2007] propose a REML estimator for the covariance, which is shown to work on small data sets with a manual selection of the damping parameter in the robust likelihood. We show how this can be extended to allow treatment of large data sets together with an automated approach to all parameter estimation. The projected process kriging framework of Ingram et al. [2007] is extended to allow the use of robust likelihood functions, including the two component Gaussian and the Huber function. We show how our algorithm is further refined to reduce the computational complexity while at the same time minimising any loss of information. To show the benefits of this method, we use data collected from radiation monitoring networks across Europe. We compare our results to those obtained from traditional kriging methodologies and include comparisons with Box-Cox transformations of the data. We discuss the issue of whether to treat or ignore extreme values, making the distinction between the robust methods which ignore outliers and transformation methods which treat them as part of the (transformed) process. Using a case study, based on an extreme radiological events over a large area, we show how radiation data collected from monitoring networks can be analysed automatically and then used to generate reliable maps to inform decision making. We show the limitations of the methods and discuss potential extensions to remedy these
Impact of Type 2 diabetes prevention programmes based on risk identification and lifestyle intervention intensity strategies: a cost-effectiveness analysis
Aim
To develop a cost-effectiveness model to compare Type 2 diabetes prevention programmes that target different at-risk population subgroups through lifestyle interventions of varying intensity.
Methods
An individual patient simulation model simulated the development of diabetes in a representative sample of adults without diabetes from the UK population. The model incorporates trajectories for HbA1c, 2-h glucose, fasting plasma glucose, BMI, systolic blood pressure, total cholesterol and HDL cholesterol. In the model, patients can be diagnosed with diabetes, cardiovascular disease, microvascular complications of diabetes, cancer, osteoarthritis and depression, or can die. The model collects costs and utilities over a lifetime horizon. The perspective is the UK National Health Service and Personal Social Services. We used the model to evaluate the population-wide impact of targeting a lifestyle intervention of varying intensity to six population subgroups defined as at high risk for diabetes.
Results
The intervention produces 0.0020 to 0.0026 incremental quality-adjusted life-years and saves £15 to £23 per person in the general population, depending on the subgroup targeted. Cost-effectiveness increases with intervention intensity. The most cost-effective options were to target South-Asian people and those with HbA1c levels > 42 mmol/mol (6%).
Conclusion
The model indicates that diabetes prevention interventions are likely to be cost-saving. The criteria for selecting at-risk individuals differentially has an impact on diabetes and cardiovascular disease outcomes, and on the timing of costs and benefits. The model is not currently able to account for potential differential uptake or efficacy between subgroups. These findings have implications for deciding who should be targeted for diabetes prevention interventions.NIH
The use of continuous electronic prescribing data to infer trends in antimicrobial consumption and estimate the impact of stewardship interventions in hospitalized children.
BACKGROUND: Understanding antimicrobial consumption is essential to mitigate the development of antimicrobial resistance, yet robust data in children are sparse and methodologically limited. Electronic prescribing systems provide an important opportunity to analyse and report antimicrobial consumption in detail. OBJECTIVES: We investigated the value of electronic prescribing data from a tertiary children's hospital to report temporal trends in antimicrobial consumption in hospitalized children and compare commonly used metrics of antimicrobial consumption. METHODS: Daily measures of antimicrobial consumption [days of therapy (DOT) and DDDs] were derived from the electronic prescribing system between 2010 and 2018. Autoregressive moving-average models were used to infer trends and the estimates were compared with simulated point prevalence surveys (PPSs). RESULTS: More than 1.3 million antimicrobial administrations were analysed. There was significant daily and seasonal variation in overall consumption, which reduced annually by 1.77% (95% CI 0.50% to 3.02%). Relative consumption of meropenem decreased by 6.6% annually (95% CI -3.5% to 15.8%) following the expansion of the hospital antimicrobial stewardship programme. DOT and DDDs exhibited similar trends for most antimicrobials, though inconsistencies were observed where changes to dosage guidelines altered consumption calculation by DDDs, but not DOT. PPS simulations resulted in estimates of change over time, which converged on the model estimates, but with much less precision. CONCLUSIONS: Electronic prescribing systems offer significant opportunities to better understand and report antimicrobial consumption in children. This approach to modelling administration data overcomes the limitations of using interval data and dispensary data. It provides substantially more detailed inferences on prescribing patterns and the potential impact of stewardship interventions
Investigating the missing data mechanism in quality of life outcomes: a comparison of approaches
Background: Missing data is classified as missing completely at random (MCAR), missing at
random (MAR) or missing not at random (MNAR). Knowing the mechanism is useful in identifying
the most appropriate analysis. The first aim was to compare different methods for identifying this
missing data mechanism to determine if they gave consistent conclusions. Secondly, to investigate
whether the reminder-response data can be utilised to help identify the missing data mechanism.
Methods: Five clinical trial datasets that employed a reminder system at follow-up were used.
Some quality of life questionnaires were initially missing, but later recovered through reminders.
Four methods of determining the missing data mechanism were applied. Two response data
scenarios were considered. Firstly, immediate data only; secondly, all observed responses
(including reminder-response).
Results: In three of five trials the hypothesis tests found evidence against the MCAR assumption.
Logistic regression suggested MAR, but was able to use the reminder-collected data to highlight
potential MNAR data in two trials.
Conclusion: The four methods were consistent in determining the missingness mechanism. One
hypothesis test was preferred as it is applicable with intermittent missingness. Some inconsistencies between the two data scenarios were found. Ignoring the reminder data could potentially give a distorted view of the missingness mechanism. Utilising reminder data allowed the possibility of MNAR to be considered.The Chief Scientist Office of the Scottish Government Health Directorate.
Research Training Fellowship (CZF/1/31
Risk-Targeted Selection of Agricultural Holdings for Post-Epidemic Surveillance: Estimation of Efficiency Gains
Current post-epidemic sero-surveillance uses random selection of animal holdings. A better strategy may be to estimate the benefits gained by sampling each farm and use this to target selection. In this study we estimate the probability of undiscovered infection for sheep farms in Devon after the 2001 foot-and-mouth disease outbreak using the combination of a previously published model of daily infection risk and a simple model of probability of discovery of infection during the outbreak. This allows comparison of the system sensitivity (ability to detect infection in the area) of arbitrary, random sampling compared to risk-targeted selection across a full range of sampling budgets. We show that it is possible to achieve 95% system sensitivity by sampling, on average, 945 farms with random sampling and 184 farms with risk-targeted sampling. We also examine the effect of ordering samples by risk to expedite return to a disease-free status. Risk ordering the sampling process results in detection of positive farms, if present, 15.6 days sooner than with randomly ordered sampling, assuming 50 farms are tested per day
- …