1,451 research outputs found

    Mathematically aggregating experts' predictions of possible futures

    Get PDF
    Structured protocols offer a transparent and systematic way to elicit and combine/aggregate, probabilistic predictions from multiple experts. These judgements can be aggregated behaviourally or mathematically to derive a final group prediction. Mathematical rules (e.g., weighted linear combinations of judgments) provide an objective approach to aggregation. The quality of this aggregation can be defined in terms of accuracy, calibration and informativeness. These measures can be used to compare different aggregation approaches and help decide on which aggregation produces the “best” final prediction. When experts’ performance can be scored on similar questions ahead of time, these scores can be translated into performance-based weights, and a performance-based weighted aggregation can then be used. When this is not possible though, several other aggregation methods, informed by measurable proxies for good performance, can be formulated and compared. Here, we develop a suite of aggregation methods, informed by previous experience and the available literature. We differentially weight our experts’ estimates by measures of reasoning, engagement, openness to changing their mind, informativeness, prior knowledge, and extremity, asymmetry or granularity of estimates. Next, we investigate the relative performance of these aggregation methods using three datasets. The main goal of this research is to explore how measures of knowledge and behaviour of individuals can be leveraged to produce a better performing combined group judgment. Although the accuracy, calibration, and informativeness of the majority of methods are very similar, a couple of the aggregation methods consistently distinguish themselves as among the best or worst. Moreover, the majority of methods outperform the usual benchmarks provided by the simple average or the median of estimates

    Development and validation of a diabetes-specific health state classification system and valuation function based on the multi-attribute theory

    Get PDF
    Preference-Based Measures of Health (PBMH) provide \u27preference\u27 or \u27utility\u27 weights that enable the calculation of quality-adjusted life years for the economic evaluations of interventions. The Diabetes Utility Index (DUI) was developed as a two-page, self-administered diabetes-specific PBMH that can replace expensive time-consuming interviews with patients to estimate their health state utilities. Inputs from theory, an existing diabetes-specific measure of quality of life, and statistical analyses were submitted to a clinical expert panel. After three rounds of pilot surveys (n1=52, n2=65, n3=111) at primary care clinics in Morgantown, WV, five attributes and severity categories for each attribute were finalized on the basis of the results of Rasch Analysis and consultations with the panel. The final attributes were: \u27Physical Ability & Energy\u27, \u27Relationships\u27, \u27Mood & Feelings\u27, \u27Enjoyment of Diet\u27, and \u27Satisfaction with Management of diabetes\u27. The next step involved obtaining preferences for health states based on combinations of DUI attributes and severity levels from 100 individuals with diabetes, recruited from primary care and community settings in and around Morgantown, WV, in hour-long one-on-one interviews. These health states were anchor states, single-attribute level states including corner states, and marker states. The interviews provided data to calculate a Multi-Attribute Utility Function (MAUF) that calculates utilities for any of the 768 health states that can be defined by the DUI, on a scale where 1.00=Perfect Health and 0.00=the all worse \u27Pits\u27 state, from respondents\u27 answers to its five questions. In addition to an overall index score, attribute-level preference scores were also calculable by the function. Finally, a validation survey was conducted in collaboration with the West Virginia University (WVU) Diabetes Institute. For concurrent and construct validation purposes, the DUI was mailed to individuals with diabetes along with generic PBMH like the EuroQol EQ-5D, the SF-6D and other patient-reported outcomes measures like the Diabetes Symptoms Checklist-Revised, the Short Form 12 (SF-12) and the Well-Being Questionnaire (W-BQ12), and their surveys responses (n=396) were merged with a clinical database consisting of ICD-9 diagnosis codes. The DUI utilities were found to be largely free of socio-demographic effects and its scores were well distributed between 0.00 and 1.00. The DUI moderately correlated with generic PBMH and distinguished between severity groups based on diabetes symptoms and complications. The scoring function of the DUI calculated utilities favorably compared against cardinal Standard Gamble utilities obtained directly from patients for three DUI health states. These results show evidence of the feasibility and validity of the DUI. Further research is suggested to demonstrate the generalizability of these findings, to study the responsiveness of the DUI, and to examine the clinical meaningfulness of the DUI change scores

    Suburban watershed nitrogen retention : estimating the effectiveness of stormwater management structures

    Get PDF
    Excess nitrogen (N) is a primary driver of freshwater and coastal eutrophication globally, and urban stormwater is a rapidly growing source of N pollution. Stormwater best management practices (BMPs) are used widely to remove excess N from runoff in urban and suburban areas, and are expected to perform under a wide variety of environmental conditions. Yet the capacity of BMPs to retain excess N varies; and both the variation and the drivers thereof are largely unknown, hindering the ability of water resource managers to meet water quality targets in a cost-effective way. Here, we use structured expert judgment (SEJ), a performance-weighted method of expert elicitation, to quantify the uncertainty in BMP performance under a range of site-specific environmental conditions and to estimate the extent to which key environmental factors influence variation in BMP performance. We hypothesized that rain event frequency and magnitude, BMP type and size, and physiographic province would significantly influence the experts’ estimates of N retention by BMPs common to suburban Piedmont and Coastal Plain watersheds of the Chesapeake Bay region. Expert knowledge indicated wide uncertainty in BMP performance, with N removal efficiencies ranging from 40%. Experts believed that the amount of rain was the primary identifiable source of variability in BMP efficiency, which is relevant given climate projections of more frequent heavy rain events in the mid-Atlantic. To assess the extent to which those projected changes might alter N export from suburban BMPs and watersheds, we combined downscaled estimates of rainfall with distributions of N loads for different-sized rain events derived from our elicitation. The model predicted higher and more variable N loads under a projected future climate regime, suggesting that current BMP regulations for reducing nutrients may be inadequate in the future

    Use of expert elicitation to assign weights to climate and hydrological models in climate impact studies

    Get PDF
    Various methods are available for assessing uncertainties in climate impact studies. Among such methods, model weighting by expert elicitation is a practical way to provide a weighted ensemble of models for specific real-world impacts. The aim is to decrease the influence of improbable models in the results and easing the decisionmaking process. In this study both climate and hydrological models are analysed, and the result of a research experiment is presented using model weighting with the participation of six climate model experts and six hydrological model experts. For the experiment, seven climate models are a priori selected from a larger EURO-CORDEX (Coordinated Regional Downscaling Experiment – European Domain) ensemble of climate models, and three different hydrological models are chosen for each of the three European river basins. The model weighting is based on qualitative evaluation by the experts for each of the selected models based on a training material that describes the overall model structure and literature about climate models and the performance of hydrological models for the present period. The expert elicitation process follows a three-stage approach, with two individual rounds of elicitation of probabilities and a final group consensus, where the experts are separated into two different community groups: a climate and a hydrological modeller group. The dialogue reveals that under the conditions of the study, most climate modellers prefer the equal weighting of ensemble members, whereas hydrological-impact modellers in general are more open for assigning weights to different models in a multi-model ensemble, based on model performance and model structure. Climate experts are more open to exclude models, if obviously flawed, than to put weights on selected models in a relatively small ensemble. The study shows that expert elicitation can be an efficient way to assign weights to different hydrological models and thereby reduce the uncertainty in climate impact. However, for the climate model ensemble, comprising seven models, the elicitation in the format of this study could only re-establish a uniform weight between climate models.European Commission European Commission Joint Research Centre 69046
    • …
    corecore