26 research outputs found
Information Retrieval Evaluation Measures Based on Preference Graphs
Offline evaluation for web search has used mostly graded judgments to evaluate the performance of information retrieval systems. While graded judgments suffer several known problems, preference judgments simply judge one item over another, which avoids the problem of complex definition of relevance scores. Previous research about evaluation measures for preference judgments focuses on translating preferences into relevance scores applied in the traditional evaluation measures, or weighting and counting the number of agreements between actual ranking from users’ preferences and ideal ranking generated by systems. However, these measures lack clear theoretical foundations and their values have no obvious interpretation. On the other hand, although preference judgments for general web search have been studied extensively, there is limited research on investigating preference judgments application for web image search.
This thesis addresses exactly these questions, which proposes a preference-based evaluation measure to compute the maximum similarity between an actual ranking from users’ preferences and an ideal ranking generated by systems. Specifically, this measure constructs a directed multigraph and computes the ordering of vertices, which we call the ideal ranking, that has maximum similarity to actual ranking calculated by the rank similarity measure. This measure is able to take any arbitrary collection of preferences that might include the property of conflicts, redundancies, incompleteness, and diverse type results (documents or images). Our results show that Greedy PGC matches or exceeds the performance of evaluation measures proposed in previous research
Human Preferences as Dueling Bandits
© 2022 Association for Computing Machinery. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval,
http://dx.doi.org/10.1145/3477495.3531991The dramatic improvements in core information retrieval tasks engendered by neural rankers create a need for novel evaluation methods. If every ranker returns highly relevant items in the top ranks, it becomes difficult to recognize meaningful differences between them and to build reusable test collections. Several recent papers explore pairwise preference judgments as an alternative to traditional graded relevance assessments. Rather than viewing items one at a time, assessors view items side-by-side and indicate the one that provides the better response to a query, allowing fine-grained distinctions. If we employ preference judgments to identify the probably best items for each query, we can measure rankers by their ability to place these items as high as possible. We frame the problem of finding best items as a dueling bandits problem. While many papers explore dueling bandits for online ranker evaluation via interleaving, they have not been considered as a framework for offline evaluation via human preference judgments. We review the literature for possible solutions. For human preference judgments, any usable algorithm must tolerate ties, since two items may appear nearly equal to assessors, and it must minimize the number of judgments required for any specific pair, since each such comparison requires an independent assessor. Since the theoretical guarantees provided by most algorithms depend on assumptions that are not satisfied by human preference judgments, we simulate selected algorithms on representative test cases to provide insight into their practical utility. Based on these simulations, one algorithm stands out for its potential. Our simulations suggest modifications to further improve its performance. Using the modified algorithm, we collect over 10,000 preference judgments for pools derived from submissions to the TREC 2021 Deep Learning Track, confirming its suitability. We test the idea of best-item evaluation and suggest ideas for further theoretical and practical progress.We thank Mark Smucker, Gautam Kamath, and Ben Carterette for
their feedback. This research was supported by the Natural Science
and Engineering Research Council of Canada through its Discovery
Grants program
A Spacer Cation Assisted Nucleation and Growth Strategy Enables Efficient and High‐Luminance Quasi‐2D Perovskite LEDs
Quasi‐2D Ruddlesden‐Popper perovskites receive tremendous attention for application in light‐emitting diodes (LEDs). However, the role of organic ammonium spacers on perovskite film has not been fully‐understood. Herein, a spacer cation assisted perovskite nucleation and growth strategy, where guanidinium (GA+) spacer is introduced into the perovskite precursor and at the interface between the hole transport layer (HTL) and the perovskite, to achieve dense and uniform perovskite films with enhanced optical and electrical performance is developed. A thin GABr interface pre‐formed on HTL provides more nucleation sites for perovskite crystal; while the added GA+ in perovskite reduces the crystallization rate due to strong hydrogen bonding interacts with intermediates, which promotes the growth of enhanced‐quality quasi‐2D perovskite films. The ionized ammonium group ( NH3+) of GA+ also favors formation of polydisperse domain distribution, and amine or imine ( NH2 or NH) group interact with perovskite defects through coordination bonding. The spacer cation assisted nucleation and growth strategy is advantageous for producing efficient and high‐luminance perovskite LEDs, with a peak external quantum efficiency of over 20% and a luminance up to 100 000 cd m−2. This work can inform and underpin future development of high‐performance perovskite LEDs with concurrent high efficiency and brightness
Burden of disease scenarios for 204 countries and territories, 2022–2050: a forecasting analysis for the Global Burden of Disease Study 2021
Background: Future trends in disease burden and drivers of health are of great interest to policy makers and the public at large. This information can be used for policy and long-term health investment, planning, and prioritisation. We have expanded and improved upon previous forecasts produced as part of the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) and provide a reference forecast (the most likely future), and alternative scenarios assessing disease burden trajectories if selected sets of risk factors were eliminated from current levels by 2050. Methods: Using forecasts of major drivers of health such as the Socio-demographic Index (SDI; a composite measure of lag-distributed income per capita, mean years of education, and total fertility under 25 years of age) and the full set of risk factor exposures captured by GBD, we provide cause-specific forecasts of mortality, years of life lost (YLLs), years lived with disability (YLDs), and disability-adjusted life-years (DALYs) by age and sex from 2022 to 2050 for 204 countries and territories, 21 GBD regions, seven super-regions, and the world. All analyses were done at the cause-specific level so that only risk factors deemed causal by the GBD comparative risk assessment influenced future trajectories of mortality for each disease. Cause-specific mortality was modelled using mixed-effects models with SDI and time as the main covariates, and the combined impact of causal risk factors as an offset in the model. At the all-cause mortality level, we captured unexplained variation by modelling residuals with an autoregressive integrated moving average model with drift attenuation. These all-cause forecasts constrained the cause-specific forecasts at successively deeper levels of the GBD cause hierarchy using cascading mortality models, thus ensuring a robust estimate of cause-specific mortality. For non-fatal measures (eg, low back pain), incidence and prevalence were forecasted from mixed-effects models with SDI as the main covariate, and YLDs were computed from the resulting prevalence forecasts and average disability weights from GBD. Alternative future scenarios were constructed by replacing appropriate reference trajectories for risk factors with hypothetical trajectories of gradual elimination of risk factor exposure from current levels to 2050. The scenarios were constructed from various sets of risk factors: environmental risks (Safer Environment scenario), risks associated with communicable, maternal, neonatal, and nutritional diseases (CMNNs; Improved Childhood Nutrition and Vaccination scenario), risks associated with major non-communicable diseases (NCDs; Improved Behavioural and Metabolic Risks scenario), and the combined effects of these three scenarios. Using the Shared Socioeconomic Pathways climate scenarios SSP2-4.5 as reference and SSP1-1.9 as an optimistic alternative in the Safer Environment scenario, we accounted for climate change impact on health by using the most recent Intergovernmental Panel on Climate Change temperature forecasts and published trajectories of ambient air pollution for the same two scenarios. Life expectancy and healthy life expectancy were computed using standard methods. The forecasting framework includes computing the age-sex-specific future population for each location and separately for each scenario. 95% uncertainty intervals (UIs) for each individual future estimate were derived from the 2·5th and 97·5th percentiles of distributions generated from propagating 500 draws through the multistage computational pipeline. Findings: In the reference scenario forecast, global and super-regional life expectancy increased from 2022 to 2050, but improvement was at a slower pace than in the three decades preceding the COVID-19 pandemic (beginning in 2020). Gains in future life expectancy were forecasted to be greatest in super-regions with comparatively low life expectancies (such as sub-Saharan Africa) compared with super-regions with higher life expectancies (such as the high-income super-region), leading to a trend towards convergence in life expectancy across locations between now and 2050. At the super-region level, forecasted healthy life expectancy patterns were similar to those of life expectancies. Forecasts for the reference scenario found that health will improve in the coming decades, with all-cause age-standardised DALY rates decreasing in every GBD super-region. The total DALY burden measured in counts, however, will increase in every super-region, largely a function of population ageing and growth. We also forecasted that both DALY counts and age-standardised DALY rates will continue to shift from CMNNs to NCDs, with the most pronounced shifts occurring in sub-Saharan Africa (60·1% [95% UI 56·8–63·1] of DALYs were from CMNNs in 2022 compared with 35·8% [31·0–45·0] in 2050) and south Asia (31·7% [29·2–34·1] to 15·5% [13·7–17·5]). This shift is reflected in the leading global causes of DALYs, with the top four causes in 2050 being ischaemic heart disease, stroke, diabetes, and chronic obstructive pulmonary disease, compared with 2022, with ischaemic heart disease, neonatal disorders, stroke, and lower respiratory infections at the top. The global proportion of DALYs due to YLDs likewise increased from 33·8% (27·4–40·3) to 41·1% (33·9–48·1) from 2022 to 2050, demonstrating an important shift in overall disease burden towards morbidity and away from premature death. The largest shift of this kind was forecasted for sub-Saharan Africa, from 20·1% (15·6–25·3) of DALYs due to YLDs in 2022 to 35·6% (26·5–43·0) in 2050. In the assessment of alternative future scenarios, the combined effects of the scenarios (Safer Environment, Improved Childhood Nutrition and Vaccination, and Improved Behavioural and Metabolic Risks scenarios) demonstrated an important decrease in the global burden of DALYs in 2050 of 15·4% (13·5–17·5) compared with the reference scenario, with decreases across super-regions ranging from 10·4% (9·7–11·3) in the high-income super-region to 23·9% (20·7–27·3) in north Africa and the Middle East. The Safer Environment scenario had its largest decrease in sub-Saharan Africa (5·2% [3·5–6·8]), the Improved Behavioural and Metabolic Risks scenario in north Africa and the Middle East (23·2% [20·2–26·5]), and the Improved Nutrition and Vaccination scenario in sub-Saharan Africa (2·0% [–0·6 to 3·6]). Interpretation: Globally, life expectancy and age-standardised disease burden were forecasted to improve between 2022 and 2050, with the majority of the burden continuing to shift from CMNNs to NCDs. That said, continued progress on reducing the CMNN disease burden will be dependent on maintaining investment in and policy emphasis on CMNN disease prevention and treatment. Mostly due to growth and ageing of populations, the number of deaths and DALYs due to all causes combined will generally increase. By constructing alternative future scenarios wherein certain risk exposures are eliminated by 2050, we have shown that opportunities exist to substantially improve health outcomes in the future through concerted efforts to prevent exposure to well established risk factors and to expand access to key health interventions
Global burden and strength of evidence for 88 risk factors in 204 countries and 811 subnational locations, 1990–2021: a systematic analysis for the Global Burden of Disease Study 2021
Background: Understanding the health consequences associated with exposure to risk factors is necessary to inform public health policy and practice. To systematically quantify the contributions of risk factor exposures to specific health outcomes, the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2021 aims to provide comprehensive estimates of exposure levels, relative health risks, and attributable burden of disease for 88 risk factors in 204 countries and territories and 811 subnational locations, from 1990 to 2021. Methods: The GBD 2021 risk factor analysis used data from 54 561 total distinct sources to produce epidemiological estimates for 88 risk factors and their associated health outcomes for a total of 631 risk–outcome pairs. Pairs were included on the basis of data-driven determination of a risk–outcome association. Age-sex-location-year-specific estimates were generated at global, regional, and national levels. Our approach followed the comparative risk assessment framework predicated on a causal web of hierarchically organised, potentially combinative, modifiable risks. Relative risks (RRs) of a given outcome occurring as a function of risk factor exposure were estimated separately for each risk–outcome pair, and summary exposure values (SEVs), representing risk-weighted exposure prevalence, and theoretical minimum risk exposure levels (TMRELs) were estimated for each risk factor. These estimates were used to calculate the population attributable fraction (PAF; ie, the proportional change in health risk that would occur if exposure to a risk factor were reduced to the TMREL). The product of PAFs and disease burden associated with a given outcome, measured in disability-adjusted life-years (DALYs), yielded measures of attributable burden (ie, the proportion of total disease burden attributable to a particular risk factor or combination of risk factors). Adjustments for mediation were applied to account for relationships involving risk factors that act indirectly on outcomes via intermediate risks. Attributable burden estimates were stratified by Socio-demographic Index (SDI) quintile and presented as counts, age-standardised rates, and rankings. To complement estimates of RR and attributable burden, newly developed burden of proof risk function (BPRF) methods were applied to yield supplementary, conservative interpretations of risk–outcome associations based on the consistency of underlying evidence, accounting for unexplained heterogeneity between input data from different studies. Estimates reported represent the mean value across 500 draws from the estimate's distribution, with 95% uncertainty intervals (UIs) calculated as the 2·5th and 97·5th percentile values across the draws. Findings: Among the specific risk factors analysed for this study, particulate matter air pollution was the leading contributor to the global disease burden in 2021, contributing 8·0% (95% UI 6·7–9·4) of total DALYs, followed by high systolic blood pressure (SBP; 7·8% [6·4–9·2]), smoking (5·7% [4·7–6·8]), low birthweight and short gestation (5·6% [4·8–6·3]), and high fasting plasma glucose (FPG; 5·4% [4·8–6·0]). For younger demographics (ie, those aged 0–4 years and 5–14 years), risks such as low birthweight and short gestation and unsafe water, sanitation, and handwashing (WaSH) were among the leading risk factors, while for older age groups, metabolic risks such as high SBP, high body-mass index (BMI), high FPG, and high LDL cholesterol had a greater impact. From 2000 to 2021, there was an observable shift in global health challenges, marked by a decline in the number of all-age DALYs broadly attributable to behavioural risks (decrease of 20·7% [13·9–27·7]) and environmental and occupational risks (decrease of 22·0% [15·5–28·8]), coupled with a 49·4% (42·3–56·9) increase in DALYs attributable to metabolic risks, all reflecting ageing populations and changing lifestyles on a global scale. Age-standardised global DALY rates attributable to high BMI and high FPG rose considerably (15·7% [9·9–21·7] for high BMI and 7·9% [3·3–12·9] for high FPG) over this period, with exposure to these risks increasing annually at rates of 1·8% (1·6–1·9) for high BMI and 1·3% (1·1–1·5) for high FPG. By contrast, the global risk-attributable burden and exposure to many other risk factors declined, notably for risks such as child growth failure and unsafe water source, with age-standardised attributable DALYs decreasing by 71·5% (64·4–78·8) for child growth failure and 66·3% (60·2–72·0) for unsafe water source. We separated risk factors into three groups according to trajectory over time: those with a decreasing attributable burden, due largely to declining risk exposure (eg, diet high in trans-fat and household air pollution) but also to proportionally smaller child and youth populations (eg, child and maternal malnutrition); those for which the burden increased moderately in spite of declining risk exposure, due largely to population ageing (eg, smoking); and those for which the burden increased considerably due to both increasing risk exposure and population ageing (eg, ambient particulate matter air pollution, high BMI, high FPG, and high SBP). Interpretation: Substantial progress has been made in reducing the global disease burden attributable to a range of risk factors, particularly those related to maternal and child health, WaSH, and household air pollution. Maintaining efforts to minimise the impact of these risk factors, especially in low SDI locations, is necessary to sustain progress. Successes in moderating the smoking-related burden by reducing risk exposure highlight the need to advance policies that reduce exposure to other leading risk factors such as ambient particulate matter air pollution and high SBP. Troubling increases in high FPG, high BMI, and other risk factors related to obesity and metabolic syndrome indicate an urgent need to identify and implement interventions
Experimental investigation on mechanical properties and deformation mechanism of soft-hard interbedded rock-like material based on digital image correlation
The soft-hard interbedded rocks are often encountered in rock engineering such as mining engineering, tunnel engineering, and natural gas engineering. In this study, the horizontal soft-hard interbedded composite rock-like material (HSHC) specimens were prepared with different thickness ratios of rock-like layers, and a series of uniaxial compression tests were performed by using a self-designed loading mold. A parameter R was defined for the HSHC specimens, which is related to the thickness ratio of the soft and hard rock-like layers, the length-width ratio, and the mechanical parameters of the soft and hard rock-like materials. R was regarded to characterize the strength properties of the HSHC specimens. The test results showed that the axial strength of the HSHC specimens decreases first and then tends to be constant with increasing of R. Digital image correlation (DIC) technique was used to investigate the strain evolution and crack propagation process of the HSHC specimens. The displacement variation trend of the specimen can be roughly divided into three stages: stable deformation stage, rapid crack growth stage and instability failure stage. The failure mode of the HSHC specimens was mainly splitting tensile failure along the bedding plane. Meanwhile, a simplified mechanical model of laminated rock mass under uniaxial compression is established. The strength variation trend predicted by this model is consistent with the experimental results. The present study can reveal the failure mechanism of soft-hard interbedded rock mass at the bottom of the tunnel under the action of large horizontal stress after excavation
Dense projection fusion for 3D object detection
Abstract Fusing information from LiDAR and cameras can effectively enhance the overall perceptivity of autonomous vehicles in various scenarios. Despite the relatively good results achieved by point-wise fusion and Bird’s-Eye-View (BEV) fusion, they still cannot fully leverage the image information and lack of effective depth information. For any fusion methods, the multi-modal features first need to be concatenated along the channel, and then the fused features are extracted using convolutional layers. However, this type of fusion methods is effective, but too coarse which causes that the fused features cannot pay more attention to the regions with important features and suffer from severe noise. To tackle these issues, we propose in this paper a Dense Projection Fusion (DPFusion) approach. It consists of two new modules: dense depth map guided BEV transform (DGBT) module and multi-modal feature adaptive fusion (MFAF) module. The DGBT module first quickly estimates the depth of each pixel and then projects all image features to the BEV space, making full use of the image information. The MFAF module computes the image weights and point cloud weights for each channel in each BEV grid and then adaptively weights and fuses the image BEV features with the point cloud BEV features. It is worth pointing out that the MFAF module makes the fused features pay more attention to background outline and object outline. Our proposed DPFusion demonstrates competitive results in 3D object detection, achieving a mean Average Precision (mAP) of 70.4 and a nuScenes detection score (NDS) of 72.3 on the nuScenes validation set
Comparison of microplastic pollution in different water bodies from urban creeks to coastal waters
Although freshwater and estuary systems are recognized as origins and transport pathways of plastics to the oceans, there is a lack of comparison of microplastics in different water bodies or river networks. In the present study, the spatial distribution of microplastics was compared across different water bodies, including city creeks (Shanghai), rivers (Suzhou River and Huangpu River), an estuary (Yangtze Estuary) and coastal waters (East China Sea) in the Yangtze Delta area. Significant spatial differences of micro plastic abundances were revealed across the sampling areas. The results showed that the abundance of microplastics was higher (1.8-2.4 items/L) in freshwater bodies than that in estuarine and coastal water (0.9 items/L). In the Suzhou River and the Huangpu River, microplastics showed trends of increasing abundance downstream, where the peak of microplastic pollution is closer to the city center and the estuary. In respect of abundance, microplastics are likely to be transported from pollution sources to sink areas via river networks. The proportion of fibers was the highest in city creeks (88%), followed by the Suzhou River (85%), the Huangpu River (81%), the Yangtze Estuary (66%) and the East China Sea (37%). Similarly, polyesters dominated in city creeks and rivers. The results suggest that both the abundance and properties of microplastic pollution varies across different water bodies. Microplastic pollution in small freshwater bodies is more serious than in estuarine and coastal waters. Therefore, we support prioritization of water monitoring for microplastics within entire river networks, instead of single water body surveys. (C) 2018 Elsevier Ltd. All rights reserved
5-Fluorouracil combined with CalliSphere drug-eluting beads or conventional transarterial chemoembolization for unresectable hepatocellular carcinoma: a propensity score weighting analysis
Abstract This study aimed to assess the effectiveness and safety of 5-Fluorouracil (5-Fu) combined with conventional transarterial chemoembolization (cTACE) compared to 5-Fu combined with drug-eluting bead transarterial chemoembolization (DEB-TACE) using CalliSpheres for the treatment of unresectable hepatocellular carcinoma (HCC) using propensity score weighting methods. This retrospective analysis included 131 patients with HCC treated with 5-Fu combined with cTACE (5-Fu-cTACE group, n = 65) or DEB-TACE (5-Fu-DEB-TACE group, n = 66) at the Affiliated Hospital of North Sichuan Medical College from January 2019 to December 2022. Based on the baseline data and laboratory indicators, propensity score weighting was used to reduce confounding bias. Modified response evaluation criteria in solid tumors (mRECIST) were used to evaluate clinical efficacy. The primary endpoint was progression-free survival (PFS), and the secondary endpoints were the disease control rate (DCR), objective response rate (ORR) and adverse events (AEs). PFS was assessed using Kaplan‒Meier analysis and Cox proportional hazards models. The ORRs at 1 month (M1) after treatment in the 5-Fu-DEB-TACE group and 5-Fu-cTACE group were 90.9% and 76.9%, respectively (P = 0.029), while at this time, the DCRs were 93.9% in the 5-Fu-DEB-TACE group and 90.8% in the 5-Fu-cTACE group (P = 0.494). At 3 months (M3) after treatment, the 5-Fu-DEB-TACE group had a higher ORR (84.8% vs. 56.9%, P < 0.001) and DCR (84.8% vs. 72.3%, P = 0.08). The ORR at 6 months (M6) was also higher in the 5-Fu-DEB-TACE group than in the 5-Fu-cTACE group (72.7% vs. 50.8%, P = 0.01). The median PFS after treatment with 5-Fu-DEB-TACE was longer than that after treatment with 5-Fu-cTACE (11 months vs. 6 months) (P = 0.004). Cox proportional hazards regression analysis indicated that 5-Fu-DEB-TACE (HR = 0.590, P = 0.044), Model for End-Stage Liver Disease (MELD) intermediate risk (HR = 2.470, P = 0.010), BCLC stage B (HR = 2.303, P = 0.036), BCLC stage C (HR = 3.354, P = 0.002) and ascitic fluid (HR = 2.004, P = 0.046) were independent predictors of PFS. No treatment-related deaths occurred in this study. The 5-Fu-DEB-TACE group had a greater incidence of abdominal pain (72.7% vs. 47.7%, P = 0.003). However, the incidence of postoperative elevated transaminase levels was higher in the 5-Fu-cTACE group (83.1% vs. 66.6%, P = 0.031). Subgroups analysis showed patients receiving 5-Fu-DEB-TACE have better PFS compared to those receiving 5-Fu-cTACE in the BCLC stage A group (P = 0.0093), BCLC stage B group (P = 0.0096), multifocal group (P = 0.0056), Child-Pugh stage A group (P<0.001), non- extrahepatic metastasis group (P = 0.022), non-vascular invasion group (P = 0.0093), and the group with a largest tumor diameter ≥ 5 cm (P = 0.0048). At M1, M3, and M6, patients with preserved liver function and in some cases of low tumor burden had higher Objective Response Rate (ORR) and Disease Control Rate (DCR) (P < 0.05). Compared with 5-Fu-cTACE, 5-Fu-DEB-TACE has superior therapeutic efficacy, prolongs PFS, and reduces hepatotoxicity. However, it is associated with an increased incidence of postoperative abdominal pain