187 research outputs found

    Succinct Dictionary Matching With No Slowdown

    Full text link
    The problem of dictionary matching is a classical problem in string matching: given a set S of d strings of total length n characters over an (not necessarily constant) alphabet of size sigma, build a data structure so that we can match in a any text T all occurrences of strings belonging to S. The classical solution for this problem is the Aho-Corasick automaton which finds all occ occurrences in a text T in time O(|T| + occ) using a data structure that occupies O(m log m) bits of space where m <= n + 1 is the number of states in the automaton. In this paper we show that the Aho-Corasick automaton can be represented in just m(log sigma + O(1)) + O(d log(n/d)) bits of space while still maintaining the ability to answer to queries in O(|T| + occ) time. To the best of our knowledge, the currently fastest succinct data structure for the dictionary matching problem uses space O(n log sigma) while answering queries in O(|T|log log n + occ) time. In this paper we also show how the space occupancy can be reduced to m(H0 + O(1)) + O(d log(n/d)) where H0 is the empirical entropy of the characters appearing in the trie representation of the set S, provided that sigma < m^epsilon for any constant 0 < epsilon < 1. The query time remains unchanged.Comment: Corrected typos and other minor error

    The impact of self-efficacy, expectations, and readiness on hearing aid outcomes

    Get PDF
    Objective: To examine the impact of self-efficacy and expectations for hearing aids, and readiness to improve hearing, on hearing aid outcome measures in first-time adult hearing aid users Design: A prospective, single centre design. Predictor variables measured at the hearing assessment included measures of self-efficacy, expectations and readiness to improve hearing. Outcome measures obtained at sixweek follow-up were the Glasgow Hearing Aid Benefit Profile and Satisfaction with Amplification in Daily Life. Study sample: A sample of 30 first-time adult hearing aid users were recruited through a public-sector funded audiology clinic. Results: When measured prior to hearing aid fitting, self-efficacy for hearing aids predicted satisfaction with hearing aids but was not related to other hearing aid outcomes. Expectations of hearing aids, in particular positive expectations, and readiness to improve hearing predicted outcomes for hearing aid satisfaction and benefit, although not hearing aid use. Hearing sensitivity was not correlated with hearing aid outcomes. Conclusions: These results suggest that assessment of expectations of hearing aids, and readiness to improve hearing, may be useful to help identify individuals attending audiology clinics who would most likely benefit from hearing aid provision

    Machine learning-based investigation of the association between CMEs and filaments

    Get PDF
    YesIn this work we study the association between eruptive filaments/prominences and coronal mass ejections (CMEs) using machine learning-based algorithms that analyse the solar data available between January 1996 and December 2001. The Support Vector Machine (SVM) learning algorithm is used for the purpose of knowledge extraction from the association results. The aim is to identify patterns of associations that can be represented using SVM learning rules for the subsequent use in near real-time and reliable CME prediction systems. Timing and location data in the NGDC filament catalogue and the SOHO/LASCO CME catalogue are processed to associate filaments with CMEs. In the previous studies which classified CMEs into gradual and impulsive CMEs, the associations were refined based on CME speed and acceleration. Then the associated pairs were refined manually to increase the accuracy of the training dataset. In the current study, a data- mining system has been created to process and associate filament and CME data, which are arranged in numerical training vectors. Then the data are fed to SVMs to extract the embedded knowledge and provide the learning rules that could have the potential, in the future, to provide automated predictions of CMEs. The features representing the event time (average of the start and end times), duration, type and extent of the filaments are extracted from all the associated and not-associated filaments and converted to a numerical format that is suitable for SVM use. Several validation and verification methods are used on the extracted dataset to determine if CMEs can be predicted solely and efficiently based on the associated filaments. More than 14000 experiments are carried out to optimise the SVM and determine the input features that provide the best performance

    Global Carbon Budget 2018

    Get PDF
    Accurate assessment of anthropogenic carbon dioxide (CO2) emissions and their redistribution among the atmosphere, ocean, and terrestrial biosphere – the “global carbon budget” – is important to better understand the global carbon cycle, support the development of climate policies, and project future climate change. Here we describe data sets and methodology to quantify the five major components of the global carbon budget and their uncertainties. Fossil CO2 emissions (EFF) are based on energy statistics and cement production data, while emissions from land use and land-use change (ELUC), mainly deforestation, are based on land use and land-use change data and bookkeeping models. Atmospheric CO2 concentration is measured directly and its growth rate (GATM) is computed from the annual changes in concentration. The ocean CO2 sink (SOCEAN) and terrestrial CO2 sink (SLAND) are estimated with global process models constrained by observations. The resulting carbon budget imbalance (BIM), the difference between the estimated total emissions and the estimated changes in the atmosphere, ocean, and terrestrial biosphere, is a measure of imperfect data and understanding of the contemporary carbon cycle. All uncertainties are reported as ±1σ. For the last decade available (2008–2017), EFF was 9.4±0.5 GtC yr−1, ELUC 1.5±0.7 GtC yr−1, GATM 4.7±0.02 GtC yr−1, SOCEAN 2.4±0.5 GtC yr−1, and SLAND 3.2±0.8 GtC yr−1, with a budget imbalance BIM of 0.5 GtC yr−1 indicating overestimated emissions and/or underestimated sinks. For the year 2017 alone, the growth in EFF was about 1.6 % and emissions increased to 9.9±0.5 GtC yr−1. Also for 2017, ELUC was 1.4±0.7 GtC yr−1, GATM was 4.6±0.2 GtC yr−1, SOCEAN was 2.5±0.5 GtC yr−1, and SLAND was 3.8±0.8 GtC yr−1, with a BIM of 0.3 GtC. The global atmospheric CO2 concentration reached 405.0±0.1 ppm averaged over 2017. For 2018, preliminary data for the first 6–9 months indicate a renewed growth in EFF of +2.7 % (range of 1.8 % to 3.7 %) based on national emission projections for China, the US, the EU, and India and projections of gross domestic product corrected for recent changes in the carbon intensity of the economy for the rest of the world. The analysis presented here shows that the mean and trend in the five components of the global carbon budget are consistently estimated over the period of 1959–2017, but discrepancies of up to 1 GtC yr−1 persist for the representation of semi-decadal variability in CO2 fluxes. A detailed comparison among individual estimates and the introduction of a broad range of observations show (1) no consensus in the mean and trend in land-use change emissions, (2) a persistent low agreement among the different methods on the magnitude of the land CO2 flux in the northern extra-tropics, and (3) an apparent underestimation of the CO2 variability by ocean models, originating outside the tropics. This living data update documents changes in the methods and data sets used in this new global carbon budget and the progress in understanding the global carbon cycle compared with previous publications of this data set (Le Quéré et al., 2018, 2016, 2015a, b, 2014, 2013). All results presented here can be downloaded from https://doi.org/10.18160/GCP-2018

    Mapping local patterns of childhood overweight and wasting in low- and middle-income countries between 2000 and 2017

    Get PDF
    A double burden of malnutrition occurs when individuals, household members or communities experience both undernutrition and overweight. Here, we show geospatial estimates of overweight and wasting prevalence among children under 5 years of age in 105 low- and middle-income countries (LMICs) from 2000 to 2017 and aggregate these to policy-relevant administrative units. Wasting decreased overall across LMICs between 2000 and 2017, from 8.4% (62.3 (55.1–70.8) million) to 6.4% (58.3 (47.6–70.7) million), but is predicted to remain above the World Health Organization’s Global Nutrition Target of <5% in over half of LMICs by 2025. Prevalence of overweight increased from 5.2% (30 (22.8–38.5) million) in 2000 to 6.0% (55.5 (44.8–67.9) million) children aged under 5 years in 2017. Areas most affected by double burden of malnutrition were located in Indonesia, Thailand, southeastern China, Botswana, Cameroon and central Nigeria. Our estimates provide a new perspective to researchers, policy makers and public health agencies in their efforts to address this global childhood syndemic

    Mapping geographical inequalities in childhood diarrhoeal morbidity and mortality in low-income and middle-income countries, 2000–17 : analysis for the Global Burden of Disease Study 2017

    Get PDF
    Background Across low-income and middle-income countries (LMICs), one in ten deaths in children younger than 5 years is attributable to diarrhoea. The substantial between-country variation in both diarrhoea incidence and mortality is attributable to interventions that protect children, prevent infection, and treat disease. Identifying subnational regions with the highest burden and mapping associated risk factors can aid in reducing preventable childhood diarrhoea. Methods We used Bayesian model-based geostatistics and a geolocated dataset comprising 15 072 746 children younger than 5 years from 466 surveys in 94 LMICs, in combination with findings of the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2017, to estimate posterior distributions of diarrhoea prevalence, incidence, and mortality from 2000 to 2017. From these data, we estimated the burden of diarrhoea at varying subnational levels (termed units) by spatially aggregating draws, and we investigated the drivers of subnational patterns by creating aggregated risk factor estimates. Findings The greatest declines in diarrhoeal mortality were seen in south and southeast Asia and South America, where 54·0% (95% uncertainty interval [UI] 38·1–65·8), 17·4% (7·7–28·4), and 59·5% (34·2–86·9) of units, respectively, recorded decreases in deaths from diarrhoea greater than 10%. Although children in much of Africa remain at high risk of death due to diarrhoea, regions with the most deaths were outside Africa, with the highest mortality units located in Pakistan. Indonesia showed the greatest within-country geographical inequality; some regions had mortality rates nearly four times the average country rate. Reductions in mortality were correlated to improvements in water, sanitation, and hygiene (WASH) or reductions in child growth failure (CGF). Similarly, most high-risk areas had poor WASH, high CGF, or low oral rehydration therapy coverage. Interpretation By co-analysing geospatial trends in diarrhoeal burden and its key risk factors, we could assess candidate drivers of subnational death reduction. Further, by doing a counterfactual analysis of the remaining disease burden using key risk factors, we identified potential intervention strategies for vulnerable populations. In view of the demands for limited resources in LMICs, accurately quantifying the burden of diarrhoea and its drivers is important for precision public health

    Alignment of the CMS silicon tracker during commissioning with cosmic rays

    Get PDF
    This is the Pre-print version of the Article. The official published version of the Paper can be accessed from the link below - Copyright @ 2010 IOPThe CMS silicon tracker, consisting of 1440 silicon pixel and 15 148 silicon strip detector modules, has been aligned using more than three million cosmic ray charged particles, with additional information from optical surveys. The positions of the modules were determined with respect to cosmic ray trajectories to an average precision of 3–4 microns RMS in the barrel and 3–14 microns RMS in the endcap in the most sensitive coordinate. The results have been validated by several studies, including laser beam cross-checks, track fit self-consistency, track residuals in overlapping module regions, and track parameter resolution, and are compared with predictions obtained from simulation. Correlated systematic effects have been investigated. The track parameter resolutions obtained with this alignment are close to the design performance.This work is supported by FMSR (Austria); FNRS and FWO (Belgium); CNPq, CAPES, FAPERJ, and FAPESP (Brazil); MES (Bulgaria); CERN; CAS, MoST, and NSFC (China); COLCIENCIAS (Colombia); MSES (Croatia); RPF (Cyprus); Academy of Sciences and NICPB (Estonia); Academy of Finland, ME, and HIP (Finland); CEA and CNRS/IN2P3 (France); BMBF, DFG, and HGF (Germany); GSRT (Greece); OTKA and NKTH (Hungary); DAE and DST (India); IPM (Iran); SFI (Ireland); INFN (Italy); NRF (Korea); LAS (Lithuania); CINVESTAV, CONACYT, SEP, and UASLP-FAI (Mexico); PAEC (Pakistan); SCSR (Poland); FCT (Portugal); JINR (Armenia, Belarus, Georgia, Ukraine, Uzbekistan); MST and MAE (Russia); MSTDS (Serbia); MICINN and CPAN (Spain); Swiss Funding Agencies (Switzerland); NSC (Taipei); TUBITAK and TAEK (Turkey); STFC (United Kingdom); DOE and NSF (USA)

    Commissioning and performance of the CMS pixel tracker with cosmic ray muons

    Get PDF
    This is the Pre-print version of the Article. The official published verion of the Paper can be accessed from the link below - Copyright @ 2010 IOPThe pixel detector of the Compact Muon Solenoid experiment consists of three barrel layers and two disks for each endcap. The detector was installed in summer 2008, commissioned with charge injections, and operated in the 3.8 T magnetic field during cosmic ray data taking. This paper reports on the first running experience and presents results on the pixel tracker performance, which are found to be in line with the design specifications of this detector. The transverse impact parameter resolution measured in a sample of high momentum muons is 18 microns.This work is supported by FMSR (Austria); FNRS and FWO (Belgium); CNPq, CAPES, FAPERJ, and FAPESP (Brazil); MES (Bulgaria); CERN; CAS, MoST, and NSFC (China); COLCIENCIAS (Colombia); MSES (Croatia); RPF (Cyprus); Academy of Sciences and NICPB (Estonia); Academy of Finland, ME, and HIP (Finland); CEA and CNRS/IN2P3 (France); BMBF, DFG, and HGF (Germany); GSRT (Greece); OTKA and NKTH (Hungary); DAE and DST (India); IPM (Iran); SFI (Ireland); INFN (Italy); NRF (Korea); LAS (Lithuania); CINVESTAV, CONACYT, SEP, and UASLP-FAI (Mexico); PAEC (Pakistan); SCSR (Poland); FCT (Portugal); JINR (Armenia, Belarus, Georgia, Ukraine, Uzbekistan); MST and MAE (Russia); MSTDS (Serbia); MICINN and CPAN (Spain); Swiss Funding Agencies (Switzerland); NSC (Taipei); TUBITAK and TAEK (Turkey); STFC (United Kingdom); DOE and NSF (USA)

    Performance of the CMS drift-tube chamber local trigger with cosmic rays

    Get PDF
    The performance of the Local Trigger based on the drift-tube system of the CMS experiment has been studied using muons from cosmic ray events collected during the commissioning of the detector in 2008. The properties of the system are extensively tested and compared with the simulation. The effect of the random arrival time of the cosmic rays on the trigger performance is reported, and the results are compared with the design expectations for proton-proton collisions and with previous measurements obtained with muon beams
    corecore