56 research outputs found

    Inverse Classification for Comparison-based Interpretability in Machine Learning

    Full text link
    In the context of post-hoc interpretability, this paper addresses the task of explaining the prediction of a classifier, considering the case where no information is available, neither on the classifier itself, nor on the processed data (neither the training nor the test data). It proposes an instance-based approach whose principle consists in determining the minimal changes needed to alter a prediction: given a data point whose classification must be explained, the proposed method consists in identifying a close neighbour classified differently, where the closeness definition integrates a sparsity constraint. This principle is implemented using observation generation in the Growing Spheres algorithm. Experimental results on two datasets illustrate the relevance of the proposed approach that can be used to gain knowledge about the classifier.Comment: preprin

    Blood neutrophil counts are associated with exacerbation frequency and mortality in COPD

    Get PDF
    BACKGROUND: Identifying patients with COPD at increased risk of poor outcomes is challenging due to disease heterogeneity. Potential biomarkers need to be readily available in real-life clinical practice. Blood eosinophil counts are widely studied but few studies have examined the prognostic value of blood neutrophil counts (BNC). METHODS: In a large population-based COPD registry in the East of Scotland (TARDIS: Tayside Allergic and Respiratory Disease Information System), BNC were compared to measures of disease severity and mortality for up to 15 years follow-up. Potential mechanisms of disease modification by BNC were explored in a nested microbiome substudy. RESULTS: 178,120 neutrophil counts were obtained from 7220 people (mean follow up 9 years) during stable disease periods. Median BNC was 5200cells/μL (IQR 4000-7000cells/μL). Mortality rates among the 34% of patients with elevated BNCs (defined as 6000-15000cells/μL) at the study start were 80% higher (14.0/100 person years v 7.8/100py, P < 0.001) than those with BNC in the normal range (2000-6000cells/μL). People with elevated BNC were more likely to be classified as GOLD D (46% v 33% P < 0.001), have more exacerbations (mean 2.3 v 1.3/year, P < 0.001), and were more likely to have severe exacerbations (13% vs. 5%, P < 0.001) in the following year. Eosinophil counts were much less predictive of these outcomes. In a sub-cohort (N = 276), patients with elevated BNC had increased relative abundance of Proteobacteria and reduced microbiome diversity. CONCLUSIONS: High BNC may provide a useful indicator of risk of exacerbations and mortality in COPD patients

    Non-parametric class completeness estimators for collaborative knowledge graphs — the case of wikidata

    Get PDF
    Collaborative Knowledge Graph platforms allow humans and automated scripts to collaborate in creating, updating and interlinking entities and facts. To ensure both the completeness of the data as well as a uniform coverage of the different topics, it is crucial to identify underrepresented classes in the Knowledge Graph. In this paper, we tackle this problem by developing statistical techniques for class cardinality estimation in collaborative Knowledge Graph platforms. Our method is able to estimate the completeness of a class—as defined by a schema or ontology—hence can be used to answer questions such as “Does the knowledge base have a complete list of all {Beer Brands—Volcanos—Video Game Consoles}?” As a use-case, we focus on Wikidata, which poses unique challenges in terms of the size of its ontology, the number of users actively populating its graph, and its extremely dynamic nature. Our techniques are derived from species estimation and data-management methodologies, and are applied to the case of graphs and collaborative editing. In our empirical evaluation, we observe that i) the number and frequency of unique class instances drastically influence the performance of an estimator, ii) bursts of inserts cause some estimators to overestimate the true size of the class if they are not properly handled, and iii) one can effectively measure the convergence of a class towards its true size by considering the stability of an estimator against the number of available instances

    ITALIAN CANCER FIGURES - REPORT 2015: The burden of rare cancers in Italy = I TUMORI IN ITALIA - RAPPORTO 2015: I tumori rari in Italia

    Get PDF
    OBJECTIVES: This collaborative study, based on data collected by the network of Italian Cancer Registries (AIRTUM), describes the burden of rare cancers in Italy. Estimated number of new rare cancer cases yearly diagnosed (incidence), proportion of patients alive after diagnosis (survival), and estimated number of people still alive after a new cancer diagnosis (prevalence) are provided for about 200 different cancer entities. MATERIALS AND METHODS: Data herein presented were provided by AIRTUM population- based cancer registries (CRs), covering nowadays 52% of the Italian population. This monograph uses the AIRTUM database (January 2015), which includes all malignant cancer cases diagnosed between 1976 and 2010. All cases are coded according to the International Classification of Diseases for Oncology (ICD-O-3). Data underwent standard quality checks (described in the AIRTUM data management protocol) and were checked against rare-cancer specific quality indicators proposed and published by RARECARE and HAEMACARE (www.rarecarenet.eu; www.haemacare.eu). The definition and list of rare cancers proposed by the RARECAREnet "Information Network on Rare Cancers" project were adopted: rare cancers are entities (defined as a combination of topographical and morphological codes of the ICD-O-3) having an incidence rate of less than 6 per 100,000 per year in the European population. This monograph presents 198 rare cancers grouped in 14 major groups. Crude incidence rates were estimated as the number of all new cancers occurring in 2000-2010 divided by the overall population at risk, for males and females (also for gender-specific tumours).The proportion of rare cancers out of the total cancers (rare and common) by site was also calculated. Incidence rates by sex and age are reported. The expected number of new cases in 2015 in Italy was estimated assuming the incidence in Italy to be the same as in the AIRTUM area. One- and 5-year relative survival estimates of cases aged 0-99 years diagnosed between 2000 and 2008 in the AIRTUM database, and followed up to 31 December 2009, were calculated using complete cohort survival analysis. To estimate the observed prevalence in Italy, incidence and follow-up data from 11 CRs for the period 1992-2006 were used, with a prevalence index date of 1 January 2007. Observed prevalence in the general population was disentangled by time prior to the reference date (≤2 years, 2-5 years, ≤15 years). To calculate the complete prevalence proportion at 1 January 2007 in Italy, the 15-year observed prevalence was corrected by the completeness index, in order to account for those cancer survivors diagnosed before the cancer registry activity started. The completeness index by cancer and age was obtained by means of statistical regression models, using incidence and survival data available in the European RARECAREnet data. RESULTS: In total, 339,403 tumours were included in the incidence analysis. The annual incidence rate (IR) of all 198 rare cancers in the period 2000-2010 was 147 per 100,000 per year, corresponding to about 89,000 new diagnoses in Italy each year, accounting for 25% of all cancer. Five cancers, rare at European level, were not rare in Italy because their IR was higher than 6 per 100,000; these tumours were: diffuse large B-cell lymphoma and squamous cell carcinoma of larynx (whose IRs in Italy were 7 per 100,000), multiple myeloma (IR: 8 per 100,000), hepatocellular carcinoma (IR: 9 per 100,000) and carcinoma of thyroid gland (IR: 14 per 100,000). Among the remaining 193 rare cancers, more than two thirds (No. 139) had an annual IR <0.5 per 100,000, accounting for about 7,100 new cancers cases; for 25 cancer types, the IR ranged between 0.5 and 1 per 100,000, accounting for about 10,000 new diagnoses; while for 29 cancer types the IR was between 1 and 6 per 100,000, accounting for about 41,000 new cancer cases. Among all rare cancers diagnosed in Italy, 7% were rare haematological diseases (IR: 41 per 100,000), 18% were solid rare cancers. Among the latter, the rare epithelial tumours of the digestive system were the most common (23%, IR: 26 per 100,000), followed by epithelial tumours of head and neck (17%, IR: 19) and rare cancers of the female genital system (17%, IR: 17), endocrine tumours (13% including thyroid carcinomas and less than 1% with an IR of 0.4 excluding thyroid carcinomas), sarcomas (8%, IR: 9 per 100,000), central nervous system tumours and rare epithelial tumours of the thoracic cavity (5%with an IR equal to 6 and 5 per 100,000, respectively). The remaining (rare male genital tumours, IR: 4 per 100,000; tumours of eye, IR: 0.7 per 100,000; neuroendocrine tumours, IR: 4 per 100,000; embryonal tumours, IR: 0.4 per 100,000; rare skin tumours and malignant melanoma of mucosae, IR: 0.8 per 100,000) each constituted <4% of all solid rare cancers. Patients with rare cancers were on average younger than those with common cancers. Essentially, all childhood cancers were rare, while after age 40 years, the common cancers (breast, prostate, colon, rectum, and lung) became increasingly more frequent. For 254,821 rare cancers diagnosed in 2000-2008, 5-year RS was on average 55%, lower than the corresponding figures for patients with common cancers (68%). RS was lower for rare cancers than for common cancers at 1 year and continued to diverge up to 3 years, while the gap remained constant from 3 to 5 years after diagnosis. For rare and common cancers, survival decreased with increasing age. Five-year RS was similar and high for both rare and common cancers up to 54 years; it decreased with age, especially after 54 years, with the elderly (75+ years) having a 37% and 20% lower survival than those aged 55-64 years for rare and common cancers, respectively. We estimated that about 900,000 people were alive in Italy with a previous diagnosis of a rare cancer in 2010 (prevalence). The highest prevalence was observed for rare haematological diseases (278 per 100,000) and rare tumours of the female genital system (265 per 100,000). Very low prevalence (<10 prt 100,000) was observed for rare epithelial skin cancers, for rare epithelial tumours of the digestive system and rare epithelial tumours of the thoracic cavity. COMMENTS: One in four cancers cases diagnosed in Italy is a rare cancer, in agreement with estimates of 24% calculated in Europe overall. In Italy, the group of all rare cancers combined, include 5 cancer types with an IR>6 per 100,000 in Italy, in particular thyroid cancer (IR: 14 per 100,000).The exclusion of thyroid carcinoma from rare cancers reduces the proportion of them in Italy in 2010 to 22%. Differences in incidence across population can be due to the different distribution of risk factors (whether environmental, lifestyle, occupational, or genetic), heterogeneous diagnostic intensity activity, as well as different diagnostic capacity; moreover heterogeneity in accuracy of registration may determine some minor differences in the account of rare cancers. Rare cancers had worse prognosis than common cancers at 1, 3, and 5 years from diagnosis. Differences between rare and common cancers were small 1 year after diagnosis, but survival for rare cancers declined more markedly thereafter, consistent with the idea that treatments for rare cancers are less effective than those for common cancers. However, differences in stage at diagnosis could not be excluded, as 1- and 3-year RS for rare cancers was lower than the corresponding figures for common cancers. Moreover, rare cancers include many cancer entities with a bad prognosis (5-year RS <50%): cancer of head and neck, oesophagus, small intestine, ovary, brain, biliary tract, liver, pleura, multiple myeloma, acute myeloid and lymphatic leukaemia; in contrast, most common cancer cases are breast, prostate, and colorectal cancers, which have a good prognosis. The high prevalence observed for rare haematological diseases and rare tumours of the female genital system is due to their high incidence (the majority of haematological diseases are rare and gynaecological cancers added up to fairly high incidence rates) and relatively good prognosis. The low prevalence of rare epithelial tumours of the digestive system was due to the low survival rates of the majority of tumours included in this group (oesophagus, stomach, small intestine, pancreas, and liver), regardless of the high incidence rate of rare epithelial cancers of these sites. This AIRTUM study confirms that rare cancers are a major public health problem in Italy and provides quantitative estimations, for the first time in Italy, to a problem long known to exist. This monograph provides detailed epidemiologic indicators for almost 200 rare cancers, the majority of which (72%) are very rare (IR<0.5 per 100,000). These data are of major interest for different stakeholders. Health care planners can find useful information herein to properly plan and think of how to reorganise health care services. Researchers now have numbers to design clinical trials considering alternative study designs and statistical approaches. Population-based cancer registries with good quality data are the best source of information to describe the rare cancer burden in a population

    Projected sensitivity of the LUX-ZEPLIN experiment to the two-neutrino and neutrinoless double beta decays of Xe-134

    Get PDF

    Consensus statement on abusive head trauma in infants and young children

    Get PDF
    Abusive head trauma (AHT) is the leading cause of fatal head injuries in children younger than 2 years. A multidisciplinary team bases this diagnosis on history, physical examination, imaging and laboratory findings. Because the etiology of the injury is multifactorial (shaking, shaking and impact, impact, etc.) the current best and inclusive term is AHT. There is no controversy concerning the medical validity of the existence of AHT, with multiple components including subdural hematoma, intracranial and spinal changes, complex retinal hemorrhages, and rib and other fractures that are inconsistent with the provided mechanism of trauma. The workup must exclude medical diseases that can mimic AHT. However, the courtroom has become a forum for speculative theories that cannot be reconciled with generally accepted medical literature. There is no reliable medical evidence that the following processes are causative in the constellation of injuries of AHT: cerebral sinovenous thrombosis, hypoxic-ischemic injury, lumbar puncture or dysphagic choking/vomiting. There is no substantiation, at a time remote from birth, that an asymptomatic birth-related subdural hemorrhage can result in rebleeding and sudden collapse. Further, a diagnosis of AHT is a medical conclusion, not a legal determination of the intent of the perpetrator or a diagnosis of murder. We hope that this consensus document reduces confusion by recommending to judges and jurors the tools necessary to distinguish genuine evidence-based opinions of the relevant medical community from legal arguments or etiological speculations that are unwarranted by the clinical findings, medical evidence and evidence-based literature

    Projected sensitivities of the LUX-ZEPLIN (LZ) experiment to new physics via low-energy electron recoils

    Get PDF

    A next-generation liquid xenon observatory for dark matter and neutrino physics

    Get PDF
    The nature of dark matter and properties of neutrinos are among the most pressing issues in contemporary particle physics. The dual-phase xenon time-projection chamber is the leading technology to cover the available parameter space for weakly interacting massive particles, while featuring extensive sensitivity to many alternative dark matter candidates. These detectors can also study neutrinos through neutrinoless double-beta decay and through a variety of astrophysical sources. A next-generation xenon-based detector will therefore be a true multi-purpose observatory to significantly advance particle physics, nuclear physics, astrophysics, solar physics, and cosmology. This review article presents the science cases for such a detector

    Projected sensitivity of the LUX-ZEPLIN (LZ) experiment to the two-neutrino and neutrinoless double beta decays of 134Xe

    Get PDF

    Cosmogenic production of 37Ar in the context of the LUX-ZEPLIN experiment

    Get PDF
    We estimate the amount of 37Ar produced in natural xenon via cosmic ray-induced spallation, an inevitable consequence of the transportation and storage of xenon on the Earth's surface. We then calculate the resulting 37Ar concentration in a 10-tonne payload~(similar to that of the LUX-ZEPLIN experiment) assuming a representative schedule of xenon purification, storage and delivery to the underground facility. Using the spallation model by Silberberg and Tsao, the sea level production rate of 37Ar in natural xenon is estimated to be 0.024~atoms/kg/day. Assuming the xenon is successively purified to remove radioactive contaminants in 1-tonne batches at a rate of 1~tonne/month, the average 37Ar activity after 10~tonnes are purified and transported underground is 0.058--0.090~ÎĽBq/kg, depending on the degree of argon removal during above-ground purification. Such cosmogenic 37Ar will appear as a noticeable background in the early science data, while decaying with a 35~day half-life. This newly-noticed production mechanism of 37Ar should be considered when planning for future liquid xenon-based experiments
    • …
    corecore