63 research outputs found

    Comparing community structure identification

    Full text link
    We compare recent approaches to community structure identification in terms of sensitivity and computational cost. The recently proposed modularity measure is revisited and the performance of the methods as applied to ad hoc networks with known community structure, is compared. We find that the most accurate methods tend to be more computationally expensive, and that both aspects need to be considered when choosing a method for practical purposes. The work is intended as an introduction as well as a proposal for a standard benchmark test of community detection methods.Comment: 10 pages, 3 figures, 1 table. v2: condensed, updated version as appears in JSTA

    Clustering gene expression data with a penalized graph-based metric

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The search for cluster structure in microarray datasets is a base problem for the so-called "-omic sciences". A difficult problem in clustering is how to handle data with a manifold structure, i.e. data that is not shaped in the form of compact clouds of points, forming arbitrary shapes or paths embedded in a high-dimensional space, as could be the case of some gene expression datasets.</p> <p>Results</p> <p>In this work we introduce the Penalized k-Nearest-Neighbor-Graph (PKNNG) based metric, a new tool for evaluating distances in such cases. The new metric can be used in combination with most clustering algorithms. The PKNNG metric is based on a two-step procedure: first it constructs the k-Nearest-Neighbor-Graph of the dataset of interest using a low k-value and then it adds edges with a highly penalized weight for connecting the subgraphs produced by the first step. We discuss several possible schemes for connecting the different sub-graphs as well as penalization functions. We show clustering results on several public gene expression datasets and simulated artificial problems to evaluate the behavior of the new metric.</p> <p>Conclusions</p> <p>In all cases the PKNNG metric shows promising clustering results. The use of the PKNNG metric can improve the performance of commonly used pairwise-distance based clustering methods, to the level of more advanced algorithms. A great advantage of the new procedure is that researchers do not need to learn a new method, they can simply compute distances with the PKNNG metric and then, for example, use hierarchical clustering to produce an accurate and highly interpretable dendrogram of their high-dimensional data.</p

    LSST: from Science Drivers to Reference Design and Anticipated Data Products

    Get PDF
    (Abridged) We describe here the most ambitious survey currently planned in the optical, the Large Synoptic Survey Telescope (LSST). A vast array of science will be enabled by a single wide-deep-fast sky survey, and LSST will have unique survey capability in the faint time domain. The LSST design is driven by four main science themes: probing dark energy and dark matter, taking an inventory of the Solar System, exploring the transient optical sky, and mapping the Milky Way. LSST will be a wide-field ground-based system sited at Cerro Pach\'{o}n in northern Chile. The telescope will have an 8.4 m (6.5 m effective) primary mirror, a 9.6 deg2^2 field of view, and a 3.2 Gigapixel camera. The standard observing sequence will consist of pairs of 15-second exposures in a given field, with two such visits in each pointing in a given night. With these repeats, the LSST system is capable of imaging about 10,000 square degrees of sky in a single filter in three nights. The typical 5σ\sigma point-source depth in a single visit in rr will be 24.5\sim 24.5 (AB). The project is in the construction phase and will begin regular survey operations by 2022. The survey area will be contained within 30,000 deg2^2 with δ<+34.5\delta<+34.5^\circ, and will be imaged multiple times in six bands, ugrizyugrizy, covering the wavelength range 320--1050 nm. About 90\% of the observing time will be devoted to a deep-wide-fast survey mode which will uniformly observe a 18,000 deg2^2 region about 800 times (summed over all six bands) during the anticipated 10 years of operations, and yield a coadded map to r27.5r\sim27.5. The remaining 10\% of the observing time will be allocated to projects such as a Very Deep and Fast time domain survey. The goal is to make LSST data products, including a relational database of about 32 trillion observations of 40 billion objects, available to the public and scientists around the world.Comment: 57 pages, 32 color figures, version with high-resolution figures available from https://www.lsst.org/overvie

    The Changing Landscape for Stroke\ua0Prevention in AF: Findings From the GLORIA-AF Registry Phase 2

    Get PDF
    Background GLORIA-AF (Global Registry on Long-Term Oral Antithrombotic Treatment in Patients with Atrial Fibrillation) is a prospective, global registry program describing antithrombotic treatment patterns in patients with newly diagnosed nonvalvular atrial fibrillation at risk of stroke. Phase 2 began when dabigatran, the first non\u2013vitamin K antagonist oral anticoagulant (NOAC), became available. Objectives This study sought to describe phase 2 baseline data and compare these with the pre-NOAC era collected during phase&nbsp;1. Methods During phase 2, 15,641 consenting patients were enrolled (November 2011 to December 2014); 15,092 were eligible. This pre-specified cross-sectional analysis describes eligible patients\u2019 baseline characteristics. Atrial fibrillation&nbsp;disease characteristics, medical outcomes, and concomitant diseases and medications were collected. Data were analyzed using descriptive statistics. Results Of the total patients, 45.5% were female; median age was 71 (interquartile range: 64, 78) years. Patients were from Europe (47.1%), North America (22.5%), Asia (20.3%), Latin America (6.0%), and the Middle East/Africa (4.0%). Most had high stroke risk (CHA2DS2-VASc [Congestive heart failure, Hypertension, Age&nbsp; 6575 years, Diabetes mellitus, previous Stroke, Vascular disease, Age 65 to 74 years, Sex category] score&nbsp; 652; 86.1%); 13.9% had moderate risk (CHA2DS2-VASc&nbsp;= 1). Overall, 79.9% received oral anticoagulants, of whom 47.6% received NOAC and 32.3% vitamin K antagonists (VKA); 12.1% received antiplatelet agents; 7.8% received no antithrombotic treatment. For comparison, the proportion of phase 1 patients (of N&nbsp;= 1,063 all eligible) prescribed VKA was 32.8%, acetylsalicylic acid 41.7%, and no therapy 20.2%. In Europe in phase 2, treatment with NOAC was more common than VKA (52.3% and 37.8%, respectively); 6.0% of patients received antiplatelet treatment; and 3.8% received no antithrombotic treatment. In North America, 52.1%, 26.2%, and 14.0% of patients received NOAC, VKA, and antiplatelet drugs, respectively; 7.5% received no antithrombotic treatment. NOAC use was less common in Asia (27.7%), where 27.5% of patients received VKA, 25.0% antiplatelet drugs, and 19.8% no antithrombotic treatment. Conclusions The baseline data from GLORIA-AF phase 2 demonstrate that in newly diagnosed nonvalvular atrial fibrillation patients, NOAC have been highly adopted into practice, becoming more frequently prescribed than VKA in&nbsp;Europe and North America. Worldwide, however, a large proportion of patients remain undertreated, particularly in&nbsp;Asia&nbsp;and North America. (Global Registry on Long-Term Oral Antithrombotic Treatment in Patients With Atrial Fibrillation [GLORIA-AF]; NCT01468701

    The evolving SARS-CoV-2 epidemic in Africa: Insights from rapidly expanding genomic surveillance

    Get PDF
    INTRODUCTION Investment in Africa over the past year with regard to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing has led to a massive increase in the number of sequences, which, to date, exceeds 100,000 sequences generated to track the pandemic on the continent. These sequences have profoundly affected how public health officials in Africa have navigated the COVID-19 pandemic. RATIONALE We demonstrate how the first 100,000 SARS-CoV-2 sequences from Africa have helped monitor the epidemic on the continent, how genomic surveillance expanded over the course of the pandemic, and how we adapted our sequencing methods to deal with an evolving virus. Finally, we also examine how viral lineages have spread across the continent in a phylogeographic framework to gain insights into the underlying temporal and spatial transmission dynamics for several variants of concern (VOCs). RESULTS Our results indicate that the number of countries in Africa that can sequence the virus within their own borders is growing and that this is coupled with a shorter turnaround time from the time of sampling to sequence submission. Ongoing evolution necessitated the continual updating of primer sets, and, as a result, eight primer sets were designed in tandem with viral evolution and used to ensure effective sequencing of the virus. The pandemic unfolded through multiple waves of infection that were each driven by distinct genetic lineages, with B.1-like ancestral strains associated with the first pandemic wave of infections in 2020. Successive waves on the continent were fueled by different VOCs, with Alpha and Beta cocirculating in distinct spatial patterns during the second wave and Delta and Omicron affecting the whole continent during the third and fourth waves, respectively. Phylogeographic reconstruction points toward distinct differences in viral importation and exportation patterns associated with the Alpha, Beta, Delta, and Omicron variants and subvariants, when considering both Africa versus the rest of the world and viral dissemination within the continent. Our epidemiological and phylogenetic inferences therefore underscore the heterogeneous nature of the pandemic on the continent and highlight key insights and challenges, for instance, recognizing the limitations of low testing proportions. We also highlight the early warning capacity that genomic surveillance in Africa has had for the rest of the world with the detection of new lineages and variants, the most recent being the characterization of various Omicron subvariants. CONCLUSION Sustained investment for diagnostics and genomic surveillance in Africa is needed as the virus continues to evolve. This is important not only to help combat SARS-CoV-2 on the continent but also because it can be used as a platform to help address the many emerging and reemerging infectious disease threats in Africa. In particular, capacity building for local sequencing within countries or within the continent should be prioritized because this is generally associated with shorter turnaround times, providing the most benefit to local public health authorities tasked with pandemic response and mitigation and allowing for the fastest reaction to localized outbreaks. These investments are crucial for pandemic preparedness and response and will serve the health of the continent well into the 21st century

    Global, regional, and national progress towards Sustainable Development Goal 3.2 for neonatal and child health: all-cause and cause-specific mortality findings from the Global Burden of Disease Study 2019

    Get PDF
    Background Sustainable Development Goal 3.2 has targeted elimination of preventable child mortality, reduction of neonatal death to less than 12 per 1000 livebirths, and reduction of death of children younger than 5 years to less than 25 per 1000 livebirths, for each country by 2030. To understand current rates, recent trends, and potential trajectories of child mortality for the next decade, we present the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2019 findings for all-cause mortality and cause-specific mortality in children younger than 5 years of age, with multiple scenarios for child mortality in 2030 that include the consideration of potential effects of COVID-19, and a novel framework for quantifying optimal child survival. Methods We completed all-cause mortality and cause-specific mortality analyses from 204 countries and territories for detailed age groups separately, with aggregated mortality probabilities per 1000 livebirths computed for neonatal mortality rate (NMR) and under-5 mortality rate (USMR). Scenarios for 2030 represent different potential trajectories, notably including potential effects of the COVID-19 pandemic and the potential impact of improvements preferentially targeting neonatal survival. Optimal child survival metrics were developed by age, sex, and cause of death across all GBD location-years. The first metric is a global optimum and is based on the lowest observed mortality, and the second is a survival potential frontier that is based on stochastic frontier analysis of observed mortality and Healthcare Access and Quality Index. Findings Global U5MR decreased from 71.2 deaths per 1000 livebirths (95% uncertainty interval WI] 68.3-74-0) in 2000 to 37.1 (33.2-41.7) in 2019 while global NMR correspondingly declined more slowly from 28.0 deaths per 1000 live births (26.8-29-5) in 2000 to 17.9 (16.3-19-8) in 2019. In 2019,136 (67%) of 204 countries had a USMR at or below the SDG 3.2 threshold and 133 (65%) had an NMR at or below the SDG 3.2 threshold, and the reference scenario suggests that by 2030,154 (75%) of all countries could meet the U5MR targets, and 139 (68%) could meet the NMR targets. Deaths of children younger than 5 years totalled 9.65 million (95% UI 9.05-10.30) in 2000 and 5.05 million (4.27-6.02) in 2019, with the neonatal fraction of these deaths increasing from 39% (3.76 million 95% UI 3.53-4.021) in 2000 to 48% (2.42 million; 2.06-2.86) in 2019. NMR and U5MR were generally higher in males than in females, although there was no statistically significant difference at the global level. Neonatal disorders remained the leading cause of death in children younger than 5 years in 2019, followed by lower respiratory infections, diarrhoeal diseases, congenital birth defects, and malaria. The global optimum analysis suggests NMR could be reduced to as low as 0.80 (95% UI 0.71-0.86) deaths per 1000 livebirths and U5MR to 1.44 (95% UI 1-27-1.58) deaths per 1000 livebirths, and in 2019, there were as many as 1.87 million (95% UI 1-35-2.58; 37% 95% UI 32-43]) of 5.05 million more deaths of children younger than 5 years than the survival potential frontier. Interpretation Global child mortality declined by almost half between 2000 and 2019, but progress remains slower in neonates and 65 (32%) of 204 countries, mostly in sub-Saharan Africa and south Asia, are not on track to meet either SDG 3.2 target by 2030. Focused improvements in perinatal and newborn care, continued and expanded delivery of essential interventions such as vaccination and infection prevention, an enhanced focus on equity, continued focus on poverty reduction and education, and investment in strengthening health systems across the development spectrum have the potential to substantially improve USMR. Given the widespread effects of COVID-19, considerable effort will be required to maintain and accelerate progress. Copyright (C) 2021 The Author(s). Published by Elsevier Ltd

    Mapping inequalities in exclusive breastfeeding in low- and middle-income countries, 2000–2018

    Get PDF
    Exclusive breastfeeding (EBF)—giving infants only breast-milk for the first 6 months of life—is a component of optimal breastfeeding practices effective in preventing child morbidity and mortality. EBF practices are known to vary by population and comparable subnational estimates of prevalence and progress across low- and middle-income countries (LMICs) are required for planning policy and interventions. Here we present a geospatial analysis of EBF prevalence estimates from 2000 to 2018 across 94 LMICs mapped to policy-relevant administrative units (for example, districts), quantify subnational inequalities and their changes over time, and estimate probabilities of meeting the World Health Organization’s Global Nutrition Target (WHO GNT) of ≥70% EBF prevalence by 2030. While six LMICs are projected to meet the WHO GNT of ≥70% EBF prevalence at a national scale, only three are predicted to meet the target in all their district-level units by 2030
    corecore