236 research outputs found

    Unsupervised machine learning framework for discriminating major variants of concern during COVID-19

    Full text link
    Due to high mutation rates, COVID-19 evolved rapidly, and several variants such as Alpha, Gamma, Delta, Beta, and Omicron emerged with altered viral properties like the severity of the disease caused, transmission rates, etc. These variants burdened the medical systems worldwide and created a massive impact on the world economy as each had to be studied and dealt with in its specific ways. Unsupervised machine learning methods have the ability to compress, characterize, and visualize unlabelled data. In this paper, we present a framework that utilizes unsupervised machine learning methods to discriminate and visualize the associations between major COVID-19 variants based on their genome sequences. These methods comprise a combination of selected dimensionality reduction and clustering techniques. The framework processes the RNA sequences by performing a k-mer analysis on the data and then compares the results from different dimensionality reduction methods including: Principal Component Analysis (PCA), t-Distributed Stochastic Neighbour Embedding (t-SNE), and Uniform Manifold Approximation Projection (UMAP). Our framework also employs agglomerative hierarchical clustering to visualize the mutational differences among major variants of concern and country-wise mutational differences for a particular variant (Delta and Omicron) using dendrograms. We also provide country-wise mutational differences for selected variants via dendrograms. We conclude that the proposed framework can effectively distinguish between the major variants and hence can be used for the identification of emerging variants in the future

    The Current State and Future of CRISPR-Cas9 gRNA Design Tools

    Get PDF
    Recent years have seen the development of computational tools to assist researchers in performing CRISPR-Cas9 experiment optimally. More specifically, these tools aim to maximize on-target activity (guide efficiency) while also minimizing potential off-target effects (guide specificity) by analyzing the features of the target site. Nonetheless, currently available tools cannot robustly predict experimental success as prediction accuracy depends on the approximations of the underlying model and how closely the experimental setup matches the data the model was trained on. Here, we present an overview of the available computational tools, their current limitations and future considerations. We discuss new trends around personalized health by taking genomic variants into account when predicting target sites as well as discussing other governing factors that can improve prediction accuracy

    Ecological influences on the behaviour and fertility of malaria parasites

    Get PDF
    BACKGROUND: Sexual reproduction in the mosquito is essential for the transmission of malaria parasites and a major target for transmission-blocking interventions. Male gametes need to locate and fertilize females in the challenging environment of the mosquito blood meal, but remarkably little is known about the ecology and behaviour of male gametes. METHODS: Here, a series of experiments explores how some aspects of the chemical and physical environment experienced during mating impacts upon the production, motility, and fertility of male gametes. RESULTS AND CONCLUSIONS: Specifically, the data confirm that: (a) rates of male gametogenesis vary when induced by the family of compounds (tryptophan metabolites) thought to trigger gamete differentiation in nature; and (b) complex relationships between gametogenesis and mating success exist across parasite species. In addition, the data reveal that (c) microparticles of the same size as red blood cells negatively affect mating success; and (d) instead of swimming in random directions, male gametes may be attracted by female gametes. Understanding the mating ecology of malaria parasites, may offer novel approaches for blocking transmission and explain adaptation to different species of mosquito vectors

    Association of Factor V Leiden with Subsequent Atherothrombotic Events:A GENIUS-CHD Study of Individual Participant Data

    Get PDF
    BACKGROUND: Studies examining the role of factor V Leiden among patients at higher risk of atherothrombotic events, such as those with established coronary heart disease (CHD), are lacking. Given that coagulation is involved in the thrombus formation stage on atherosclerotic plaque rupture, we hypothesized that factor V Leiden may be a stronger risk factor for atherothrombotic events in patients with established CHD. METHODS: We performed an individual-level meta-analysis including 25 prospective studies (18 cohorts, 3 case-cohorts, 4 randomized trials) from the GENIUS-CHD (Genetics of Subsequent Coronary Heart Disease) consortium involving patients with established CHD at baseline. Participating studies genotyped factor V Leiden status and shared risk estimates for the outcomes of interest using a centrally developed statistical code with harmonized definitions across studies. Cox proportional hazards regression models were used to obtain age- and sex-adjusted estimates. The obtained estimates were pooled using fixed-effect meta-analysis. The primary outcome was composite of myocardial infarction and CHD death. Secondary outcomes included any stroke, ischemic stroke, coronary revascularization, cardiovascular mortality, and all-cause mortality. RESULTS: The studies included 69 681 individuals of whom 3190 (4.6%) were either heterozygous or homozygous (n=47) carriers of factor V Leiden. Median follow-up per study ranged from 1.0 to 10.6 years. A total of 20 studies with 61 147 participants and 6849 events contributed to analyses of the primary outcome. Factor V Leiden was not associated with the combined outcome of myocardial infarction and CHD death (hazard ratio, 1.03 [95% CI, 0.92-1.16]; I2=28%; P-heterogeneity=0.12). Subgroup analysis according to baseline characteristics or strata of traditional cardiovascular risk factors did not show relevant differences. Similarly, risk estimates for the secondary outcomes including stroke, coronary revascularization, cardiovascular mortality, and all-cause mortality were also close to identity. CONCLUSIONS: Factor V Leiden was not associated with increased risk of subsequent atherothrombotic events and mortality in high-risk participants with established and treated CHD. Routine assessment of factor V Leiden status is unlikely to improve atherothrombotic events risk stratification in this population

    The Long-Baseline Neutrino Experiment: Exploring Fundamental Symmetries of the Universe

    Get PDF
    The preponderance of matter over antimatter in the early Universe, the dynamics of the supernova bursts that produced the heavy elements necessary for life and whether protons eventually decay --- these mysteries at the forefront of particle physics and astrophysics are key to understanding the early evolution of our Universe, its current state and its eventual fate. The Long-Baseline Neutrino Experiment (LBNE) represents an extensively developed plan for a world-class experiment dedicated to addressing these questions. LBNE is conceived around three central components: (1) a new, high-intensity neutrino source generated from a megawatt-class proton accelerator at Fermi National Accelerator Laboratory, (2) a near neutrino detector just downstream of the source, and (3) a massive liquid argon time-projection chamber deployed as a far detector deep underground at the Sanford Underground Research Facility. This facility, located at the site of the former Homestake Mine in Lead, South Dakota, is approximately 1,300 km from the neutrino source at Fermilab -- a distance (baseline) that delivers optimal sensitivity to neutrino charge-parity symmetry violation and mass ordering effects. This ambitious yet cost-effective design incorporates scalability and flexibility and can accommodate a variety of upgrades and contributions. With its exceptional combination of experimental configuration, technical capabilities, and potential for transformative discoveries, LBNE promises to be a vital facility for the field of particle physics worldwide, providing physicists from around the globe with opportunities to collaborate in a twenty to thirty year program of exciting science. In this document we provide a comprehensive overview of LBNE's scientific objectives, its place in the landscape of neutrino physics worldwide, the technologies it will incorporate and the capabilities it will possess.Comment: Major update of previous version. This is the reference document for LBNE science program and current status. Chapters 1, 3, and 9 provide a comprehensive overview of LBNE's scientific objectives, its place in the landscape of neutrino physics worldwide, the technologies it will incorporate and the capabilities it will possess. 288 pages, 116 figure

    Genomic, Pathway Network, and Immunologic Features Distinguishing Squamous Carcinomas

    Get PDF
    This integrated, multiplatform PanCancer Atlas study co-mapped and identified distinguishing molecular features of squamous cell carcinomas (SCCs) from five sites associated with smokin

    Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context

    Get PDF
    Long noncoding RNAs (lncRNAs) are commonly dys-regulated in tumors, but only a handful are known toplay pathophysiological roles in cancer. We inferredlncRNAs that dysregulate cancer pathways, onco-genes, and tumor suppressors (cancer genes) bymodeling their effects on the activity of transcriptionfactors, RNA-binding proteins, and microRNAs in5,185 TCGA tumors and 1,019 ENCODE assays.Our predictions included hundreds of candidateonco- and tumor-suppressor lncRNAs (cancerlncRNAs) whose somatic alterations account for thedysregulation of dozens of cancer genes and path-ways in each of 14 tumor contexts. To demonstrateproof of concept, we showed that perturbations tar-geting OIP5-AS1 (an inferred tumor suppressor) andTUG1 and WT1-AS (inferred onco-lncRNAs) dysre-gulated cancer genes and altered proliferation ofbreast and gynecologic cancer cells. Our analysis in-dicates that, although most lncRNAs are dysregu-lated in a tumor-specific manner, some, includingOIP5-AS1, TUG1, NEAT1, MEG3, and TSIX, synergis-tically dysregulate cancer pathways in multiple tumorcontexts

    Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas

    Get PDF
    Although theMYConcogene has been implicated incancer, a systematic assessment of alterations ofMYC, related transcription factors, and co-regulatoryproteins, forming the proximal MYC network (PMN),across human cancers is lacking. Using computa-tional approaches, we define genomic and proteo-mic features associated with MYC and the PMNacross the 33 cancers of The Cancer Genome Atlas.Pan-cancer, 28% of all samples had at least one ofthe MYC paralogs amplified. In contrast, the MYCantagonists MGA and MNT were the most frequentlymutated or deleted members, proposing a roleas tumor suppressors.MYCalterations were mutu-ally exclusive withPIK3CA,PTEN,APC,orBRAFalterations, suggesting that MYC is a distinct onco-genic driver. Expression analysis revealed MYC-associated pathways in tumor subtypes, such asimmune response and growth factor signaling; chro-matin, translation, and DNA replication/repair wereconserved pan-cancer. This analysis reveals insightsinto MYC biology and is a reference for biomarkersand therapeutics for cancers with alterations ofMYC or the PMN

    Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images

    Get PDF
    Beyond sample curation and basic pathologic characterization, the digitized H&E-stained images of TCGA samples remain underutilized. To highlight this resource, we present mappings of tumorinfiltrating lymphocytes (TILs) based on H&E images from 13 TCGA tumor types. These TIL maps are derived through computational staining using a convolutional neural network trained to classify patches of images. Affinity propagation revealed local spatial structure in TIL patterns and correlation with overall survival. TIL map structural patterns were grouped using standard histopathological parameters. These patterns are enriched in particular T cell subpopulations derived from molecular measures. TIL densities and spatial structure were differentially enriched among tumor types, immune subtypes, and tumor molecular subtypes, implying that spatial infiltrate state could reflect particular tumor cell aberration states. Obtaining spatial lymphocytic patterns linked to the rich genomic characterization of TCGA samples demonstrates one use for the TCGA image archives with insights into the tumor-immune microenvironment
    • …
    corecore