69 research outputs found

    Unconventional machine learning of genome-wide human cancer data

    Recent advances in high-throughput genomic technologies coupled with exponential increases in computer processing and memory have allowed us to interrogate the complex aberrant molecular underpinnings of human disease from a genome-wide perspective. While the deluge of genomic information is expected to increase, a bottleneck in conventional high-performance computing is rapidly approaching. Inspired in part by recent advances in physical quantum processors, we evaluated several unconventional machine learning (ML) strategies on actual human tumor data. Here we show for the first time the efficacy of multiple annealing-based ML algorithms for classification of high-dimensional, multi-omics human cancer data from the Cancer Genome Atlas. To assess algorithm performance, we compared these classifiers to a variety of standard ML methods. Our results indicate the feasibility of using annealing-based ML to provide competitive classification of human cancer types and associated molecular subtypes and superior performance with smaller training datasets, thus providing compelling empirical evidence for the potential future application of unconventional computing architectures in the biomedical sciences.

    Cholesterol-Independent SREBP-1 Maturation Is Linked to ARF1 Inactivation

    Lipogenesis requires coordinated expression of genes for fatty acid, phospholipid, and triglyceride synthesis. Transcription factors, such as SREBP-1 (Sterol regulatory element binding protein), may be activated in response to feedback mechanisms linking gene activation to levels of metabolites in the pathways. SREBPs can be regulated in response to membrane cholesterol and we also found that low levels of phosphatidylcholine (a methylated phospholipid) led to SBP-1/SREBP-1 maturation in C. elegans or mammalian models. To identify additional regulatory components, we performed a targeted RNAi screen in C. elegans, finding that both lpin-1/Lipin 1 (which converts phosphatidic acid to diacylglycerol) and arf-1.2/ARF1 (a GTPase regulating Golgi function) were important for low-PC activation of SBP-1/SREBP-1. Mechanistically linking the major hits of our screen, we find that limiting PC synthesis or LPIN1 knockdown in mammalian cells reduces the levels of active GTP-bound ARF1. Thus, changes in distinct lipid ratios may converge on ARF1 to increase SBP-1/SREBP-1 activity.

    Endothelial Mitogen-Activated Protein Kinase Kinase Kinase Kinase 4 Is Critical for Lymphatic Vascular Development and Function

    The molecular mechanisms underlying lymphatic vascular development and function are not well understood. Recent studies have suggested a role for endothelial cell (EC) mitogen-activated protein kinase kinase kinase kinase 4 (Map4k4) in developmental angiogenesis and atherosclerosis. Here, we show that constitutive loss of EC Map4k4 in mice causes postnatal lethality due to chylothorax, suggesting that Map4k4 is required for normal lymphatic vascular function. Mice constitutively lacking EC Map4k4 displayed dilated lymphatic capillaries, insufficient lymphatic valves, and impaired lymphatic flow; furthermore, primary ECs derived from these animals displayed enhanced proliferation compared with controls. Yeast 2-hybrid analyses identified the Ras GTPase-activating protein Rasa1, a known regulator of lymphatic development and lymphatic endothelial cell fate, as a direct interacting partner for Map4k4. Map4k4 silencing in ECs enhanced basal Ras and extracellular signal-regulated kinase (Erk) activities, and primary ECs lacking Map4k4 displayed enhanced lymphatic EC marker expression. Taken together, these results reveal that EC Map4k4 is critical for lymphatic vascular development by regulating EC quiescence and lymphatic EC fate.

    Genome Evolution and Innovation across the Four Major Lineages of Cryptococcus gattii

    We acknowledge the Broad Institute Sequencing Platform and Imperial College London for generating the DNA sequence described here (and R265 Illumina sequences described previously [4]). We thank Sinéad Chapman for coordinating sequencing at the Broad Institute and Margaret Priest for assistance in submitting assemblies to NCBI. This project was supported by the National Human Genome Research Institute, grant no. U54HG003067. R.A.F. is supported by the Wellcome Trust. R.C.M. is supported by the Lister Institute for Preventive Medicine, the Medical Research Council UK, and the European Research Council.

    Structure of the germline genome of Tetrahymena thermophila and relationship to the massively rearranged somatic genome

    The germline genome of the binucleated ciliate Tetrahymena thermophila undergoes programmed chromosome breakage and massive DNA elimination to generate the somatic genome. Here, we present a complete sequence assembly of the germline genome and analyze multiple features of its structure and its relationship to the somatic genome, shedding light on the mechanisms of genome rearrangement as well as the evolutionary history of this remarkable germline/soma differentiation. Our results strengthen the notion that a complex, dynamic, and ongoing interplay between mobile DNA elements and the host genome has shaped Tetrahymena chromosome structure, locally and globally. Non-standard outcomes of rearrangement events, including the generation of short-lived somatic chromosomes and excision of DNA interrupting protein-coding regions, may represent novel forms of developmental gene regulation. We also compare Tetrahymena's germline/soma differentiation to that of other characterized ciliates, illustrating the wide diversity of adaptations that have occurred within this phylum.

    The Dynamic Genome and Transcriptome of the Human Fungal Pathogen Blastomyces and Close Relative Emmonsia

    Three closely related thermally dimorphic pathogens are causal agents of major fungal diseases affecting humans in the Americas: blastomycosis, histoplasmosis and paracoccidioidomycosis. Here we report the genome sequence and analysis of four strains of the etiological agent of blastomycosis, Blastomyces, and two species of the related genus Emmonsia, typically pathogens of small mammals. Compared to related species, Blastomyces genomes are highly expanded, with long, often sharply demarcated tracts of low GC-content sequence. These GC-poor isochore-like regions are enriched for gypsy elements, are variable in total size between isolates, and are least expanded in the avirulent B. dermatitidis strain ER-3 as compared with the virulent B. gilchristii strain SLH14081. The lack of similar regions in related species suggests these isochore-like regions originated recently in the ancestor of the Blastomyces lineage. While gene content is highly conserved between Blastomyces and related fungi, we identified changes in copy number of genes potentially involved in host interaction, including proteases and characterized antigens. In addition, we studied gene expression changes of B. dermatitidis during the interaction of the infectious yeast form with macrophages and in a mouse model. Both experiments highlight a strong antioxidant defense response in Blastomyces, and upregulation of dioxygenases in vivo suggests that dioxide produced by antioxidants may be further utilized for amino acid metabolism. We identify a number of functional categories upregulated exclusively in vivo, such as secreted proteins, zinc acquisition proteins, and cysteine and tryptophan metabolism, which may include critical virulence factors missed before in in vitro studies. Across the dimorphic fungi, loss of certain zinc acquisition genes and differences in amino acid metabolism suggest unique adaptations of Blastomyces to its host environment. 
These results reveal the dynamics of genome evolution and of factors contributing to virulence in Blastomyces.
Author Summary: Dimorphic fungal pathogens including Blastomyces are the cause of major fungal diseases in North and South America. The genus Emmonsia includes species infecting small mammals as well as a newly emerging pathogenic species recently reported in HIV-positive patients in South Africa. Here, we synthesize both genome sequencing of four isolates of Blastomyces and two species of Emmonsia as well as deep sequencing of Blastomyces RNA to draw major new insights into the evolution of this group and the pathogen response to infection. We investigate the trajectory of genome evolution of this group, characterizing the phylogenetic relationships of these species, a remarkable genome expansion that formed large isochore-like regions of low GC content in Blastomyces, and variation of gene content, related to host interaction, among the dimorphic fungal pathogens. Using RNA-Seq, we profile the response of Blastomyces to macrophage and mouse pulmonary infection, identifying key pathways and novel virulence factors. The identification of key fungal genes involved in adaptation to the host suggests targets for further study and therapeutic intervention in Blastomyces and related dimorphic fungal pathogens.

    Whole Genome Deep Sequencing of HIV-1 Reveals the Impact of Early Minor Variants Upon Immune Recognition During Acute Infection

    Deep sequencing technologies have the potential to transform the study of highly variable viral pathogens by providing a rapid and cost-effective approach to sensitively characterize rapidly evolving viral quasispecies. Here, we report on a high-throughput whole HIV-1 genome deep sequencing platform that combines 454 pyrosequencing with novel assembly and variant detection algorithms. In one subject we combined these genetic data with detailed immunological analyses to comprehensively evaluate viral evolution and immune escape during the acute phase of HIV-1 infection. The majority of early, low frequency mutations represented viral adaptation to host CD8+ T cell responses, evidence of strong immune selection pressure occurring during the early decline from peak viremia. CD8+ T cell responses capable of recognizing these low frequency escape variants coincided with the selection and evolution of more effective secondary HLA-anchor escape mutations. Frequent, and in some cases rapid, reversion of transmitted mutations was also observed across the viral genome. When located within restricted CD8 epitopes these low frequency reverting mutations were sufficient to prime de novo responses to these epitopes, again illustrating the capacity of the immune response to recognize and respond to low frequency variants. More importantly, rapid viral escape from the most immunodominant CD8+ T cell responses coincided with plateauing of the initial viral load decline in this subject, suggestive of a potential link between maintenance of effective, dominant CD8 responses and the degree of early viremia reduction. We conclude that the early control of HIV-1 replication by immunodominant CD8+ T cell responses may be substantially influenced by rapid, low frequency viral adaptations not detected by conventional sequencing approaches, which warrants further investigation. 
These data support the critical need for vaccine-induced CD8+ T cell responses to target more highly constrained regions of the virus in order to ensure the maintenance of immunodominant CD8 responses and the sustained decline of early viremia.

    Tracing Genetic Exchange and Biogeography of Cryptococcus neoformans var. grubii at the Global Population Level

    Cryptococcus neoformans var. grubii is the causative agent of cryptococcal meningitis, a significant source of mortality in immunocompromised individuals, typically human immunodeficiency virus/AIDS patients from developing countries. Despite the worldwide emergence of this ubiquitous infection, little is known about the global molecular epidemiology of this fungal pathogen. Here we sequence the genomes of 188 diverse isolates and characterize the major subdivisions, their relative diversity, and the level of genetic exchange between them. While most isolates of C. neoformans var. grubii belong to one of three major lineages (VNI, VNII, and VNB), some haploid isolates show hybrid ancestry including some that appear to have recently interbred, based on the detection of large blocks of each ancestry across each chromosome. Many isolates display evidence of aneuploidy, which was detected for all chromosomes. In diploid isolates of C. neoformans var. grubii (serotype AA) and of hybrids with C. neoformans var. neoformans (serotype AD) such aneuploidies have resulted in loss of heterozygosity, where a chromosomal region is represented by the genotype of only one parental isolate. Phylogenetic and population genomic analyses of isolates from Brazil reveal that the previously “African” VNB lineage occurs naturally in the South American environment. This suggests migration of the VNB lineage between Africa and South America prior to its diversification, supported by finding ancestral recombination events between isolates from different lineages and regions. The results provide evidence of substantial population structure, with all lineages showing multi-continental distributions, demonstrating the highly dispersive nature of this pathogen.