85 research outputs found

    Protein coding potential of retroviruses and other transposable elements in vertebrate genomes

    Get PDF
    We suggest an annotation strategy for genes encoded by retroviruses and transposable elements (RETRA genes) based on a set of marker protein domains. Usually RETRA genes are masked in vertebrate genomes prior to the application of automated gene prediction pipelines under the assumption that they provide no selective advantage to the host. Yet, we show that about 1000 genes in four vertebrate gene sets analyzed contain at least one RETRA gene marker domain. Using the conservation of genomic neighborhood (synteny), we were able to discriminate between RETRA genes with putative functionality in the vertebrates and those that probably function only in the context of mobile elements. We identified 35 such genes in human, along with their corresponding mouse and rat orthologs; which included almost all known human genes with similarity to mobile elements. The results also imply that the vast majority of the remaining RETRA genes in current gene sets are unlikely to encode vertebrate functions. To automatically annotate RETRA genes in other vertebrate genomes, we provide as a tool a set of marker protein domains and a manually refined list of domesticated or ancestral RETRA genes for rescuing genes with vertebrate functions

    Genome-wide association and HLA fine-mapping studies identify risk loci and genetic pathways underlying allergic rhinitis

    Get PDF
    Allergic rhinitis is the most common clinical presentation of allergy, affecting 400 million people worldwide, with increasing incidence in westernized countries1,2. To elucidate the genetic architecture and understand the underlying disease mechanisms, we carried out a meta-analysis of allergic rhinitis in 59,762 cases and 152,358 controls of European ancestry and identified a total of 41 risk loci for allergic rhinitis, including 20 loci not previously associated with allergic rhinitis, which were confirmed in a replication phase of 60,720 cases and 618,527 controls. Functional annotation implicated genes involved in various immune pathways, and fine mapping of the HLA region suggested amino acid variants important for antigen binding. We further performed genome-wide association study (GWAS) analyses of allergic sensitization against inhalant allergens and nonallergic rhinitis, which suggested shared genetic mechanisms across rhinitis-related traits. Future studies of the identified loci and genes might identify novel targets for treatment and prevention of allergic rhinitis

    Improving the immunogenicity of native-like HIV-1 envelope trimers by hyperstabilization

    Get PDF
    The production of native-like recombinant versions of the HIV-1 envelope glycoprotein (Env) trimer requires overcoming the natural flexibility and instability of the complex. The engineered BG505 SOSIP.664 trimer mimics the structure and antigenicity of native Env. Here, we describe how the introduction of new disulfide bonds between the glycoprotein (gp)120 and gp41 subunits of SOSIP trimers of the BG505 and other genotypes improves their stability and antigenicity, reduces their conformational flexibility, and helps maintain them in the unliganded conformation. The resulting next-generation SOSIP.v5 trimers induce strong autologous tier-2 neutralizing antibody (NAb) responses in rabbits. In addition, the BG505 SOSIP.v6 trimers induced weak heterologous NAb responses against a subset of tier-2 viruses that were not elicited by the prototype BG505 SOSIP.664. These stabilization methods can be applied to trimers from multiple genotypes as components of multivalent vaccines aimed at inducing broadly NAbs (bNAbs)

    Pan-cancer analysis of whole genomes identifies driver rearrangements promoted by LINE-1 retrotransposition

    Get PDF
    About half of all cancers have somatic integrations of retrotransposons. Here, to characterize their role in oncogenesis, we analyzed the patterns and mechanisms of somatic retrotransposition in 2,954 cancer genomes from 38 histological cancer subtypes within the framework of the Pan-Cancer Analysis of Whole Genomes (PCAWG) project. We identified 19,166 somatically acquired retrotransposition events, which affected 35% of samples and spanned a range of event types. Long interspersed nuclear element (LINE-1; L1 hereafter) insertions emerged as the first most frequent type of somatic structural variation in esophageal adenocarcinoma, and the second most frequent in head-and-neck and colorectal cancers. Aberrant L1 integrations can delete megabase-scale regions of a chromosome, which sometimes leads to the removal of tumor-suppressor genes, and can induce complex translocations and large-scale duplications. Somatic retrotranspositions can also initiate breakage–fusion–bridge cycles, leading to high-level amplification of oncogenes. These observations illuminate a relevant role of L1 retrotransposition in remodeling the cancer genome, with potential implications for the development of human tumors

    A comprehensive assessment of somatic mutation detection in cancer using whole-genome sequencing.

    Get PDF
    As whole-genome sequencing for cancer genome analysis becomes a clinical tool, a full understanding of the variables affecting sequencing analysis output is required. Here using tumour-normal sample pairs from two different types of cancer, chronic lymphocytic leukaemia and medulloblastoma, we conduct a benchmarking exercise within the context of the International Cancer Genome Consortium. We compare sequencing methods, analysis pipelines and validation methods. We show that using PCR-free methods and increasing sequencing depth to ∼ 100 × shows benefits, as long as the tumour:control coverage ratio remains balanced. We observe widely varying mutation call rates and low concordance among analysis pipelines, reflecting the artefact-prone nature of the raw data and lack of standards for dealing with the artefacts. However, we show that, using the benchmark mutation set we have created, many issues are in fact easy to remedy and have an immediate positive impact on mutation detection accuracy.We thank the DKFZ Genomics and Proteomics Core Facility and the OICR Genome Technologies Platform for provision of sequencing services. Financial support was provided by the consortium projects READNA under grant agreement FP7 Health-F4-2008-201418, ESGI under grant agreement 262055, GEUVADIS under grant agreement 261123 of the European Commission Framework Programme 7, ICGC-CLL through the Spanish Ministry of Science and Innovation (MICINN), the Instituto de Salud Carlos III (ISCIII) and the Generalitat de Catalunya. Additional financial support was provided by the PedBrain Tumor Project contributing to the International Cancer Genome Consortium, funded by German Cancer Aid (109252) and by the German Federal Ministry of Education and Research (BMBF, grants #01KU1201A, MedSys #0315416C and NGFNplus #01GS0883; the Ontario Institute for Cancer Research to PCB and JDM through funding provided by the Government of Ontario, Ministry of Research and Innovation; Genome Canada; the Canada Foundation for Innovation and Prostate Cancer Canada with funding from the Movember Foundation (PCB). PCB was also supported by a Terry Fox Research Institute New Investigator Award, a CIHR New Investigator Award and a Genome Canada Large-Scale Applied Project Contract. The Synergie Lyon Cancer platform has received support from the French National Institute of Cancer (INCa) and from the ABS4NGS ANR project (ANR-11-BINF-0001-06). The ICGC RIKEN study was supported partially by RIKEN President’s Fund 2011, and the supercomputing resource for the RIKEN study was provided by the Human Genome Center, University of Tokyo. MDE, LB, AGL and CLA were supported by Cancer Research UK, the University of Cambridge and Hutchison-Whampoa Limited. SD is supported by the Torres Quevedo subprogram (MI CINN) under grant agreement PTQ-12-05391. EH is supported by the Research Council of Norway under grant agreements 221580 and 218241 and by the Norwegian Cancer Society under grant agreement 71220-PR-2006-0433. Very special thanks go to Jennifer Jennings for administrating the activity of the ICGC Verification Working Group and Anna Borrell for administrative support.This is the final version of the article. It first appeared from Nature Publishing Group via http://dx.doi.org/10.1038/ncomms1000

    Pan-cancer analysis of whole genomes identifies driver rearrangements promoted by LINE-1 retrotransposition.

    Get PDF
    About half of all cancers have somatic integrations of retrotransposons. Here, to characterize their role in oncogenesis, we analyzed the patterns and mechanisms of somatic retrotransposition in 2,954 cancer genomes from 38 histological cancer subtypes within the framework of the Pan-Cancer Analysis of Whole Genomes (PCAWG) project. We identified 19,166 somatically acquired retrotransposition events, which affected 35% of samples and spanned a range of event types. Long interspersed nuclear element (LINE-1; L1 hereafter) insertions emerged as the first most frequent type of somatic structural variation in esophageal adenocarcinoma, and the second most frequent in head-and-neck and colorectal cancers. Aberrant L1 integrations can delete megabase-scale regions of a chromosome, which sometimes leads to the removal of tumor-suppressor genes, and can induce complex translocations and large-scale duplications. Somatic retrotranspositions can also initiate breakage-fusion-bridge cycles, leading to high-level amplification of oncogenes. These observations illuminate a relevant role of L1 retrotransposition in remodeling the cancer genome, with potential implications for the development of human tumors

    Tuning fresh: radiation through rewiring of central metabolism in streamlined bacteria

    Get PDF
    Most free-living planktonic cells are streamlined and in spite of their limitations in functional flexibility, their vast populations have radiated into a wide range of aquatic habitats. Here we compared the metabolic potential of subgroups in the Alphaproteobacteria lineage SAR11 adapted to marine and freshwater habitats. Our results suggest that the successful leap from marine to freshwaters in SAR11 was accompanied by a loss of several carbon degradation pathways and a rewiring of the central metabolism. Examples for these are C1 and methylated compounds degradation pathways, the Entner–Doudouroff pathway, the glyoxylate shunt and anapleuretic carbon fixation being absent from the freshwater genomes. Evolutionary reconstructions further suggest that the metabolic modules making up these important freshwater metabolic traits were already present in the gene pool of ancestral marine SAR11 populations. The loss of the glyoxylate shunt had already occurred in the common ancestor of the freshwater subgroup and its closest marine relatives, suggesting that the adaptation to freshwater was a gradual process. Furthermore, our results indicate rapid evolution of TRAP transporters in the freshwater clade involved in the uptake of low molecular weight carboxylic acids. We propose that such gradual tuning of metabolic pathways and transporters toward locally available organic substrates is linked to the formation of subgroups within the SAR11 clade and that this process was critical for the freshwater clade to find and fix an adaptive phenotype.This work was supported by the Swedish Research Council (Grant Numbers 2012-4592 to AE and 2012-3892 to SB) and the Communiy Sequencing Programme of the US Department of Energy Joint Genome Institute. The work conducted by the US Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, is supported under Contract No. DE-AC02-05CH11231

    GA4GH: International policies and standards for data sharing across genomic research and healthcare.

    Get PDF
    The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution. We describe the GA4GH organization, which is fueled by the development efforts of eight Work Streams and informed by the needs of 24 Driver Projects and other key stakeholders. We present the GA4GH suite of secure, interoperable technical standards and policy frameworks and review the current status of standards, their relevance to key domains of research and clinical care, and future plans of GA4GH. Broad international participation in building, adopting, and deploying GA4GH standards and frameworks will catalyze an unprecedented effort in data sharing that will be critical to advancing genomic medicine and ensuring that all populations can access its benefits

    Fluid challenges in intensive care: the FENICE study A global inception cohort study

    Get PDF
    Fluid challenges (FCs) are one of the most commonly used therapies in critically ill patients and represent the cornerstone of hemodynamic management in intensive care units. There are clear benefits and harms from fluid therapy. Limited data on the indication, type, amount and rate of an FC in critically ill patients exist in the literature. The primary aim was to evaluate how physicians conduct FCs in terms of type, volume, and rate of given fluid; the secondary aim was to evaluate variables used to trigger an FC and to compare the proportion of patients receiving further fluid administration based on the response to the FC.This was an observational study conducted in ICUs around the world. Each participating unit entered a maximum of 20 patients with one FC.2213 patients were enrolled and analyzed in the study. The median [interquartile range] amount of fluid given during an FC was 500 ml (500-1000). The median time was 24 min (40-60 min), and the median rate of FC was 1000 [500-1333] ml/h. The main indication for FC was hypotension in 1211 (59 %, CI 57-61 %). In 43 % (CI 41-45 %) of the cases no hemodynamic variable was used. Static markers of preload were used in 785 of 2213 cases (36 %, CI 34-37 %). Dynamic indices of preload responsiveness were used in 483 of 2213 cases (22 %, CI 20-24 %). No safety variable for the FC was used in 72 % (CI 70-74 %) of the cases. There was no statistically significant difference in the proportion of patients who received further fluids after the FC between those with a positive, with an uncertain or with a negatively judged response.The current practice and evaluation of FC in critically ill patients are highly variable. Prediction of fluid responsiveness is not used routinely, safety limits are rarely used, and information from previous failed FCs is not always taken into account

    Genome-wide associations for birth weight and correlations with adult disease

    Get PDF
    Birth weight (BW) has been shown to be influenced by both fetal and maternal factors and in observational studies is reproducibly associated with future risk of adult metabolic diseases including type 2 diabetes (T2D) and cardiovascular disease. These life-course associations have often been attributed to the impact of an adverse early life environment. Here, we performed a multi-ancestry genome-wide association study (GWAS) meta-analysis of BW in 153,781 individuals, identifying 60 loci where fetal genotype was associated with BW (P\textit{P}  < 5 × 108^{-8}). Overall, approximately 15% of variance in BW was captured by assays of fetal genetic variation. Using genetic association alone, we found strong inverse genetic correlations between BW and systolic blood pressure (R\textit{R}g_{g} = -0.22, P\textit{P}  = 5.5 × 1013^{-13}), T2D (R\textit{R}g_{g} = -0.27, P\textit{P}  = 1.1 × 106^{-6}) and coronary artery disease (R\textit{R}g_{g} = -0.30, P\textit{P}  = 6.5 × 109^{-9}). In addition, using large -cohort datasets, we demonstrated that genetic factors were the major contributor to the negative covariance between BW and future cardiometabolic risk. Pathway analyses indicated that the protein products of genes within BW-associated regions were enriched for diverse processes including insulin signalling, glucose homeostasis, glycogen biosynthesis and chromatin remodelling. There was also enrichment of associations with BW in known imprinted regions (P\textit{P} = 1.9 × 104^{-4}). We demonstrate that life-course associations between early growth phenotypes and adult cardiometabolic disease are in part the result of shared genetic effects and identify some of the pathways through which these causal genetic effects are mediated.For a full list of the funders pelase visit the publisher's website and look at the supplemetary material provided. Some of the funders are: British Heart Foundation, Cancer Research UK, Medical Research Council, National Institutes of Health, Royal Society and Wellcome Trust
    corecore