Search CORE

14,680 research outputs found

Recommended from our members

novoBreak: local assembly for breakpoint detection in cancer genomes.

Author: Boutros Paul
Chen Junjie
Chen Ken
Chen Tenghui
Chong Zechen
Ding Li
Fan Xian
Gao Min
Lee Anna Y
Ruan Jue
Zhou Wanding
Publication venue: eScholarship, University of California
Publication date: 01/01/2017
Field of study

We present novoBreak, a genome-wide local assembly algorithm that discovers somatic and germline structural variation breakpoints in whole-genome sequencing data. novoBreak consistently outperformed existing algorithms on real cancer genome data and on synthetic tumors in the ICGC-TCGA DREAM 8.5 Somatic Mutation Calling Challenge primarily because it more effectively utilized reads spanning breakpoints. novoBreak also demonstrated great sensitivity in identifying short insertions and deletions

eScholarship - University of California

Comparative genomics of Burkholderia multivorans, a ubiquitous pathogen with a highly conserved genomic structure

Author: Carlier Aurélien
Cooper Vaughn S
Hatcher Philip J
Peeters Charlotte
Vandamme Peter
Verheyde Bart
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2017
Field of study

The natural environment serves as a reservoir of opportunistic pathogens. A well-established method for studying the epidemiology of such opportunists is multilocus sequence typing, which in many cases has defined strains predisposed to causing infection. Burkholderia multivorans is an important pathogen in people with cystic fibrosis (CF) and its epidemiology suggests that strains are acquired from non-human sources such as the natural environment. This raises the central question of whether the isolation source (CF or environment) or the multilocus sequence type (ST) of B. multivorans better predicts their genomic content and functionality. We identified four pairs of B. multivorans isolates, representing distinct STs and consisting of one CF and one environmental isolate each. All genomes were sequenced using the PacBio SMRT sequencing technology, which resulted in eight high-quality B. multivorans genome assemblies. The present study demonstrated that the genomic structure of the examined B. multivorans STs is highly conserved and that the B. multivorans genomic lineages are defined by their ST. Orthologous protein families were not uniformly distributed among chromosomes, with core orthologs being enriched on the primary chromosome and ST-specific orthologs being enriched on the second and third chromosome. The ST-specific orthologs were enriched in genes involved in defense mechanisms and secondary metabolism, corroborating the strain-specificity of these virulence characteristics. Finally, the same B. multivorans genomic lineages occur in both CF and environmental samples and on different continents, demonstrating their ubiquity and evolutionary persistence

Ghent University Academic Bibliography

Directory of Open Access Journals

The Personal Genome Project-UK, an open access resource of human multi-omics data

Author: Beck Stephan
Berner Alison
Chervova Olga
Conde Lucia
Guerra-Assunção José Afonso
Hamoudi Rifat
Herrero Javier
Jesus Tiago F.
Larose Cadieux Elizabeth
Moghul Ismail
Tian Yuan
Voloshin Vitaly
Webster Amy P.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/10/2019
Field of study

Integrative analysis of multi-omics data is a powerful approach for gaining functional insights into biological and medical processes. Conducting these multifaceted analyses on human samples is often complicated by the fact that the raw sequencing output is rarely available under open access. The Personal Genome Project UK (PGP-UK) is one of few resources that recruits its participants under open consent and makes the resulting multi-omics data freely and openly available. As part of this resource, we describe the PGP-UK multi-omics reference panel consisting of ten genomic, methylomic and transcriptomic data. Specifically, we outline the data processing, quality control and validation procedures which were implemented to ensure data integrity and exclude sample mix-ups. In addition, we provide a REST API to facilitate the download of the entire PGP-UK dataset. The data are also available from two cloud-based environments, providing platforms for free integrated analysis. In conclusion, the genotype-validated PGP-UK multi-omics human reference panel described here provides a valuable new open access resource for integrated analyses in support of personal and medical genomics

UCL Discovery

Warwick Research Archives Portal Repository

A haplotype-resolved draft genome of the European sardine (Sardina pilchardus)

Author: Canario A.V.M.
Cox Cymon
De Moro Gianluca
Garcia Carlos
Louro Bruno
Sabatino Stephen J.
Santos António M.
Veríssimo Ana
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2019
Field of study

The European sardine (Sardina pilchardus Walbaum, 1792) is culturally and economically important throughout its distribution. Monitoring studies of sardine populations report an alarming decrease in stocks due to overfishing and environmental change, which has resulted in historically low captures along the Iberian Atlantic coast. Important biological and ecological features such as population diversity, structure, and migratory patterns can be addressed with the development and use of genomics resources.Agência financiadora Portuguese national funds from FCT-Foundation for Science and Technology: UID/Multi/04326/2016; European Regional Development Fund (FEDER): 22153-01/SAICT/2016; ALG-01-0145-FEDER-022121; ALG-01-0145-FEDER-022231; MAR2020 operational programme of the European Maritime and Fisheries Fund (project SARDI-NOMICS): MAR-01.04.02-FEAMP-0024; European Union's Horizon 2020 research and innovation programme: 654008info:eu-repo/semantics/publishedVersio

Sapientia

Discordant bioinformatic predictions of antimicrobial resistance from whole-genome sequencing data of bacterial isolates: an inter-laboratory study.

Author: Aller SD
Bruchmann S
Clark T
Coello Pelegrin A
Cormican M
Diez Benavente E
Doyle RM
Ellington MJ
Harris KA
Huggett JF
McGrath E
Moran-Gilad J
Motro Y
O'Sullivan DM
Phelan J
Phuong Thuy Nguyen T
Shaw LP
Stabler RA
van Belkum A
van Dorp L
Woodford N
Publication venue: 'Microbiology Society'
Publication date: 05/10/2019
Field of study

Antimicrobial resistance (AMR) poses a threat to public health. Clinical microbiology laboratories typically rely on culturing bacteria for antimicrobial-susceptibility testing (AST). As the implementation costs and technical barriers fall, whole-genome sequencing (WGS) has emerged as a 'one-stop' test for epidemiological and predictive AST results. Few published comparisons exist for the myriad analytical pipelines used for predicting AMR. To address this, we performed an inter-laboratory study providing sets of participating researchers with identical short-read WGS data from clinical isolates, allowing us to assess the reproducibility of the bioinformatic prediction of AMR between participants, and identify problem cases and factors that lead to discordant results. We produced ten WGS datasets of varying quality from cultured carbapenem-resistant organisms obtained from clinical samples sequenced on either an Illumina NextSeq or HiSeq instrument. Nine participating teams ('participants') were provided these sequence data without any other contextual information. Each participant used their choice of pipeline to determine the species, the presence of resistance-associated genes, and to predict susceptibility or resistance to amikacin, gentamicin, ciprofloxacin and cefotaxime. We found participants predicted different numbers of AMR-associated genes and different gene variants from the same clinical samples. The quality of the sequence data, choice of bioinformatic pipeline and interpretation of the results all contributed to discordance between participants. Although much of the inaccurate gene variant annotation did not affect genotypic resistance predictions, we observed low specificity when compared to phenotypic AST results, but this improved in samples with higher read depths. Had the results been used to predict AST and guide treatment, a different antibiotic would have been recommended for each isolate by at least one participant. These challenges, at the final analytical stage of using WGS to predict AMR, suggest the need for refinements when using this technology in clinical settings. Comprehensive public resistance sequence databases, full recommendations on sequence data quality and standardization in the comparisons between genotype and resistance phenotypes will all play a fundamental role in the successful implementation of AST prediction using WGS in clinical microbiology laboratories

LSHTM Research Online

UCL Discovery

Oxford University Research Archive

Institutional Repository Universiteit Antwerpen

St George's Online Research Archive

Recommended from our members

Transcriptome and translatome profiles of Streptomyces species in different growth phases.

Author: Cho Byung-Kwan
Cho Suhyung
Hwang Soonkyu
Kim Woori
Lee Namil
Lee Yongjae
Palsson Bernhard
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

Streptomyces are efficient producers of various bioactive compounds, which are mostly synthesized by their secondary metabolite biosynthetic gene clusters (smBGCs). The smBGCs are tightly controlled by complex regulatory systems at transcriptional and translational levels to effectively utilize precursors that are supplied by primary metabolism. Thus, dynamic changes in gene expression in response to cellular status at both the transcriptional and translational levels should be elucidated to directly reflect protein levels, rapid downstream responses, and cellular energy costs. In this study, RNA-Seq and ribosome profiling were performed for five industrially important Streptomyces species at different growth phases, for the deep sequencing of total mRNA, and only those mRNA fragments that are protected by translating ribosomes, respectively. Herein, 12.0 to 763.8 million raw reads were sufficiently obtained with high quality of more than 80% for the Phred score Q30 and high reproducibility. These data provide a comprehensive understanding of the transcriptional and translational landscape across the Streptomyces species and contribute to facilitating the rational engineering of secondary metabolite production

eScholarship - University of California

Online Research Database In Technology

Recommended from our members

Worldwide genetic variation of the IGHV and TRBV immune receptor gene families in humans.

Author: Li Heng
Luo Shishi
Song Yun
Yu Jane
Publication venue: eScholarship, University of California
Publication date: 01/04/2019
Field of study

The immunoglobulin heavy variable (IGHV) and T cell beta variable (TRBV) loci are among the most complex and variable regions in the human genome. Generated through a process of gene duplication/deletion and diversification, these loci can vary extensively between individuals in copy number and contain genes that are highly similar, making their analysis technically challenging. Here, we present a comprehensive study of the functional gene segments in the IGHV and TRBV loci, quantifying their copy number and single-nucleotide variation in a globally diverse sample of 109 (IGHV) and 286 (TRBV) humans from over a 100 populations. We find that the IGHV and TRBV gene families exhibit starkly different patterns of variation. In addition to providing insight into the different evolutionary paths of the IGHV and TRBV loci, our results are also important to the adaptive immune repertoire sequencing community, where the lack of frequencies of common alleles and copy number variants is hampering existing analytical pipelines

eScholarship - University of California

Systematic genetic analysis of the MHC region reveals mechanistic underpinnings of HLA type associations with disease.

Author: Aguiar
Assis
Auton
Bijvelds
Blackwell
Bonder
D'Antonio
D'Antonio-Chronowska
DeBoever
Dechecchi
DeGiorgio
Dendrou
Diwakar
Dobin
Eguchi
Ernst
Fehrmann
Freudenberg
Gambino
Gensterblum-Miller
Giambartolomei
González-Galarza
Gough
Graffelman
Guo
Hardy
Harrow
Herrmann
Holoshitz
Huang
Jakubosky
Jakubosky
Jensen
Jia
Kilpinen
Kilpinen
Klein
Kontakioti
Kundaje
Laki
Lam
Lee
Leung
Li
Li
Li
Li
Li
Lyczak
Mahdi
Mall
Matzaraki
Mayba
McNicholas
Miretti
Morison
Munder
Nariai
Norman
Oldstone
Panopoulos
Panousis
Pier
Robinson
Sondo
Stegle
Stoltz
Streeter
Tan
Tomati
Trowsdale
Van der Auwera
Vicente
Wilke
Yin
Zhang
Zheng
Publication venue: eScholarship, University of California
Publication date: 01/11/2019
Field of study

The MHC region is highly associated with autoimmune and infectious diseases. Here we conduct an in-depth interrogation of associations between genetic variation, gene expression and disease. We create a comprehensive map of regulatory variation in the MHC region using WGS from 419 individuals to call eight-digit HLA types and RNA-seq data from matched iPSCs. Building on this regulatory map, we explored GWAS signals for 4083 traits, detecting colocalization for 180 disease loci with eQTLs. We show that eQTL analyses taking HLA type haplotypes into account have substantially greater power compared with only using single variants. We examined the association between the 8.1 ancestral haplotype and delayed colonization in Cystic Fibrosis, postulating that downregulation of RNF5 expression is the likely causal mechanism. Our study provides insights into the genetic architecture of the MHC region and pinpoints disease associations that are due to differential expression of HLA genes and non-HLA genes

Crossref

eScholarship - University of California

Forensic SNP genotyping using nanopore MinION sequencing

Author: Cornelis Senne
Deforce Dieter
Deleye Lieselot
Gansemans Yannick
Van Nieuwerburgh Filip
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

One of the latest developments in next generation sequencing is the Oxford Nanopore Technologies' (ONT) MinION nanopore sequencer. We studied the applicability of this system to perform forensic genotyping of the forensic female DNA standard 9947 A using the 52 SNP-plex assay developed by the SNPforID consortium. All but one of the loci were correctly genotyped. Several SNP loci were identified as problematic for correct and robust genotyping using nanopore sequencing. All these loci contained homopolymers in the sequence flanking the forensic SNP and most of them were already reported as problematic in studies using other sequencing technologies. When these problematic loci are avoided, correct forensic genotyping using nanopore sequencing is technically feasible

Ghent University Academic Bibliography

PubMed Central

An improved Plasmodium cynomolgi genome assembly reveals an unexpected methyltransferase gene expansion.

Author: Berriman Matt
Böhme Ulrike
Kocken Clemens H.M.
Otto Thomas Dan
Pasini Erica M.
Rutledge Gavin G.
Sanders Mandy
Voorberg-Van der Wel Annemarie
Publication venue: 'F1000 Research Ltd'
Publication date: 01/06/2017
Field of study

Background: Plasmodium cynomolgi, a non-human primate malaria parasite species, has been an important model parasite since its discovery in 1907. Similarities in the biology of P. cynomolgi to the closely related, but less tractable, human malaria parasite P. vivax make it the model parasite of choice for liver biology and vaccine studies pertinent to P. vivax malaria. Molecular and genome-scale studies of P. cynomolgi have relied on the current reference genome sequence, which remains highly fragmented with 1,649 unassigned scaffolds and little representation of the subtelomeres. Methods: Using long-read sequence data (Pacific Biosciences SMRT technology), we assembled and annotated a new reference genome sequence, PcyM, sourced from an Indian rhesus monkey. We compare the newly assembled genome sequence with those of several other Plasmodium species, including a re-annotated P. coatneyi assembly. Results: The new PcyM genome assembly is of significantly higher quality than the existing reference, comprising only 56 pieces, no gaps and an improved average gene length. Detailed manual curation has ensured a comprehensive annotation of the genome with 6,632 genes, nearly 1,000 more than previously attributed to P. cynomolgi. The new assembly also has an improved representation of the subtelomeric regions, which account for nearly 40% of the sequence. Within the subtelomeres, we identified more than 1300 Plasmodium interspersed repeat (pir) genes, as well as a striking expansion of 36 methyltransferase pseudogenes that originated from a single copy on chromosome 9. Conclusions: The manually curated PcyM reference genome sequence is an important new resource for the malaria research community. The high quality and contiguity of the data have enabled the discovery of a novel expansion of methyltransferase in the subtelomeres, and illustrates the new comparative genomics capabilities that are being unlocked by complete reference genomes

Directory of Open Access Journals

Enlighten