Search CORE

22,524 research outputs found

De Novo Co-Assembly Of Bacterial Genomes From Multiple Single Cells

Author: Movahedi Tabrizi Narjes Sadat
Publication venue: DigitalCommons@WayneState
Publication date: 01/01/2014
Field of study

Recent progress in DNA amplication techniques, particularly multiple displacement amplication (MDA), has made it possible to sequence and assemble bacterial genomes from a single cell. However, the quality of single cell genome assembly has not yet reached the quality of normal multicell genome assembly due to the coverage bias and errors caused by MDA. Using a template of more than one cell for MDA or combining separate MDA products has been shown to improve the result of genome assembly from few single cells, but providing identical single cells, as a necessary step for these approaches, is a challenge. As a solution to this problem, we give an algorithm for de novo co-assembly of bacterial genomes from multiple single cells. Our novel method not only detects the outlier cells in a pool, it also identies and eliminates their genomic sequences from the nal assembly. Our proposed co-assembly algorithm is based on colored de Bruijn graph which has been recently proposed for de novo structural variation detection. Our results show that de novo co-assembly of bacterial genomes from multiple single cells outperforms single cell assembly of each individual one in all standard metrics. Moreover, our de novo co-assembly also outperforms the mixed assembly in which the input datasets are simply concatenated. We implemented our algorithm in a software tool called HyDA which is available from http://chitsazlab.org/software/hyda

Digital Commons@Wayne State University

Ensemble Analysis of Adaptive Compressed Genome Sequencing Strategies

Author: Taghavi Zeinab
Publication venue
Publication date: 25/04/2014
Field of study

Acquiring genomes at single-cell resolution has many applications such as in the study of microbiota. However, deep sequencing and assembly of all of millions of cells in a sample is prohibitively costly. A property that can come to rescue is that deep sequencing of every cell should not be necessary to capture all distinct genomes, as the majority of cells are biological replicates. Biologically important samples are often sparse in that sense. In this paper, we propose an adaptive compressed method, also known as distilled sensing, to capture all distinct genomes in a sparse microbial community with reduced sequencing effort. As opposed to group testing in which the number of distinct events is often constant and sparsity is equivalent to rarity of an event, sparsity in our case means scarcity of distinct events in comparison to the data size. Previously, we introduced the problem and proposed a distilled sensing solution based on the breadth first search strategy. We simulated the whole process which constrained our ability to study the behavior of the algorithm for the entire ensemble due to its computational intensity. In this paper, we modify our previous breadth first search strategy and introduce the depth first search strategy. Instead of simulating the entire process, which is intractable for a large number of experiments, we provide a dynamic programming algorithm to analyze the behavior of the method for the entire ensemble. The ensemble analysis algorithm recursively calculates the probability of capturing every distinct genome and also the expected total sequenced nucleotides for a given population profile. Our results suggest that the expected total sequenced nucleotides grows proportional to

\log

of the number of cells and proportional linearly with the number of distinct genomes

arXiv.org e-Print Archive

Springer - Publisher Connector

A Reference-Free Algorithm for Computational Normalization of Shotgun Sequencing Data

Author: Brom Timothy H.
Brown C. Titus
Howe Adina
Pyrkosz Alexis B.
Zhang Qingpeng
Publication venue
Publication date: 21/05/2012
Field of study

Deep shotgun sequencing and analysis of genomes, transcriptomes, amplified single-cell genomes, and metagenomes has enabled investigation of a wide range of organisms and ecosystems. However, sampling variation in short-read data sets and high sequencing error rates of modern sequencers present many new computational challenges in data interpretation. These challenges have led to the development of new classes of mapping tools and {\em de novo} assemblers. These algorithms are challenged by the continued improvement in sequencing throughput. We here describe digital normalization, a single-pass computational algorithm that systematizes coverage in shotgun sequencing data sets, thereby decreasing sampling variation, discarding redundant data, and removing the majority of errors. Digital normalization substantially reduces the size of shotgun data sets and decreases the memory and time requirements for {\em de novo} sequence assembly, all without significantly impacting content of the generated contigs. We apply digital normalization to the assembly of microbial genomic data, amplified single-cell genomic data, and transcriptomic data. Our implementation is freely available for use and modification

arXiv.org e-Print Archive

CiteSeerX

Recovering complete and draft population genomes from metagenome datasets.

Author: Gilbert Jack A
Sangwan Naseer
Xia Fangfang
Publication venue: eScholarship, University of California
Publication date: 01/03/2016
Field of study

Assembly of metagenomic sequence data into microbial genomes is of fundamental value to improving our understanding of microbial ecology and metabolism by elucidating the functional potential of hard-to-culture microorganisms. Here, we provide a synthesis of available methods to bin metagenomic contigs into species-level groups and highlight how genetic diversity, sequencing depth, and coverage influence binning success. Despite the computational cost on application to deeply sequenced complex metagenomes (e.g., soil), covarying patterns of contig coverage across multiple datasets significantly improves the binning process. We also discuss and compare current genome validation methods and reveal how these methods tackle the problem of chimeric genome bins i.e., sequences from multiple species. Finally, we explore how population genome assembly can be used to uncover biogeographic trends and to characterize the effect of in situ functional constraints on the genome-wide evolution

Woods Hole Open Access Server

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Recommended from our members

Evolution of host support for two ancient bacterial symbionts with differentially degraded genomes in a leafhopper host.

Author: Bennett Gordon M
Mao Meng
Yang Xiushuai
Publication venue: eScholarship, University of California
Publication date: 01/12/2018
Field of study

Plant sap-feeding insects (Hemiptera) rely on bacterial symbionts for nutrition absent in their diets. These bacteria experience extreme genome reduction and require genetic resources from their hosts, particularly for basic cellular processes other than nutrition synthesis. The host-derived mechanisms that complete these processes have remained poorly understood. It is also unclear how hosts meet the distinct needs of multiple bacterial partners with differentially degraded genomes. To address these questions, we investigated the cell-specific gene-expression patterns in the symbiotic organs of the aster leafhopper (ALF), Macrosteles quadrilineatus (Cicadellidae). ALF harbors two intracellular symbionts that have two of the smallest known bacterial genomes: Nasuia (112 kb) and Sulcia (190 kb). Symbionts are segregated into distinct host cell types (bacteriocytes) and vary widely in their basic cellular capabilities. ALF differentially expresses thousands of genes between the bacteriocyte types to meet the functional needs of each symbiont, including the provisioning of metabolites and support of cellular processes. For example, the host highly expresses genes in the bacteriocytes that likely complement gene losses in nucleic acid synthesis, DNA repair mechanisms, transcription, and translation. Such genes are required to function in the bacterial cytosol. Many host genes comprising these support mechanisms are derived from the evolution of novel functional traits via horizontally transferred genes, reassigned mitochondrial support genes, and gene duplications with bacteriocyte-specific expression. Comparison across other hemipteran lineages reveals that hosts generally support the incomplete symbiont cellular processes, but the origins of these support mechanisms are generally specific to the host-symbiont system

eScholarship - University of California

Recommended from our members

Optimizing sequencing protocols for leaderboard metagenomics by combining long and short reads.

Author: Arthur Timothy D
Bankevich Anton
Boland Brigid S
Brennan Caitriona
Chang John T
Chen Feng
Conrad Douglas J
Dang Jason W
Dorrestein Pieter C
Fedarko Marcus
Gaffney James
Green Cliff
Humphrey Greg C
Jepsen Kristen
Khosroheidari Mahdieh
Knight Rob
Liyanage Marlon
Martino Cameron
Minich Jeremiah
Nurk Sergey
Pevzner Pavel A
Phelan Vanessa V
Quinn Robert A
Rana Tariq M
Salido Rodolfo A
Sandborn William J
Sanders Jon G
Sanders Karenina
Smarr Larry
Xu Zhenjiang Z
Zhu Qiyun
Publication venue: eScholarship, University of California
Publication date: 01/10/2019
Field of study

As metagenomic studies move to increasing numbers of samples, communities like the human gut may benefit more from the assembly of abundant microbes in many samples, rather than the exhaustive assembly of fewer samples. We term this approach leaderboard metagenome sequencing. To explore protocol optimization for leaderboard metagenomics in real samples, we introduce a benchmark of library prep and sequencing using internal references generated by synthetic long-read technology, allowing us to evaluate high-throughput library preparation methods against gold-standard reference genomes derived from the samples themselves. We introduce a low-cost protocol for high-throughput library preparation and sequencing

eScholarship - University of California

Host-linked soil viral ecology along a permafrost thaw gradient

Author: Bolduc Benjamin
Boyd Joel A.
Brum Jennifer R.
Chanton Jeffrey P.
Crill Patrick M.
Emerson Joanne B.
Frolking Stephen E.
Hodgkins Suzanne B.
Jang Ho Bin
Li Changsheng
Naas Adrian E.
Pope Phillip B.
Rich Virginia I.
Roux Simon
Saleska Scott R.
Singleton Caitlin M.
Solden Lindsey M.
Sullivan Matthew B.
Trubl Gareth
Tyson Gene W.
Wilson Rachel M.
Woodcroft Ben J.
Wrighton Kelly C.
Publication venue: University of New Hampshire Scholars\u27 Repository
Publication date: 16/07/2018
Field of study

Climate change threatens to release abundant carbon that is sequestered at high latitudes, but the constraints on microbial metabolisms that mediate the release of methane and carbon dioxide are poorly understood1,2,3,4,5,6,7. The role of viruses, which are known to affect microbial dynamics, metabolism and biogeochemistry in the oceans8,9,10, remains largely unexplored in soil. Here, we aimed to investigate how viruses influence microbial ecology and carbon metabolism in peatland soils along a permafrost thaw gradient in Sweden. We recovered 1,907 viral populations (genomes and large genome fragments) from 197 bulk soil and size-fractionated metagenomes, 58% of which were detected in metatranscriptomes and presumed to be active. In silico predictions linked 35% of the viruses to microbial host populations, highlighting likely viral predators of key carbon-cycling microorganisms, including methanogens and methanotrophs. Lineage-specific virus/host ratios varied, suggesting that viral infection dynamics may differentially impact microbial responses to a changing climate. Virus-encoded glycoside hydrolases, including an endomannanase with confirmed functional activity, indicated that viruses influence complex carbon degradation and that viral abundances were significant predictors of methane dynamics. These findings suggest that viruses may impact ecosystem function in climate-critical, terrestrial habitats and identify multiple potential viral contributions to soil carbon cycling

UNH Scholars' Repository

Comparative genome analysis of Wolbachia strain wAu

Author: Harris Simon R.
Parkhill Julian
Sinkins Steven P.
Sutton Elizabeth R.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

BACKGROUND: Wolbachia intracellular bacteria can manipulate the reproduction of their arthropod hosts, including inducing sterility between populations known as cytoplasmic incompatibility (CI). Certain strains have been identified that are unable to induce or rescue CI, including wAu from Drosophila. Genome sequencing and comparison with CI-inducing related strain wMel was undertaken in order to better understand the molecular basis of the phenotype. RESULTS: Although the genomes were broadly similar, several rearrangements were identified, particularly in the prophage regions. Many orthologous genes contained single nucleotide polymorphisms (SNPs) between the two strains, but a subset containing major differences that would likely cause inactivation in wAu were identified, including the absence of the wMel ortholog of a gene recently identified as a CI candidate in a proteomic study. The comparative analyses also focused on a family of transcriptional regulator genes implicated in CI in previous work, and revealed numerous differences between the strains, including those that would have major effects on predicted function. CONCLUSIONS: The study provides support for existing candidates and novel genes that may be involved in CI, and provides a basis for further functional studies to examine the molecular basis of the phenotype

Crossref

Springer - Publisher Connector

PubMed Central

Enlighten

The evolution of the natural killer complex; a comparison between mammals using new high-quality genome assemblies and targeted annotation.

Author: Bickhart Derek M
Gibson Mark S
Hammond John A
Heimeier Dorothea
Koren Sergey
Medrano Juan F
Phillippy Adam M
Schwartz John C
Smith Timothy PL
Publication venue: eScholarship, University of California
Publication date: 01/01/2017
Field of study

Natural killer (NK) cells are a diverse population of lymphocytes with a range of biological roles including essential immune functions. NK cell diversity is in part created by the differential expression of cell surface receptors which modulate activation and function, including multiple subfamilies of C-type lectin receptors encoded within the NK complex (NKC). Little is known about the gene content of the NKC beyond rodent and primate lineages, other than it appears to be extremely variable between mammalian groups. We compared the NKC structure between mammalian species using new high-quality draft genome assemblies for cattle and goat; re-annotated sheep, pig, and horse genome assemblies; and the published human, rat, and mouse lemur NKC. The major NKC genes are largely in the equivalent positions in all eight species, with significant independent expansions and deletions between species, allowing us to propose a model for NKC evolution during mammalian radiation. The ruminant species, cattle and goats, have independently evolved a second KLRC locus flanked by KLRA and KLRJ, and a novel KLRH-like gene has acquired an activating tail. This novel gene has duplicated several times within cattle, while other activating receptor genes have been selectively disrupted. Targeted genome enrichment in cattle identified varying levels of allelic polymorphism between the NKC genes concentrated in the predicted extracellular ligand-binding domains. This novel recombination and allelic polymorphism is consistent with NKC evolution under balancing selection, suggesting that this diversity influences individual immune responses and may impact on differential outcomes of pathogen infection and vaccination

Springer - Publisher Connector

PubMed Central

Repositório da Universidade Nova de Lisboa

eScholarship - University of California