24 research outputs found
Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome
<p>Abstract</p> <p>Background</p> <p>With a whole genome duplication event and wealth of biological data, salmonids are excellent model organisms for studying evolutionary processes, fates of duplicated genes and genetic and physiological processes associated with complex behavioral phenotypes. It is surprising therefore, that no salmonid genome has been sequenced. Atlantic salmon (<it>Salmo salar</it>) is a good representative salmonid for sequencing given its importance in aquaculture and the genomic resources available. However, the size and complexity of the genome combined with the lack of a sequenced reference genome from a closely related fish makes assembly challenging. Given the cost and time limitations of Sanger sequencing as well as recent improvements to next generation sequencing technologies, we examined the feasibility of using the Genome Sequencer (GS) FLX pyrosequencing system to obtain the sequence of a salmonid genome. Eight pooled BACs belonging to a minimum tiling path covering ~1 Mb of the Atlantic salmon genome were sequenced by GS FLX shotgun and Long Paired End sequencing and compared with a ninth BAC sequenced by Sanger sequencing of a shotgun library.</p> <p>Results</p> <p>An initial assembly using only GS FLX shotgun sequences (average read length 248.5 bp) with ~30× coverage allowed gene identification, but was incomplete even when 126 Sanger-generated BAC-end sequences (~0.09× coverage) were incorporated. The addition of paired end sequencing reads (additional ~26× coverage) produced a final assembly comprising 175 contigs assembled into four scaffolds with 171 gaps. Sanger sequencing of the ninth BAC (~10.5× coverage) produced nine contigs and two scaffolds. The number of scaffolds produced by the GS FLX assembly was comparable to Sanger-generated sequencing; however, the number of gaps was much higher in the GS FLX assembly.</p> <p>Conclusion</p> <p>These results represent the first use of GS FLX paired end reads for <it>de novo </it>sequence assembly. Our data demonstrated that this improved the GS FLX assemblies; however, with respect to <it>de novo </it>sequencing of complex genomes, the GS FLX technology is limited to gene mining and establishing a set of ordered sequence contigs. Currently, for a salmonid reference sequence, it appears that a substantial portion of sequencing should be done using Sanger technology.</p
High-throughput 454 resequencing for allele discovery and recombination mapping in Plasmodium falciparum
<p>Abstract</p> <p>Background</p> <p>Knowledge of the origins, distribution, and inheritance of variation in the malaria parasite (<it>Plasmodium falciparum</it>) genome is crucial for understanding its evolution; however the 81% (A+T) genome poses challenges to high-throughput sequencing technologies. We explore the viability of the Roche 454 Genome Sequencer FLX (GS FLX) high throughput sequencing technology for both whole genome sequencing and fine-resolution characterization of genetic exchange in malaria parasites.</p> <p>Results</p> <p>We present a scheme to survey recombination in the haploid stage genomes of two sibling parasite clones, using whole genome pyrosequencing that includes a sliding window approach to predict recombination breakpoints. Whole genome shotgun (WGS) sequencing generated approximately 2 million reads, with an average read length of approximately 300 bp. <it>De novo </it>assembly using a combination of WGS and 3 kb paired end libraries resulted in contigs ≤ 34 kb. More than 8,000 of the 24,599 SNP markers identified between parents were genotyped in the progeny, resulting in a marker density of approximately 1 marker/3.3 kb and allowing for the detection of previously unrecognized crossovers (COs) and many non crossover (NCO) gene conversions throughout the genome.</p> <p>Conclusions</p> <p>By sequencing the 23 Mb genomes of two haploid progeny clones derived from a genetic cross at more than 30× coverage, we captured high resolution information on COs, NCOs and genetic variation within the progeny genomes. This study is the first to resequence progeny clones to examine fine structure of COs and NCOs in malaria parasites.</p
Comparative and Functional Genomics of Rhodococcus opacus PD630 for Biofuels Development
The Actinomycetales bacteria Rhodococcus opacus PD630 and Rhodococcus jostii RHA1 bioconvert a diverse range of organic substrates through lipid biosynthesis into large quantities of energy-rich triacylglycerols (TAGs). To describe the genetic basis of the Rhodococcus oleaginous metabolism, we sequenced and performed comparative analysis of the 9.27 Mb R. opacus PD630 genome. Metabolic-reconstruction assigned 2017 enzymatic reactions to the 8632 R. opacus PD630 genes we identified. Of these, 261 genes were implicated in the R. opacus PD630 TAGs cycle by metabolic reconstruction and gene family analysis. Rhodococcus synthesizes uncommon straight-chain odd-carbon fatty acids in high abundance and stores them as TAGs. We have identified these to be pentadecanoic, heptadecanoic, and cis-heptadecenoic acids. To identify bioconversion pathways, we screened R. opacus PD630, R. jostii RHA1, Ralstonia eutropha H16, and C. glutamicum 13032 for growth on 190 compounds. The results of the catabolic screen, phylogenetic analysis of the TAGs cycle enzymes, and metabolic product characterizations were integrated into a working model of prokaryotic oleaginy.Cambridge-MIT InstituteMassachusetts Institute of Technology. (Seed Grant program)Shell Oil CompanyNational Institute of Allergy and Infectious Diseases (U.S.)United States. National Institutes of HealthNational Institutes of Health. Department of Health and Human Services (Contract No. HHSN272200900006C
Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication
Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes-a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-orange genomes-and show that cultivated types derive from two progenitor species. Although cultivated pummelos represent selections from one progenitor species, Citrus maxima, cultivated mandarins are introgressions of C. maxima into the ancestral mandarin species Citrus reticulata. The most widely cultivated citrus, sweet orange, is the offspring of previously admixed individuals, but sour orange is an F1 hybrid of pure C. maxima and C. reticulata parents, thus implying that wild mandarins were part of the early breeding germplasm. A Chinese wild 'mandarin' diverges substantially from C. reticulata, thus suggesting the possibility of other unrecognized wild citrus species. Understanding citrus phylogeny through genome analysis clarifies taxonomic relationships and facilitates sequence-directed genetic improvement. (Résumé d'auteur
Post-traumatic stress and future substance use outcomes: leveraging antecedent factors to stratify risk
BackgroundPost-traumatic stress disorder (PTSD) and substance use (tobacco, alcohol, and cannabis) are highly comorbid. Many factors affect this relationship, including sociodemographic and psychosocial characteristics, other prior traumas, and physical health. However, few prior studies have investigated this prospectively, examining new substance use and the extent to which a wide range of factors may modify the relationship to PTSD.MethodsThe Advancing Understanding of RecOvery afteR traumA (AURORA) study is a prospective cohort of adults presenting at emergency departments (N = 2,943). Participants self-reported PTSD symptoms and the frequency and quantity of tobacco, alcohol, and cannabis use at six total timepoints. We assessed the associations of PTSD and future substance use, lagged by one timepoint, using the Poisson generalized estimating equations. We also stratified by incident and prevalent substance use and generated causal forests to identify the most important effect modifiers of this relationship out of 128 potential variables.ResultsAt baseline, 37.3% (N = 1,099) of participants reported likely PTSD. PTSD was associated with tobacco frequency (incidence rate ratio (IRR): 1.003, 95% CI: 1.00, 1.01, p = 0.02) and quantity (IRR: 1.01, 95% CI: 1.001, 1.01, p = 0.01), and alcohol frequency (IRR: 1.002, 95% CI: 1.00, 1.004, p = 0.03) and quantity (IRR: 1.003, 95% CI: 1.001, 1.01, p = 0.001), but not with cannabis use. There were slight differences in incident compared to prevalent tobacco frequency and quantity of use; prevalent tobacco frequency and quantity were associated with PTSD symptoms, while incident tobacco frequency and quantity were not. Using causal forests, lifetime worst use of cigarettes, overall self-rated physical health, and prior childhood trauma were major moderators of the relationship between PTSD symptoms and the three substances investigated.ConclusionPTSD symptoms were highly associated with tobacco and alcohol use, while the association with prospective cannabis use is not clear. Findings suggest that understanding the different risk stratification that occurs can aid in tailoring interventions to populations at greatest risk to best mitigate the comorbidity between PTSD symptoms and future substance use outcomes. We demonstrate that this is particularly salient for tobacco use and, to some extent, alcohol use, while cannabis is less likely to be impacted by PTSD symptoms across the strata
Global mRNA decay analysis at single nucleotide resolution reveals segmental and positional degradation patterns in a Gram-positive bacterium
Background
Recent years have shown a marked increase in the use of next-generation sequencing technologies for quantification of gene expression (RNA sequencing, RNA-Seq). The expression level of a gene is a function of both its rate of transcription and RNA decay, and the influence of mRNA decay rates on gene expression in genome-wide studies of Gram-positive bacteria is under-investigated.
Results
In this work, we employed RNA-Seq in a genome-wide determination of mRNA half-lives in the Gram-positive bacterium Bacillus cereus. By utilizing a newly developed normalization protocol, RNA-Seq was used successfully to determine global mRNA decay rates at the single nucleotide level. The analysis revealed positional degradation patterns, with mRNAs being degraded from both ends of the molecule, indicating that both 5' to 3' and 3' to 5' directions of RNA decay are present in B. cereus. Other operons showed segmental degradation patterns where specific ORFs within polycistrons were degraded at variable rates, underlining the importance of RNA processing in gene regulation. We determined the half-lives for more than 2,700 ORFs in B. cereus ATCC 10987, ranging from less than one minute to more than fifteen minutes, and showed that mRNA decay rate correlates globally with mRNA expression level, GC content, and functional class of the ORF.
Conclusions
To our knowledge, this study presents the first global analysis of mRNA decay in a bacterium at single nucleotide resolution. We provide a proof of principle for using RNA-Seq in bacterial mRNA decay analysis, revealing RNA processing patterns at the single nucleotide level
Structural Mechanics Predictions Relating to Clinical Coronary Stent Fracture in a 5 Year Period in FDA MAUDE Database
Endovascular stents are the mainstay of interventional cardiovascular medicine. Technological advances have reduced biological and clinical complications but not mechanical failure. Stent strut fracture is increasingly recognized as of paramount clinical importance. Though consensus reigns that fractures can result from material fatigue, how fracture is induced and the mechanisms underlying its clinical sequelae remain ill-defined. In this study, strut fractures were identified in the prospectively maintained Food and Drug Administration’s (FDA) Manufacturer and User Facility Device Experience Database (MAUDE), covering years 2006–2011, and differentiated based on specific coronary artery implantation site and device configuration. These data, and knowledge of the extent of dynamic arterial deformations obtained from patient CT images and published data, were used to define boundary conditions for 3D finite element models incorporating multimodal, multi-cycle deformation. The structural response for a range of stent designs and configurations was predicted by computational models and included estimation of maximum principal, minimum principal and equivalent plastic strains. Fatigue assessment was performed with Goodman diagrams and safe/unsafe regions defined for different stent designs. Von Mises stress and maximum principal strain increased with multimodal, fully reversed deformation. Spatial maps of unsafe locations corresponded to the identified locations of fracture in different coronary arteries in the clinical database. These findings, for the first time, provide insight into a potential link between patient adverse events and computational modeling of stent deformation. Understanding of the mechanical forces imposed under different implantation conditions may assist in rational design and optimal placement of these devices.United States. Food and Drug Administration. Office of Regulatory Affairs (Grant R01 GM49039
Whole-genome comparison reveals novel genetic elements that characterize the genome of industrial strains of saccharomyces cerevisiae
Human intervention has subjected the yeast Saccharomyces cerevisiae to multiple rounds of independent domestication and thousands of generations of artificial selection. As a result, this species comprises a genetically diverse collection of natural isolates as well as domesticated strains that are used in specific industrial applications. However the scope of genetic diversity that was captured during the domesticated evolution of the industrial representatives of this important organism remains to be determined. To begin to address this, we have produced whole-genome assemblies of six commercial strains of S. cerevisiae (four wine and two brewing strains). These represent the first genome assemblies produced from S. cerevisiae strains in their industrially-used forms and the first high-quality assemblies for S. cerevisiae strains used in brewing. By comparing these sequences to six existing high-coverage S. cerevisiae genome assemblies, clear signatures were found that defined each industrial class of yeast. This genetic variation was comprised of both single nucleotide polymorphisms and large-scale insertions and deletions, with the latter often being associated with ORF heterogeneity between strains. This included the discovery of more than twenty probable genes that had not been identified previously in the S. cerevisiae genome. Comparison of this large number of S. cerevisiae strains also enabled the characterization of a cluster of five ORFs that have integrated into the genomes of the wine and bioethanol strains on multiple occasions and at diverse genomic locations via what appears to involve the resolution of a circular DNA intermediate. This work suggests that, despite the scrutiny that has been directed at the yeast genome, there remains a significant reservoir of ORFs and novel modes of genetic transmission that may have significant phenotypic impact in this important model and industrial species.10 page(s