125 research outputs found
Long-range regulation is a major driving force in maintaining genome integrity
<p>Abstract</p> <p>Background</p> <p>The availability of newly sequenced vertebrate genomes, along with more efficient and accurate alignment algorithms, have enabled the expansion of the field of comparative genomics. Large-scale genome rearrangement events modify the order of genes and non-coding conserved regions on chromosomes. While certain large genomic regions have remained intact over much of vertebrate evolution, others appear to be hotspots for genomic breakpoints. The cause of the non-uniformity of breakpoints that occurred during vertebrate evolution is poorly understood.</p> <p>Results</p> <p>We describe a machine learning method to distinguish genomic regions where breakpoints would be expected to have deleterious effects (called breakpoint-refractory regions) from those where they are expected to be neutral (called breakpoint-susceptible regions). Our predictor is trained using breakpoints that took place along the human lineage since amniote divergence. Based on our predictions, refractory and susceptible regions have very distinctive features. Refractory regions are significantly enriched for conserved non-coding elements as well as for genes involved in development, whereas susceptible regions are enriched for housekeeping genes, likely to have simpler transcriptional regulation.</p> <p>Conclusion</p> <p>We postulate that long-range transcriptional regulation strongly influences chromosome break fixation. In many regions, the fitness cost of altering the spatial association between long-range regulatory regions and their target genes may be so high that rearrangements are not allowed. Consequently, only a limited, identifiable fraction of the genome is susceptible to genome rearrangements.</p
Complete genome sequence of Streptococcus thermophilus SMQ-301, a model strain for phage-host interactions
Streptococcus thermophilus is used by the dairy industry to manufacture yogurt and several cheeses. Using PacBio and Illumina platforms, we sequenced the genome of S. thermophilus SMQ-301, the host of several virulent phages. The genome is composed of 1,861,792 bp and contains 2,037 genes, 67 tRNAs, and 18 rRNAs
First complete genome sequence of Staphylococcus xylosus, a meat starter culture and a host to propagate Staphylococcus aureus phages
Staphylococcus xylosus is a bacterial species used in meat fermentation and a commensal microorganism found on animals. We
present the first complete circular genome from this species. The genome is composed of 2,757,557 bp, with a GC content of
32.9%, and contains 2,514 genes and 79 structural RNAs
Excretion of Host DNA in Feces Is Associated with Risk of Clostridium difficile Infection
Clostridium difficile infection (CDI) is intricately linked to the health of the gastrointestinal tract and its indigenous microbiota. In this study, we assessed whether fecal excretion of host DNA is associated with CDI development. Assuming that shedding of epithelial cell increases in the inflamed intestine, we used human DNA excretion as a marker of intestinal insult. Whole-genome shotgun sequencing was employed to quantify host DNA excretion and evaluate bacterial content in fecal samples collected from patients with incipient CDI, hospitalized controls, and healthy subjects. Human DNA excretion was significantly increased in patients admitted to the hospital for a gastrointestinal ailment, as well as prior to an episode of CDI. In multivariable analyses, human read abundance was independently associated with CDI development. Host DNA proportions were negatively correlated with intestinal microbiota diversity. Enterococcus and Escherichia were enriched in patients excreting high quantities of human DNA, while Ruminococcus and Odoribacter were depleted. These findings suggest that intestinal inflammation can occur prior to CDI development and may influence patient susceptibility to CDI. The quantification of human DNA in feces could serve as a simple and noninvasive approach to assess bowel inflammation and identify patients at risk of CDI
The Amphioxus Hox Cluster: Characterization, Comparative Genomics, and Evolution
The amphioxus Hox cluster is often viewed as “archetypal” for the chordate lineage. Here we present a descriptive account of the 448kb region spanning the Hox cluster of the amphioxus Branchiostoma floridae from Hox14 to Hox1.We provide complete coding sequences of all 14 previously described amphioxus sequences and describe a detailed analysis of the conserved non-coding regulatory sequence elements. We find that the posterior part of the Hox cluster is so highly derived that even the complete genomic sequence is insufficient to decide whether the posterior Hox genes arose by independent duplications or whether they are true orthologs of the corresponding gnathostome paralog groups. In contrast, the anterior region is much better conserved. The amphioxus Hox cluster strongly excludes repetitive elements with the exception of two repeat islands in the posterior region. Repeat exclusion is also observed in gnathostomes, but not protostome Hox clusters. We thus hypothesize that the much shorter vertebrate Hox clusters are the result of extensive resolution of the redundancy of regulatory DNA following the genome duplications rather than the consequence of a selection pressure to remove non-functional sequence from the cluster
De novo assembly of the olive fruit fly (Bactrocera oleae) genome with linked-reads and long-read technologies minimizes gaps and provides exceptional Y chromosome assembly
Background: The olive fruit fly, Bactrocera oleae, is the most important pest in the olive fruit agribusiness industry. This is because female flies lay their eggs in the unripe fruits and upon hatching the larvae feed on the fruits thus destroying them. The lack of a high-quality genome and other genomic and transcriptomic data has hindered progress in understanding the fly’s biology and proposing alternative control methods to pesticide use. Results: Genomic DNA was sequenced from male and female Demokritos strain flies, maintained in the laboratory for over 45 years. We used short-, mate-pair-, and long-read sequencing technologies to generate a combined male-female genome assembly (GenBank accession GCA_001188975.2). Genomic DNA sequencing from male insects using 10x Genomics linked-reads technology followed by mate-pair and long-read scaffolding and gap-closing generated a highly contiguous 489 Mb genome with a scaffold N50 of 4.69 Mb and L50 of 30 scaffolds (GenBank accession GCA_001188975.4). RNA-seq data generated from 12 tissues and/or developmental stages allowed for genome annotation. Short reads from both males and females and the chromosome quotient method enabled identification of Y-chromosome scaffolds which were extensively validated by PCR. Conclusions: The high-quality genome generated represents a critical tool in olive fruit fly research. We provide an extensive RNA-seq data set, and genome annotation, critical towards gaining an insight into the biology of the olive fruit fly. In addition, elucidation of Y-chromosome sequences will advance our understanding of the Y-chromosome’s organization, function and evolution and is poised to provide avenues for sterile insect technique approaches
Improved genome assembly and evidence-based global gene model set for the chordate Ciona intestinalis: new insight into intron and operon populations
An improved assembly of the Ciona intestinalis genome reveals that it contains non-canonical introns and that about 20% of Ciona genes reside in operons
Recommended from our members
New Frontiers for Organismal Biology
Understanding how complex organisms function as integrated units that constantly interact with their environment is a long-standing challenge in biology. To address this challenge, organismal biology reveals general organizing principles of physiological systems and behavior—in particular, in complex multicellular animals. Organismal biology also focuses on the role of individual variability in the evolutionary maintenance of diversity. To broadly advance these frontiers, cross-compatibility of experimental designs, methodological approaches, and data interpretation pipelines represents a key prerequisite. It is now possible to rapidly and systematically analyze complete genomes to elucidate genetic variation associated with traits and conditions that define individuals, populations, and species. However, genetic variation alone does not explain the varied individual physiology and behavior of complex organisms. We propose that such emergent properties of complex organisms can best be explained through a renewed emphasis on the context and life-history dependence of individual phenotypes to complement genetic data.Organismic and Evolutionary Biolog
- …