104 research outputs found
A novel proviral clone of HIV-2: Biological and phylogenetic relationship to other primate immunodeficiency viruses.
Infectious molecular clones of the human immunodeficiency virus type 2 (HIV-2) will be valuable tools for the study of regulatory gene functions and the development of an animal model for the human acquired immunodeficiency syndrome (AIDS). To this end, we have cloned and sequenced a novel HIV-2 isolate, HIV-2BEN. One clone, designated MK6, is infectious for various human T-cell lines and for human and macaque peripheral blood lymphocytes (PBL), allowing molecular studies of HIV-2 infection and replication. Since MK6 is highly cytopathic in MT-2 and Molt-4 clone 8 cells, antiviral agents and neutralizing sera may be tested. Cluster analysis of HIV-1, HIV-2, and simian immunodeficiency virus (SIV) env and gag genes revealed that HIV-2BEN yielded the earliest node of phylogenetic divergence for all reported HIV-2 sequences. Noise analysis showed that, with the current data, no specification of any branching order can be made among the four groups of primate lentiviruses, HIV-1, HIV-2/SIVSMM/MAC, SIVAGM, and SIVMND
Mycobacterium leprae genomes from a British medieval leprosy hospital: towards understanding an ancient epidemic.
BACKGROUND: Leprosy has afflicted humankind throughout history leaving evidence in both early texts and the archaeological record. In Britain, leprosy was widespread throughout the Middle Ages until its gradual and unexplained decline between the 14th and 16th centuries. The nature of this ancient endemic leprosy and its relationship to modern strains is only partly understood. Modern leprosy strains are currently divided into 5 phylogenetic groups, types 0 to 4, each with strong geographical links. Until recently, European strains, both ancient and modern, were thought to be exclusively type 3 strains. However, evidence for type 2 strains, a group normally associated with Central Asia and the Middle East, has recently been found in archaeological samples in Scandinavia and from two skeletons from the medieval leprosy hospital (or leprosarium) of St Mary Magdalen, near Winchester, England.
RESULTS: Here we report the genotypic analysis and whole genome sequencing of two further ancient M. leprae genomes extracted from the remains of two individuals, Sk14 and Sk27, that were excavated from 10th-12th century burials at the leprosarium of St Mary Magdalen. DNA was extracted from the surfaces of bones showing osteological signs of leprosy. Known M. leprae polymorphisms were PCR amplified and Sanger sequenced, while draft genomes were generated by enriching for M. leprae DNA, and Illumina sequencing. SNP-typing and phylogenetic analysis of the draft genomes placed both of these ancient strains in the conserved type 2 group, with very few novel SNPs compared to other ancient or modern strains.
CONCLUSIONS: The genomes of the two newly sequenced M. leprae strains group firmly with other type 2F strains. Moreover, the M. leprae strain most closely related to one of the strains, Sk14, in the worldwide phylogeny is a contemporaneous ancient St Magdalen skeleton, vividly illustrating the epidemic and clonal nature of leprosy at this site. The prevalence of these type 2 strains indicates that type 2F strains, in contrast to later European and associated North American type 3 isolates, may have been the co-dominant or even the predominant genotype at this location during the 11th century
GenomeRing: alignment visualization based on SuperGenome coordinates
Motivation: The number of completely sequenced genomes is continuously rising, allowing for comparative analyses of genomic variation. Such analyses are often based on whole-genome alignments to elucidate structural differences arising from insertions, deletions or from rearrangement events. Computational tools that can visualize genome alignments in a meaningful manner are needed to help researchers gain new insights into the underlying data. Such visualizations typically are either realized in a linear fashion as in genome browsers or by using a circular approach, where relationships between genomic regions are indicated by arcs. Both methods allow for the integration of additional information such as experimental data or annotations. However, providing a visualization that still allows for a quick and comprehensive interpretation of all important genomic variations together with various supplemental data, which may be highly heterogeneous, remains a challenge
MpsAB is important for Staphylococcus aureus virulence and growth at atmospheric CO2 levels
The mechanisms behind carbon dioxide (CO2) dependency in non-autotrophic bacterial isolates are unclear. Here we show that the Staphylococcus aureus mpsAB operon, known to play a role in membrane potential generation, is crucial for growth at atmospheric CO2 levels. The genes mpsAB can complement an Escherichia coli carbonic anhydrase (CA) mutant, and CA from E. coli can complement the S. aureus delta-mpsABC mutant. In comparison with the wild type, S. aureus mps mutants produce less hemolytic toxin and are less virulent in animal models of infection. Homologs of mpsA and mpsB are widespread among bacteria and are often found adjacent to each other on the genome. We propose that MpsAB represents a dissolved inorganic carbon transporter, or bicarbonate concentrating system, possibly acting as a sodium bicarbonate cotransporter. © 2019, The Author(s)
Visualizing dimensionality reduction of systems biology data
One of the challenges in analyzing high-dimensional expression data is the
detection of important biological signals. A common approach is to apply a
dimension reduction method, such as principal component analysis. Typically,
after application of such a method the data is projected and visualized in the
new coordinate system, using scatter plots or profile plots. These methods
provide good results if the data have certain properties which become visible
in the new coordinate system and which were hard to detect in the original
coordinate system. Often however, the application of only one method does not
suffice to capture all important signals. Therefore several methods addressing
different aspects of the data need to be applied. We have developed a framework
for linear and non-linear dimension reduction methods within our visual
analytics pipeline SpRay. This includes measures that assist the interpretation
of the factorization result. Different visualizations of these measures can be
combined with functional annotations that support the interpretation of the
results. We show an application to high-resolution time series microarray data
in the antibiotic-producing organism Streptomyces coelicolor as well as to
microarray data measuring expression of cells with normal karyotype and cells
with trisomies of human chromosomes 13 and 21
MultiMetEval: comparative and multi-objective analysis of genome-scale metabolic models
Comparative metabolic modelling is emerging as a novel field, supported by the development of reliable and standardized approaches for constructing genome-scale metabolic models in high throughput. New software solutions are needed to allow efficient comparative analysis of multiple models in the context of multiple cellular objectives. Here, we present the user-friendly software framework Multi-Metabolic Evaluator (MultiMetEval), built upon SurreyFBA, which allows the user to compose collections of metabolic models that together can be subjected to flux balance analysis. Additionally, MultiMetEval implements functionalities for multi-objective analysis by calculating the Pareto front between two cellular objectives. Using a previously generated dataset of 38 actinobacterial genome-scale metabolic models, we show how these approaches can lead to exciting novel insights. Firstly, after incorporating several pathways for the biosynthesis of natural products into each of these models, comparative flux balance analysis predicted that species like Streptomyces that harbour the highest diversity of secondary metabolite biosynthetic gene clusters in their genomes do not necessarily have the metabolic network topology most suitable for compound overproduction. Secondly, multi-objective analysis of biomass production and natural product biosynthesis in these actinobacteria shows that the well-studied occurrence of discrete metabolic switches during the change of cellular objectives is inherent to their metabolic network architecture. Comparative and multi-objective modelling can lead to insights that could not be obtained by normal flux balance analyses. MultiMetEval provides a powerful platform that makes these analyses straightforward for biologists. Sources and binaries of MultiMetEval are freely available from https://github.com/PiotrZakrzewski/MetEval/downloads
Prioritizing orphan proteins for further study using phylogenomics and gene expression profiles in Streptomyces coelicolor
BACKGROUND:Streptomyces coelicolor, a model organism of antibiotic producing bacteria, has one of the largest genomes of the bacterial kingdom, including 7825 predicted protein coding genes. A large number of these genes, nearly 34%, are functionally orphan (hypothetical proteins with unknown function). However, in gene expression time course data, many of these functionally orphan genes show interesting expression patterns.RESULTS:In this paper, we analyzed all functionally orphan genes of Streptomyces coelicolor and identified a list of "high priority" orphans by combining gene expression analysis and additional phylogenetic information (i.e. the level of evolutionary conservation of each protein).CONCLUSIONS:The prioritized orphan genes are promising candidates to be examined experimentally in the lab for further characterization of their functio
Mayday SeaSight: Combined Analysis of Deep Sequencing and Microarray Data
Recently emerged deep sequencing technologies offer new high-throughput methods to quantify gene expression, epigenetic modifications and DNA-protein binding. From a computational point of view, the data is very different from that produced by the already established microarray technology, providing a new perspective on the samples under study and complementing microarray gene expression data. Software offering the integrated analysis of data from different technologies is of growing importance as new data emerge in systems biology studies. Mayday is an extensible platform for visual data exploration and interactive analysis and provides many methods for dissecting complex transcriptome datasets. We present Mayday SeaSight, an extension that allows to integrate data from different platforms such as deep sequencing and microarrays. It offers methods for computing expression values from mapped reads and raw microarray data, background correction and normalization and linking microarray probes to genomic coordinates. It is now possible to use Mayday's wealth of methods to analyze sequencing data and to combine data from different technologies in one analysis
Separating the wheat from the chaff: a prioritisation pipeline for the analysis of metabolomics datasets
Liquid Chromatography Mass Spectrometry (LC-MS) is a powerful and widely applied method for the study of biological systems, biomarker discovery and pharmacological interventions. LC-MS measurements are, however, significantly complicated by several technical challenges, including: (1) ionisation suppression/enhancement, disturbing the correct quantification of analytes, and (2) the detection of large amounts of separate derivative ions, increasing the complexity of the spectra, but not their information content. Here we introduce an experimental and analytical strategy that leads to robust metabolome profiles in the face of these challenges. Our method is based on rigorous filtering of the measured signals based on a series of sample dilutions. Such data sets have the additional characteristic that they allow a more robust assessment of detection signal quality for each metabolite. Using our method, almost 80% of the recorded signals can be discarded as uninformative, while important information is retained. As a consequence, we obtain a broader understanding of the information content of our analyses and a better assessment of the metabolites detected in the analyzed data sets. We illustrate the applicability of this method using standard mixtures, as well as cell extracts from bacterial samples. It is evident that this method can be applied in many types of LC-MS analyses and more specifically in untargeted metabolomics
nocoRNAc: Characterization of non-coding RNAs in prokaryotes
<p>Abstract</p> <p>Background</p> <p>The interest in non-coding RNAs (ncRNAs) constantly rose during the past few years because of the wide spectrum of biological processes in which they are involved. This led to the discovery of numerous ncRNA genes across many species. However, for most organisms the non-coding transcriptome still remains unexplored to a great extent. Various experimental techniques for the identification of ncRNA transcripts are available, but as these methods are costly and time-consuming, there is a need for computational methods that allow the detection of functional RNAs in complete genomes in order to suggest elements for further experiments. Several programs for the genome-wide prediction of functional RNAs have been developed but most of them predict a genomic locus with no indication whether the element is transcribed or not.</p> <p>Results</p> <p>We present <smcaps>NOCO</smcaps>RNAc, a program for the genome-wide prediction of ncRNA transcripts in bacteria. <smcaps>NOCO</smcaps>RNAc incorporates various procedures for the detection of transcriptional features which are then integrated with functional ncRNA loci to determine the transcript coordinates. We applied RNAz and <smcaps>NOCO</smcaps>RNAc to the genome of <it>Streptomyces coelicolor </it>and detected more than 800 putative ncRNA transcripts most of them located antisense to protein-coding regions. Using a custom design microarray we profiled the expression of about 400 of these elements and found more than 300 to be transcribed, 38 of them are predicted novel ncRNA genes in intergenic regions. The expression patterns of many ncRNAs are similarly complex as those of the protein-coding genes, in particular many antisense ncRNAs show a high expression correlation with their protein-coding partner.</p> <p>Conclusions</p> <p>We have developed <smcaps>NOCO</smcaps>RNAc, a framework that facilitates the automated characterization of functional ncRNAs. <smcaps>NOCO</smcaps>RNAc increases the confidence of predicted ncRNA loci, especially if they contain transcribed ncRNAs. <smcaps>NOCO</smcaps>RNAc is not restricted to intergenic regions, but it is applicable to the prediction of ncRNA transcripts in whole microbial genomes. The software as well as a user guide and example data is available at <url>http://www.zbit.uni-tuebingen.de/pas/nocornac.htm</url>.</p
- …