217 research outputs found
Comparative analysis of module-based versus direct methods for reverse-engineering transcriptional regulatory networks
We have compared a recently developed module-based algorithm LeMoNe for
reverse-engineering transcriptional regulatory networks to a mutual information
based direct algorithm CLR, using benchmark expression data and databases of
known transcriptional regulatory interactions for Escherichia coli and
Saccharomyces cerevisiae. A global comparison using recall versus precision
curves hides the topologically distinct nature of the inferred networks and is
not informative about the specific subtasks for which each method is most
suited. Analysis of the degree distributions and a regulator specific
comparison show that CLR is 'regulator-centric', making true predictions for a
higher number of regulators, while LeMoNe is 'target-centric', recovering a
higher number of known targets for fewer regulators, with limited overlap in
the predicted interactions between both methods. Detailed biological examples
in E. coli and S. cerevisiae are used to illustrate these differences and to
prove that each method is able to infer parts of the network where the other
fails. Biological validation of the inferred networks cautions against
over-interpreting recall and precision values computed using incomplete
reference networks.Comment: 13 pages, 1 table, 6 figures + 6 pages supplementary information (1
table, 5 figures
Eukaryotic Evolutionary Transitions Are Associated with Extreme Codon Bias in Functionally-Related Proteins
Codon bias in the genome of an organism influences its phenome by changing the speed and efficiency of mRNA translation and hence protein abundance. We hypothesized that differences in codon bias, either between-species differences in orthologous genes, or within-species differences between genes, may play an evolutionary role. To explore this hypothesis, we compared the genome-wide codon bias in six species that occupy vital positions in the Eukaryotic Tree of Life. We acquired the entire protein coding sequences for these organisms, computed the codon bias for all genes in each organism and explored the output for relationships between codon bias and protein function, both within- and between-lineages. We discovered five notable coordinated patterns, with extreme codon bias most pronounced in traits considered highly characteristic of a given lineage. Firstly, the Homo sapiens genome had stronger codon bias for DNA-binding transcription factors than the Saccharomyces cerevisiae genome, whereas the opposite was true for ribosomal proteins – perhaps underscoring transcriptional regulation in the origin of complexity. Secondly, both mammalian species examined possessed extreme codon bias in genes relating to hair – a tissue unique to mammals. Thirdly, Arabidopsis thaliana showed extreme codon bias in genes implicated in cell wall formation and chloroplast function – which are unique to plants. Fourthly, Gallus gallus possessed strong codon bias in a subset of genes encoding mitochondrial proteins – perhaps reflecting the enhanced bioenergetic efficiency in birds that co-evolved with flight. And lastly, the G. gallus genome had extreme codon bias for the Ciliary Neurotrophic Factor – which may help to explain their spontaneous recovery from deafness. We propose that extreme codon bias in groups of genes that encode functionally related proteins has a pathway-level energetic explanation
Distinct genotypic profiles of the two major clades of Mycobacterium africanum
Background:
Mycobacterium tuberculosis is the principal etiologic agent of human tuberculosis (TB) and a member of the M. tuberculosis complex (MTC). Additional MTC species that cause TB in humans and other mammals include Mycobacterium africanum and Mycobacterium bovis. One result of studies interrogating recently identified MTC phylogenetic markers has been the recognition of at least two distinct lineages of M. africanum, known as West African-1 and West African-2. Methods: We screened a blinded non-random set of MTC strains isolated from TB patients in Ghana (n = 47) for known chromosomal region-of-difference (RD) loci and single nucleotide polymorphisms (SNPs). A MTC PCR-typing panel, single-target standard PCR, multi-primer PCR, PCR-restriction fragment analysis, and sequence analysis of amplified products were among the methods utilized for the comparative evaluation of targets and identification systems. The MTC distributions of novel SNPs were characterized in the both the Ghana collection and two other diverse collections of MTC strains (n = 175 in total). Results: The utility of various polymorphisms as species-, lineage-, and sublineage-defining phylogenetic markers for M. africanum was determined. Novel SNPs were also identified and found to be specific to either M. africanum West African-1 (Rv1332 523; n = 32) or M. africanum West African-2 (nat
751; n = 27). In the final analysis, a strain identification approach that combined multi-primer PCR targeting of the RD loci RD9, RD10, and RD702 was the most simple, straight-forward, and definitive means of distinguishing the two clades of M. africanum from one another and from other MTC species. Conclusion: With this study, we have organized a series of consistent phylogenetically-relevant markers for each of the distinct MTC lineages that share the M. africanum designation. A differential distribution of each M. africanum clade in Western Africa is described
Genotypic Diversity and Drug Susceptibility Patterns among M. tuberculosis Complex Isolates from South-Western Ghana
OBJECTIVE: The aim of this study was to use spoligotyping and large sequence polymorphism (LSP) to study the population structure of M. tuberculosis complex (MTBC) isolates. METHODS: MTBC isolates were identified using standard biochemical procedures, IS6110 PCR, and large sequence polymorphisms. Isolates were further typed using spoligotyping, and the phenotypic drug susceptibility patterns were determined by the proportion method. RESULT: One hundred and sixty-two isolates were characterised by LSP typing. Of these, 130 (80.25%) were identified as Mycobacterium tuberculosis sensu stricto (MTBss), with the Cameroon sub-lineage being dominant (N = 59/130, 45.38%). Thirty-two (19.75%) isolates were classified as Mycobacterium africanum type 1, and of these 26 (81.25%) were identified as West-Africa I, and 6 (18.75%) as West-Africa II. Spoligotyping sub-lineages identified among the MTBss included Haarlem (N = 15, 11.53%), Ghana (N = 22, 16.92%), Beijing (4, 3.08%), EAI (4, 3.08%), Uganda I (4, 3.08%), LAM (2, 1.54%), X (N = 1, 0.77%) and S (2, 1.54%). Nine isolates had SIT numbers with no identified sub-lineages while 17 had no SIT numbers. MTBss isolates were more likely to be resistant to streptomycin (p>0.008) and to any drug resistance (p>0.03) when compared to M. africanum. CONCLUSION: This study demonstrated that overall 36.4% of TB in South-Western Ghana is caused by the Cameroon sub-lineage of MTBC and 20% by M. africanum type 1, including both the West-Africa 1 and West-Africa 2 lineages. The diversity of MTBC in Ghana should be considered when evaluating new TB vaccine
Differential Trends in the Codon Usage Patterns in HIV-1 Genes
Host-pathogen interactions underlie one of the most complex evolutionary phenomena resulting in continual adaptive genetic changes, where pathogens exploit the host's molecular resources for growth and survival, while hosts try to eliminate the pathogen. Deciphering the molecular basis of host–pathogen interactions is useful in understanding the factors governing pathogen evolution and disease propagation. In host-pathogen context, a balance between mutation, selection, and genetic drift is known to maintain codon bias in both organisms. Studies revealing determinants of the bias and its dynamics are central to the understanding of host-pathogen evolution. We considered the Human Immunodeficiency Virus (HIV) type 1 and its human host to search for evolutionary signatures in the viral genome. Positive selection is known to dominate intra-host evolution of HIV-1, whereas high genetic variability underlies the belief that neutral processes drive inter-host differences. In this study, we analyze the codon usage patterns of HIV-1 genomes across all subtypes and clades sequenced over a period of 23 years. We show presence of unique temporal correlations in the codon bias of three HIV-1 genes illustrating differential adaptation of the HIV-1 genes towards the host preferred codons. Our results point towards gene-specific translational selection to be an important force driving the evolution of HIV-1 at the population level
Mycobacterium tuberculosis Lineage Influences Innate Immune Response and Virulence and Is Associated with Distinct Cell Envelope Lipid Profiles
The six major genetic lineages of Mycobacterium tuberculosis are strongly associated with specific geographical regions, but their relevance to bacterial virulence and the clinical consequences of infection are unclear. Previously, we found that in Vietnam, East Asian/Beijing and Indo-Oceanic strains were significantly more likely to cause disseminated tuberculosis with meningitis than those from the Euro-American lineage. To investigate this observation we characterised 7 East Asian/Beijing, 5 Indo-Oceanic and 6 Euro-American Vietnamese strains in bone-marrow-derived macrophages, dendritic cells and mice. East Asian/Beijing and Indo-Oceanic strains induced significantly more TNF-α and IL-1β from macrophages than the Euro-American strains, and East Asian/Beijing strains were detectable earlier in the blood of infected mice and grew faster in the lungs. We hypothesised that these differences were induced by lineage-specific variation in cell envelope lipids. Whole lipid extracts from East Asian/Beijing and Indo-Oceanic strains induced higher concentrations of TNF-α from macrophages than Euro-American lipids. The lipid extracts were fractionated and compared by thin layer chromatography to reveal a distinct pattern of lineage-associated profiles. A phthiotriol dimycocerosate was exclusively produced by East Asian/Beijing strains, but not the phenolic glycolipid previously associated with the hyper-virulent phenotype of some isolates of this lineage. All Indo-Oceanic strains produced a unique unidentified lipid, shown to be a phenolphthiocerol dimycocerosate dependent upon an intact pks15/1 for its production. This was described by Goren as the ‘attenuation indictor lipid’ more than 40 years ago, due to its association with less virulent strains from southern India. Mutation of pks15/1 in a representative Indo-Oceanic strain prevented phenolphthiocerol dimycocerosate synthesis, but did not alter macrophage cytokine induction. Our findings suggest that the early interactions between M. tuberculosis and host are determined by the lineage of the infecting strain; but we were unable to show these differences are driven by lineage-specific cell-surface expressed lipids
Generic Algorithm to Predict the Speed of Translational Elongation: Implications for Protein Biogenesis
Synonymous codon usage and variations in the level of isoaccepting tRNAs exert a powerful selective force on translation fidelity. We have developed an algorithm to evaluate the relative rate of translation which allows large-scale comparisons of the non-uniform translation rate on the protein biogenesis. Using the complete genomes of Escherichia coli and Bacillus subtilis we show that stretches of codons pairing to minor tRNAs form putative sites to locally attenuate translation; thereby the tendency is to cluster in near proximity whereas long contiguous stretches of slow-translating triplets are avoided. The presence of slow-translating segments positively correlates with the protein length irrespective of the protein abundance. The slow-translating clusters are predominantly located down-stream of the domain boundaries presumably to fine-tune translational accuracy with the folding fidelity of multidomain proteins. Translation attenuation patterns at highly structurally and functionally conserved domains are preserved across the species suggesting a concerted selective pressure on the codon selection and species-specific tRNA abundance in these regions
Detection of small RNAs in Bordetella pertussis and identification of a novel repeated genetic element
Background: Small bacterial RNAs (sRNAs) have been shown to participate in the regulation of gene expression and have been identified in numerous prokaryotic species. Some of them are involved in the regulation of virulence in pathogenic bacteria. So far, little is known about sRNAs in Bordetella, and only very few sRNAs have been identified in the genome of Bordetella pertussis, the causative agent of whooping cough. Results: An in silico approach was used to predict sRNAs genes in intergenic regions of the B. pertussis genome. The genome sequences of B. pertussis, Bordetella parapertussis, Bordetella bronchiseptica and Bordetella avium were compared using a Blast, and significant hits were analyzed using RNAz. Twenty-three candidate regions were obtained, including regions encoding the already documented 6S RNA, and the GCVT and FMN riboswitches. The existence of sRNAs was verified by Northern blot analyses, and transcripts were detected for 13 out of the 20 additional candidates. These new sRNAs were named Bordetella pertussis RNAs, bpr. The expression of 4 of them differed between the early, exponential and late growth phases, and one of them, bprJ2, was found to be under the control of BvgA/BvgS two-component regulatory system of Bordetella virulence. A phylogenetic study of the bprJ sequence revealed a novel, so far undocumented repeat of ~90 bp, found in numerous copies in the Bordetella genomes and in that of other Betaproteobacteria. This repeat exhibits certain features of mobil
A Ribosomal Misincorporation of Lys for Arg in Human Triosephosphate Isomerase Expressed in Escherichia coli Gives Rise to Two Protein Populations
We previously observed that human homodimeric triosephosphate isomerase (HsTIM) expressed in Escherichia coli and purified to apparent homogeneity exhibits two significantly different thermal transitions. A detailed exploration of the phenomenon showed that the preparations contain two proteins; one has the expected theoretical mass, while the mass of the other is 28 Da lower. The two proteins were separated by size exclusion chromatography in 3 M urea. Both proteins correspond to HsTIM as shown by Tandem Mass Spectrometry (LC/ESI-MS/MS). The two proteins were present in nearly equimolar amounts under certain growth conditions. They were catalytically active, but differed in molecular mass, thermostability, susceptibility to urea and proteinase K. An analysis of the nucleotides in the human TIM gene revealed the presence of six codons that are not commonly used in E. coli. We examined if they were related to the formation of the two proteins. We found that expression of the enzyme in a strain that contains extra copies of genes that encode for tRNAs that frequently limit translation of heterologous proteins (Arg, Ile, Leu), as well as silent mutations of two consecutive rare Arg codons (positions 98 and 99), led to the exclusive production of the more stable protein. Further analysis by LC/ESI-MS/MS showed that the 28 Da mass difference is due to the substitution of a Lys for an Arg residue at position 99. Overall, our work shows that two proteins with different biochemical and biophysical properties that coexist in the same cell environment are translated from the same nucleotide sequence frame
Leaderless genes in bacteria: clue to the evolution of translation initiation mechanisms in prokaryotes
<p>Abstract</p> <p>Background</p> <p>Shine-Dalgarno (SD) signal has long been viewed as the dominant translation initiation signal in prokaryotes. Recently, leaderless genes, which lack 5'-untranslated regions (5'-UTR) on their mRNAs, have been shown abundant in archaea. However, current large-scale <it>in silico </it>analyses on initiation mechanisms in bacteria are mainly based on the SD-led initiation way, other than the leaderless one. The study of leaderless genes in bacteria remains open, which causes uncertain understanding of translation initiation mechanisms for prokaryotes.</p> <p>Results</p> <p>Here, we study signals in translation initiation regions of all genes over 953 bacterial and 72 archaeal genomes, then make an effort to construct an evolutionary scenario in view of leaderless genes in bacteria. With an algorithm designed to identify multi-signal in upstream regions of genes for a genome, we classify all genes into SD-led, TA-led and atypical genes according to the category of the most probable signal in their upstream sequences. Particularly, occurrence of TA-like signals about 10 bp upstream to translation initiation site (TIS) in bacteria most probably means leaderless genes.</p> <p>Conclusions</p> <p>Our analysis reveals that leaderless genes are totally widespread, although not dominant, in a variety of bacteria. Especially for <it>Actinobacteria </it>and <it>Deinococcus-Thermus</it>, more than twenty percent of genes are leaderless. Analyzed in closely related bacterial genomes, our results imply that the change of translation initiation mechanisms, which happens between the genes deriving from a common ancestor, is linearly dependent on the phylogenetic relationship. Analysis on the macroevolution of leaderless genes further shows that the proportion of leaderless genes in bacteria has a decreasing trend in evolution.</p
- …