640 research outputs found

    The genome sequence of Brucella pinnipedialis B2/94 sheds light on the evolutionary history of the genus Brucella

    Background: Since the discovery of the Malta fever agent, Brucella melitensis, in the 19th century, six terrestrial mammal-associated Brucella species were recognized over the next century. More recently the number of novel Brucella species has increased and among them, isolation of species B. pinnipedialis and B. ceti from marine mammals raised many questions about their origin as well as on the evolutionary history of the whole genus. Results: We report here on the first complete genome sequence of a Brucella strain isolated from marine mammals, Brucella pinnipedialis strain B2/94. A whole gene-based phylogenetic analysis shows that five main groups of host-associated Brucella species rapidly diverged from a likely free-living ancestor close to the recently isolated B. microti. However, this tree lacks the resolution required to resolve the order of divergence of those groups. Comparative analyses focusing on a) genome segments unshared between B. microti and B. pinnipedialis, b) gene deletion/fusion events and c) positions and numbers of Brucella specific IS711 elements in the available Brucella genomes provided enough information to propose a branching order for those five groups. Conclusions: In this study, it appears that the closest relatives of marine mammal Brucella sp. are B. ovis and Brucella sp. NVSL 07-0026 isolated from a baboon, followed by B. melitensis and B. abortus strains, and finally the group consisting of B. suis strains, including B. canis and the group consisting of the single B. neotomae species. We were not able, however, to resolve the order of divergence of the two latter groups.

    Diverse molecular signatures for ribosomally 'active' Perkinsea in marine sediments

    This is the final published PDF. Available from BMC via the DOI in this record. Background: Perkinsea are a parasitic lineage within the eukaryotic superphylum Alveolata. Recent studies making use of environmental small sub-unit ribosomal RNA gene (SSU rDNA) sequencing methodologies have detected a significant diversity and abundance of Perkinsea-like phylotypes in freshwater environments. In contrast only a few Perkinsea environmental sequences have been retrieved from marine samples and only two groups of Perkinsea have been cultured and morphologically described and these are parasites of marine molluscs or marine protists. These two marine groups form separate and distantly related phylogenetic clusters, composed of closely related lineages on SSU rDNA trees. Here, we test the hypothesis that Perkinsea are a hitherto under-sampled group in marine environments. Using 454 diversity ‘tag’ sequencing we investigate the diversity and distribution of these protists in marine sediments and water column samples taken from the Deep Chlorophyll Maximum (DCM) and sub-surface using both DNA and RNA as the source template and sampling four European offshore locations. Results: We detected the presence of 265 sequences branching with known Perkinsea, the majority of them recovered from marine sediments. Moreover, 27% of these sequences were sampled from RNA derived cDNA libraries. Phylogenetic analyses classify a large proportion of these sequences into 38 cluster groups (including 30 novel marine cluster groups), which share less than 97% sequence similarity suggesting this diversity encompasses a range of biologically and ecologically distinct organisms. Conclusions: These results demonstrate that the Perkinsea lineage is considerably more diverse than previously detected in marine environments. This wide diversity of Perkinsea-like protists is largely retrieved in marine sediment with a significant proportion detected in RNA derived libraries suggesting this diversity represents ribosomally ‘active’ and intact cells. Given the phylogenetic range of hosts infected by known Perkinsea parasites, these data suggest that Perkinsea either play a significant but hitherto unrecognized role as parasites in marine sediments and/or members of this group are present in the marine sediment possibly as part of the ‘seed bank’ microbial community. Funding: Marie Curie Intra-European Fellowship grant; EMBO Long-Term fellowship; Gordon and Betty Moore Foundation

    Mimivirus Gene Promoters Exhibit an Unprecedented Conservation among all Eukaryotes

    The initial analysis of the recently sequenced genome of Acanthamoeba polyphaga Mimivirus, the largest known double-stranded DNA virus, predicted a proteome of size and complexity more akin to small parasitic bacteria than to other nucleo-cytoplasmic large DNA viruses, and identified numerous functions never before described in a virus. It has been proposed that the Mimivirus lineage could have emerged before the individualization of cellular organisms from the 3 domains of life. An exhaustive in silico analysis of the non-coding moiety of all known viral genomes now uncovers the unprecedented perfect conservation of an AAAATTGA motif in close to 50% of the Mimivirus genes. This motif preferentially occurs in genes transcribed from the predicted leading strand and is associated with functions required early in the viral infectious cycle, such as transcription and protein translation. A comparison with the known promoters of unicellular eukaryotes, in particular amoebal protists, strongly suggests that the AAAATTGA motif is the structural equivalent of the TATA box core promoter element. This element is specific to the Mimivirus lineage, and may correspond to an ancestral promoter structure predating the radiation of the eukaryotic kingdoms. This unprecedented conservation of core promoter regions is another exceptional feature of Mimivirus, which again raises the question of its evolutionary origin.
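
    As a rough illustration of the kind of measurement described in this abstract, the sketch below computes the fraction of genes whose upstream region contains an exact copy of the AAAATTGA motif. It is not the authors' pipeline: the function names, the dictionary layout, and the toy sequences are all hypothetical, and a real analysis would also need to define the upstream window and handle strand orientation.

```python
MOTIF = "AAAATTGA"  # motif reported in the abstract

def has_motif(upstream, motif=MOTIF):
    """Return True if the upstream sequence contains an exact motif match."""
    return motif in upstream.upper()

def fraction_with_motif(upstream_regions, motif=MOTIF):
    """Fraction of genes whose upstream region carries the motif.

    upstream_regions: dict mapping a gene name to its upstream DNA string
    (hypothetical layout chosen for this sketch).
    """
    if not upstream_regions:
        return 0.0
    hits = sum(1 for seq in upstream_regions.values() if has_motif(seq, motif))
    return hits / len(upstream_regions)

# Toy, made-up sequences used only to exercise the functions.
toy_regions = {
    "gene1": "TTTAAAATTGACC",  # exact match
    "gene2": "GCGCGCGC",       # no match
    "gene3": "aaaattgaTT",     # match after upper-casing
    "gene4": "AAAATTGC",       # one mismatch, so not counted
}
toy_fraction = fraction_with_motif(toy_regions)
```

    Exact string matching is used here because the abstract stresses perfect conservation of the motif; a position weight matrix would be the usual choice for degenerate motifs.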

    Simcluster: clustering enumeration gene expression data on the simplex space

    Transcript enumeration methods such as SAGE, MPSS, and sequencing-by-synthesis EST "digital northern" are important high-throughput techniques for digital gene expression measurement. As with other counting or voting processes, these measurements constitute compositional data exhibiting properties particular to the simplex space, where the summation of the components is constrained. These properties are not present in regular Euclidean spaces, on which hybridization-based microarray data is often modeled. Therefore, pattern recognition methods commonly used for microarray data analysis may be non-informative for the data generated by transcript enumeration techniques, since they ignore certain fundamental properties of this space.

Here we present a software tool, Simcluster, designed to perform clustering analysis for data on the simplex space. We present Simcluster as a stand-alone command-line C package and as a user-friendly on-line tool. Both versions are available at: http://xerad.systemsbiology.net/simcluster.

Simcluster is designed in accordance with a well-established mathematical framework for compositional data analysis, which provides principled procedures for dealing with the simplex space, and is thus applicable in a number of contexts, including enumeration-based gene expression data
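
To make the contrast with Euclidean distance concrete, here is a minimal sketch of the Aitchison distance, a standard dissimilarity from compositional data analysis, computed through the centered log-ratio transform. It is offered as an illustration of working on the simplex, not as Simcluster's actual implementation; the function names and the pseudocount used to avoid log(0) are assumptions of this sketch.

```python
import math

def closure(counts, pseudocount=0.5):
    """Rescale raw tag counts to a composition that sums to 1.

    The pseudocount is an assumption of this sketch; it keeps
    unobserved tags from producing log(0) later on.
    """
    shifted = [c + pseudocount for c in counts]
    total = sum(shifted)
    return [s / total for s in shifted]

def clr(composition):
    """Centered log-ratio: log of each part minus the mean log (log geometric mean)."""
    logs = [math.log(p) for p in composition]
    mean_log = sum(logs) / len(logs)
    return [v - mean_log for v in logs]

def aitchison_distance(counts_a, counts_b, pseudocount=0.5):
    """Euclidean distance between the clr-transformed compositions."""
    a = clr(closure(counts_a, pseudocount))
    b = clr(closure(counts_b, pseudocount))
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Two hypothetical libraries with the same proportions but different depth.
shallow = [10, 20, 30]
deep = [100, 200, 300]
same_profile = aitchison_distance(shallow, deep, pseudocount=0)
```

With the pseudocount set to zero, two libraries that differ only in sequencing depth are at distance zero, which a plain Euclidean distance on raw counts would not report.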

    ElrA binding to the 3′UTR of cyclin E1 mRNA requires polyadenylation elements

    The early cell divisions of Xenopus laevis and other metazoan embryos occur in the presence of constitutively high levels of the cell cycle regulator cyclin E1. Upon completion of the 12th cell division, a time at which many maternal proteins are downregulated by deadenylation and destabilization of their encoding mRNAs, maternal cyclin E1 protein is downregulated while its mRNA is polyadenylated and stable. We report here that stable polyadenylation of cyclin E1 mRNA requires three cis-acting elements in the 3′ untranslated region: the nuclear polyadenylation sequence, a contiguous cytoplasmic polyadenylation element and an upstream AU-rich element. ElrA, the Xenopus homolog of HuR and a member of the ELAV gene family, binds the cyclin E1 3′UTR with high affinity. Deletion of these elements dramatically reduces the affinity of ElrA for the cyclin E1 3′UTR, abolishes polyadenylation and destabilizes the mRNA. Together, these findings provide compelling evidence that ElrA functions in polyadenylation and stabilization of cyclin E1 mRNA via binding these elements.

    Phylogeny.fr: robust phylogenetic analysis for the non-specialist

    Phylogenetic analyses are central to many research areas in biology and typically involve the identification of homologous sequences, their multiple alignment, the phylogenetic reconstruction and the graphical representation of the inferred tree. The Phylogeny.fr platform transparently chains programs to automatically perform these tasks. It is primarily designed for biologists with no experience in phylogeny, but can also meet the needs of specialists; the former will find up-to-date tools chained in a phylogeny pipeline to analyze their data in a simple and robust way, while the specialists will be able to easily build and run sophisticated analyses. Phylogeny.fr offers three main modes. The ‘One Click’ mode targets non-specialists and provides a ready-to-use pipeline chaining programs with recognized accuracy and speed: MUSCLE for multiple alignment, PhyML for tree building, and TreeDyn for tree rendering. All parameters are set up to suit most studies, and users only have to provide their input sequences to obtain a ready-to-print tree. The ‘Advanced’ mode uses the same pipeline but allows the parameters of each program to be customized by users. The ‘A la Carte’ mode offers more flexibility and sophistication, as users can build their own pipeline by selecting and setting up the required steps from a large choice of tools to suit their specific needs. Prior to phylogenetic analysis, users can also collect neighbors of a query sequence by running BLAST on general or specialized databases. A guide tree then helps to select neighbor sequences to be used as input for the phylogeny pipeline. Phylogeny.fr is available at: http://www.phylogeny.fr

    Identification of Lipases Involved in PBAN Stimulated Pheromone Production in Bombyx mori Using the DGE and RNAi Approaches

    BACKGROUND: Pheromone biosynthesis activating neuropeptide (PBAN) is a neurohormone that regulates sex pheromone synthesis in female moths. Bombyx mori is a model organism that has been used to explore the signal transduction pattern of PBAN, which is mediated by a G-protein coupled receptor (GPCR). Although significant progress has been made in elucidating PBAN-regulated lipolysis that releases the precursor of the sex pheromone, little is known about the molecular components involved in this step. To better elucidate the molecular mechanisms of PBAN-stimulated lipolysis of cytoplasmic lipid droplets (LDs), the associated lipase genes involved in PBAN-regulated sex pheromone biosynthesis were identified using digital gene expression (DGE) and subsequent RNA interference (RNAi). RESULTS: Three DGE libraries were constructed from pheromone glands (PGs) at different developmental stages, namely, 72 hours before eclosion (-72 h), new emergence (0 h) and 72 h after eclosion (72 h), to investigate the gene expression profiles during PG development. The DGE evaluated over 5.6 million clean tags in each PG sample and revealed numerous genes that were differentially expressed at these stages. Most importantly, seven lipases were found to be richly expressed during the key stage of sex pheromone synthesis and release (new emergence). RNAi-mediated knockdown confirmed for the first time that four of these seven lipases play important roles in sex pheromone synthesis. CONCLUSION: This study has identified four lipases directly involved in PBAN-stimulated sex pheromone biosynthesis, which improves our understanding of the lipases involved in releasing bombykol precursors from triacylglycerols (TAGs) within the cytoplasmic LDs.

    The Genome of Borrelia recurrentis, the Agent of Deadly Louse-Borne Relapsing Fever, Is a Degraded Subset of Tick-Borne Borrelia duttonii

    In an effort to understand how a tick-borne pathogen adapts to the body louse, we sequenced and compared the genomes of the recurrent fever agents Borrelia recurrentis and B. duttonii. The 1,242,163–1,574,910-bp fragmented genomes of B. recurrentis and B. duttonii contain a unique 23-kb linear plasmid. This linear plasmid exhibits a large polyT track within the promoter region of an intact variable large protein gene and a telomere resolvase that is unique to Borrelia. The genome content is characterized by several repeat families, including antigenic lipoproteins. B. recurrentis exhibited a 20.4% genome size reduction and appeared to be a strain of B. duttonii, with a decaying genome, possibly due to the accumulation of genomic errors induced by the loss of recA and mutS. Accompanying this were increases in the number of impaired genes and a reduction in coding capacity, including surface-exposed lipoproteins and putative virulence factors. Analysis of the reconstructed ancestral sequence compared to B. duttonii and B. recurrentis was consistent with the accelerated evolution observed in B. recurrentis. Vector specialization of louse-borne pathogens responsible for major epidemics was associated with rapid genome reduction. The correlation between gene loss and increased virulence of B. recurrentis parallels that of Rickettsia prowazekii, with both species being genomic subsets of less-virulent strains.

    Modeling SAGE tag formation and its effects on data interpretation within a Bayesian framework

    Background: Serial Analysis of Gene Expression (SAGE) is a high-throughput method for inferring mRNA expression levels from the experimentally generated sequence-based tags. Standard analyses of SAGE data, however, ignore the fact that the probability of generating an observable tag varies across genes and between experiments. As a consequence, these analyses result in biased estimators and posterior probability intervals for gene expression levels in the transcriptome. Results: Using the yeast Saccharomyces cerevisiae as an example, we introduce a new Bayesian method of data analysis which is based on a model of SAGE tag formation. Our approach incorporates the variation in the probability of tag formation into the interpretation of SAGE data and allows us to derive exact joint and approximate marginal posterior distributions for the mRNA frequency of genes detectable using SAGE. Our analysis of these distributions indicates that the frequency of a gene in the tag pool is influenced by its mRNA frequency, the cleavage efficiency of the anchoring enzyme (AE), and the number of informative and uninformative AE cleavage sites within its mRNA. Conclusion: With a mechanistic, model-based approach for SAGE data analysis, we find that inter-genic variation in SAGE tag formation is large. However, this variation can be estimated and, importantly, accounted for using the methods we develop here. As a result, SAGE-based estimates of mRNA frequencies can be adjusted to remove the bias introduced by the SAGE tag formation process.