78 research outputs found

    Expression profiling of the schizont and trophozoite stages of Plasmodium falciparum with a long-oligonucleotide microarray

    Get PDF
    BACKGROUND: The worldwide persistence of drug-resistant Plasmodium falciparum, the most lethal variety of human malaria, is a global health concern. The P. falciparum sequencing project has brought new opportunities for identifying molecular targets for antimalarial drug and vaccine development. RESULTS: We developed a software package, ArrayOligoSelector, to design an open reading frame (ORF)-specific DNA microarray using the publicly available P. falciparum genome sequence. Each gene was represented by one or more long 70 mer oligonucleotides selected on the basis of uniqueness within the genome, exclusion of low-complexity sequence, balanced base composition and proximity to the 3' end. A first-generation microarray representing approximately 6,000 ORFs of the P. falciparum genome was constructed. Array performance was evaluated through the use of control oligonucleotide sets with increasing levels of introduced mutations, as well as traditional northern blotting. Using this array, we extensively characterized the gene-expression profile of the intraerythrocytic trophozoite and schizont stages of P. falciparum. The results revealed extensive transcriptional regulation of genes specialized for processes specific to these two stages. CONCLUSIONS: DNA microarrays based on long oligonucleotides are powerful tools for the functional annotation and exploration of the P. falciparum genome. Expression profiling of trophozoites and schizonts revealed genes associated with stage-specific processes and may serve as the basis for future drug targets and vaccine development

    Transcriptomic and proteomic analyses of Desulfovibrio vulgaris biofilms: carbon and energy flow contribute to the distinct biofilm growth state.

    Get PDF
    BackgroundDesulfovibrio vulgaris Hildenborough is a sulfate-reducing bacterium (SRB) that is intensively studied in the context of metal corrosion and heavy-metal bioremediation, and SRB populations are commonly observed in pipe and subsurface environments as surface-associated populations. In order to elucidate physiological changes associated with biofilm growth at both the transcript and protein level, transcriptomic and proteomic analyses were done on mature biofilm cells and compared to both batch and reactor planktonic populations. The biofilms were cultivated with lactate and sulfate in a continuously fed biofilm reactor, and compared to both batch and reactor planktonic populations.ResultsThe functional genomic analysis demonstrated that biofilm cells were different compared to planktonic cells, and the majority of altered abundances for genes and proteins were annotated as hypothetical (unknown function), energy conservation, amino acid metabolism, and signal transduction. Genes and proteins that showed similar trends in detected levels were particularly involved in energy conservation such as increases in an annotated ech hydrogenase, formate dehydrogenase, pyruvate:ferredoxin oxidoreductase, and rnf oxidoreductase, and the biofilm cells had elevated formate dehydrogenase activity. Several other hydrogenases and formate dehydrogenases also showed an increased protein level, while decreased transcript and protein levels were observed for putative coo hydrogenase as well as a lactate permease and hyp hydrogenases for biofilm cells. Genes annotated for amino acid synthesis and nitrogen utilization were also predominant changers within the biofilm state. Ribosomal transcripts and proteins were notably decreased within the biofilm cells compared to exponential-phase cells but were not as low as levels observed in planktonic, stationary-phase cells. Several putative, extracellular proteins (DVU1012, 1545) were also detected in the extracellular fraction from biofilm cells.ConclusionsEven though both the planktonic and biofilm cells were oxidizing lactate and reducing sulfate, the biofilm cells were physiologically distinct compared to planktonic growth states due to altered abundances of genes/proteins involved in carbon/energy flow and extracellular structures. In addition, average expression values for multiple rRNA transcripts and respiratory activity measurements indicated that biofilm cells were metabolically more similar to exponential-phase cells although biofilm cells are structured differently. The characterization of physiological advantages and constraints of the biofilm growth state for sulfate-reducing bacteria will provide insight into bioremediation applications as well as microbially-induced metal corrosion

    The structure of pyogenecin immunity protein, a novel bacteriocin-like immunity protein from Streptococcus pyogenes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Many Gram-positive lactic acid bacteria (LAB) produce anti-bacterial peptides and small proteins called bacteriocins, which enable them to compete against other bacteria in the environment. These peptides fall structurally into three different classes, I, II, III, with class IIa being pediocin-like single entities and class IIb being two-peptide bacteriocins. Self-protective cognate immunity proteins are usually co-transcribed with these toxins. Several examples of cognates for IIa have already been solved structurally. <it>Streptococcus pyogenes</it>, closely related to LAB, is one of the most common human pathogens, so knowledge of how it competes against other LAB species is likely to prove invaluable.</p> <p>Results</p> <p>We have solved the crystal structure of the gene-product of locus Spy_2152 from <it>S. pyogenes</it>, (PDB:<ext-link ext-link-id="2fu2" ext-link-type="pdb">2fu2</ext-link>), and found it to comprise an anti-parallel four-helix bundle that is structurally similar to other bacteriocin immunity proteins. Sequence analyses indicate this protein to be a possible immunity protein protective against class IIa or IIb bacteriocins. However, given that <it>S. pyogenes </it>appears to lack any IIa pediocin-like proteins but does possess class IIb bacteriocins, we suggest this protein confers immunity to IIb-like peptides.</p> <p>Conclusions</p> <p>Combined structural, genomic and proteomic analyses have allowed the identification and <it>in silico </it>characterization of a new putative immunity protein from <it>S. pyogenes</it>, possibly the first structure of an immunity protein protective against potential class IIb two-peptide bacteriocins. We have named the two pairs of putative bacteriocins found in <it>S. pyogenes </it>pyogenecin 1, 2, 3 and 4.</p

    Functional responses of methanogenic archaea to syntrophic growth.

    Get PDF
    Methanococcus maripaludis grown syntrophically with Desulfovibrio vulgaris was compared with M. maripaludis monocultures grown under hydrogen limitation using transcriptional, proteomic and metabolite analyses. These measurements indicate a decrease in transcript abundance for energy-consuming biosynthetic functions in syntrophically grown M. maripaludis, with an increase in transcript abundance for genes involved in the energy-generating central pathway for methanogenesis. Compared with growth in monoculture under hydrogen limitation, the response of paralogous genes, such as those coding for hydrogenases, often diverged, with transcripts of one variant increasing in relative abundance, whereas the other was little changed or significantly decreased in abundance. A common theme was an apparent increase in transcripts for functions using H(2) directly as reductant, versus those using the reduced deazaflavin (coenzyme F(420)). The greater importance of direct reduction by H(2) was supported by improved syntrophic growth of a deletion mutant in an F(420)-dependent dehydrogenase of M. maripaludis. These data suggest that paralogous genes enable the methanogen to adapt to changing substrate availability, sustaining it under environmental conditions that are often near the thermodynamic threshold for growth. Additionally, the discovery of interspecies alanine transfer adds another metabolic dimension to this environmentally relevant mutualism

    GraPE: fast and scalable Graph Processing and Embedding

    Full text link
    Graph Representation Learning methods have enabled a wide range of learning problems to be addressed for data that can be represented in graph form. Nevertheless, several real world problems in economy, biology, medicine and other fields raised relevant scaling problems with existing methods and their software implementation, due to the size of real world graphs characterized by millions of nodes and billions of edges. We present GraPE, a software resource for graph processing and random walk based embedding, that can scale with large and high-degree graphs and significantly speed up-computation. GraPE comprises specialized data structures, algorithms, and a fast parallel implementation that displays everal orders of magnitude improvement in empirical space and time complexity compared to state of the art software resources, with a corresponding boost in the performance of machine learning methods for edge and node label prediction and for the unsupervised analysis of graphs.GraPE is designed to run on laptop and desktop computers, as well as on high performance computing cluster

    KG-COVID-19: A Framework to Produce Customized Knowledge Graphs for COVID-19 Response.

    Get PDF
    Integrated, up-to-date data about SARS-CoV-2 and COVID-19 is crucial for the ongoing response to the COVID-19 pandemic by the biomedical research community. While rich biological knowledge exists for SARS-CoV-2 and related viruses (SARS-CoV, MERS-CoV), integrating this knowledge is difficult and time-consuming, since much of it is in siloed databases or in textual format. Furthermore, the data required by the research community vary drastically for different tasks; the optimal data for a machine learning task, for example, is much different from the data used to populate a browsable user interface for clinicians. To address these challenges, we created KG-COVID-19, a flexible framework that ingests and integrates heterogeneous biomedical data to produce knowledge graphs (KGs), and applied it to create a KG for COVID-19 response. This KG framework also can be applied to other problems in which siloed biomedical data must be quickly integrated for different research applications, including future pandemics

    KG-Hub-building and exchanging biological knowledge graphs.

    Get PDF
    MOTIVATION: Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous data and making inferences in biology and many other domains, but a coherent solution for constructing, exchanging, and facilitating the downstream use of KGs is lacking. RESULTS: Here we present KG-Hub, a platform that enables standardized construction, exchange, and reuse of KGs. Features include a simple, modular extract-transform-load pattern for producing graphs compliant with Biolink Model (a high-level data model for standardizing biological data), easy integration of any OBO (Open Biological and Biomedical Ontologies) ontology, cached downloads of upstream data sources, versioned and automatically updated builds with stable URLs, web-browsable storage of KG artifacts on cloud infrastructure, and easy reuse of transformed subgraphs across projects. Current KG-Hub projects span use cases including COVID-19 research, drug repurposing, microbial-environmental interactions, and rare disease research. KG-Hub is equipped with tooling to easily analyze and manipulate KGs. KG-Hub is also tightly integrated with graph machine learning (ML) tools which allow automated graph ML, including node embeddings and training of models for link prediction and node classification. AVAILABILITY AND IMPLEMENTATION: https://kghub.org

    The Sorcerer II Global Ocean Sampling Expedition: Expanding the Universe of Protein Families

    Get PDF
    Metagenomics projects based on shotgun sequencing of populations of micro-organisms yield insight into protein families. We used sequence similarity clustering to explore proteins with a comprehensive dataset consisting of sequences from available databases together with 6.12 million proteins predicted from an assembly of 7.7 million Global Ocean Sampling (GOS) sequences. The GOS dataset covers nearly all known prokaryotic protein families. A total of 3,995 medium- and large-sized clusters consisting of only GOS sequences are identified, out of which 1,700 have no detectable homology to known families. The GOS-only clusters contain a higher than expected proportion of sequences of viral origin, thus reflecting a poor sampling of viral diversity until now. Protein domain distributions in the GOS dataset and current protein databases show distinct biases. Several protein domains that were previously categorized as kingdom specific are shown to have GOS examples in other kingdoms. About 6,000 sequences (ORFans) from the literature that heretofore lacked similarity to known proteins have matches in the GOS data. The GOS dataset is also used to improve remote homology detection. Overall, besides nearly doubling the number of current proteins, the predicted GOS proteins also add a great deal of diversity to known protein families and shed light on their evolution. These observations are illustrated using several protein families, including phosphatases, proteases, ultraviolet-irradiation DNA damage repair enzymes, glutamine synthetase, and RuBisCO. The diversity added by GOS data has implications for choosing targets for experimental structure characterization as part of structural genomics efforts. Our analysis indicates that new families are being discovered at a rate that is linear or almost linear with the addition of new sequences, implying that we are still far from discovering all protein families in nature

    Towards a Rigorous Network of Protein-Protein Interactions of the Model Sulfate Reducer Desulfovibrio vulgaris Hildenborough

    Get PDF
    Protein–protein interactions offer an insight into cellular processes beyond what may be obtained by the quantitative functional genomics tools of proteomics and transcriptomics. The aforementioned tools have been extensively applied to study Escherichia coli and other aerobes and more recently to study the stress response behavior of Desulfovibrio vulgaris Hildenborough, a model obligate anaerobe and sulfate reducer and the subject of this study. Here we carried out affinity purification followed by mass spectrometry to reconstruct an interaction network among 12 chromosomally encoded bait and 90 prey proteins based on 134 bait-prey interactions identified to be of high confidence. Protein-protein interaction data are often plagued by the lack of adequate controls and replication analyses necessary to assess confidence in the results, including identification of potential false positives. We addressed these issues through the use of biological replication, exponentially modified protein abundance indices, results from an experimental negative control, and a statistical test to assign confidence to each putative interacting pair applicable to small interaction data studies. We discuss the biological significance of metabolic features of D. vulgaris revealed by these protein-protein interaction data and the observed protein modifications. These include the distinct role of the putative carbon monoxide-induced hydrogenase, unique electron transfer routes associated with different oxidoreductases, and the possible role of methylation in regulating sulfate reduction

    Impact of elevated nitrate on sulfate-reducing bacteria: A comparative study of Desulfovibrio vulgaris

    Get PDF
    Sulfate-reducing bacteria have been extensively studied for their potential in heavy-metal bioremediation. However, the occurrence of elevated nitrate in contaminated environments has been shown to inhibit sulfate reduction activity. Although the inhibition has been suggested to result from the competition with nitrate-reducing bacteria, the possibility of direct inhibition of sulfate reducers by elevated nitrate needs to be explored. Using Desulfovibrio vulgaris as a model sulfate-reducing bacterium, functional genomics analysis reveals that osmotic stress contributed to growth inhibition by nitrate as shown by the upregulation of the glycine/betaine transporter genes and the relief of nitrate inhibition by osmoprotectants. The observation that significant growth inhibition was effected by 70 mM NaNO{sub 3} but not by 70 mM NaCl suggests the presence of inhibitory mechanisms in addition to osmotic stress. The differential expression of genes characteristic of nitrite stress responses, such as the hybrid cluster protein gene, under nitrate stress condition further indicates that nitrate stress response by D. vulgaris was linked to components of both osmotic and nitrite stress responses. The involvement of the oxidative stress response pathway, however, might be the result of a more general stress response. Given the low similarities between the response profiles to nitrate and other stresses, less-defined stress response pathways could also be important in nitrate stress, which might involve the shift in energy metabolism. The involvement of nitrite stress response upon exposure to nitrate may provide detoxification mechanisms for nitrite, which is inhibitory to sulfate-reducing bacteria, produced by microbial nitrate reduction as a metabolic intermediate and may enhance the survival of sulfate-reducing bacteria in environments with elevated nitrate level
    corecore