96,411 research outputs found
PTOMSM: A modified version of Topological Overlap Measure used for predicting Protein-Protein Interaction Network
A variety of methods are developed to integrating diverse biological data to predict novel interaction relationship between proteins. However, traditional integration can only generate protein interaction pairs within existing relationships. Therefore, we propose a modified version of Topological Overlap Measure to identify not only extant direct PPIs links, but also novel protein interactions that can be indirectly inferred from various relationships between proteins. Our method is more powerful than a naïve Bayesian-network-based integration in PPI prediction, and could generate more reliable candidate PPIs. Furthermore, we examined the influence of the sizes of training and test datasets on prediction, and further demonstrated the effectiveness of PTOMSM in predicting PPI. More importantly, this method can be extended naturally to predict other types of biological networks, and may be combined with Bayesian method to further improve the prediction
metaSHARK: software for automated metabolic network prediction from DNA sequence and its application to the genomes of Plasmodium falciparum and Eimeria tenella
The metabolic SearcH And Reconstruction Kit
(metaSHARK) is a new fully automated software package
for the detection of enzyme-encoding genes
within unannotated genome data and their visualization
in the context of the surrounding metabolic network.
The gene detection package (SHARKhunt) runs
on a Linux systemand requires only a set of raw DNA
sequences (genomic, expressed sequence tag and/
or genome survey sequence) as input. Its output
may be uploaded to our web-based visualization
tool (SHARKview) for exploring and comparing data
from different organisms. We first demonstrate the
utility of the software by comparing its results for
the raw Plasmodium falciparum genome with the
manual annotations available at the PlasmoDB and
PlasmoCyc websites. We then apply SHARKhunt to
the unannotated genome sequences of the coccidian
parasite Eimeria tenella and observe that, at an
E-value cut-off of 10(-20), our software makes 142
additional assertions of enzymatic function compared
with a recent annotation package working
with translated open reading frame sequences. The
ability of the software to cope with low levels of
sequence coverage is investigated by analyzing
assemblies of the E.tenella genome at estimated
coverages from 0.5x to 7.5x. Lastly, as an example
of how metaSHARK can be used to evaluate the
genomic evidence for specific metabolic pathways,
we present a study of coenzyme A biosynthesis in
P.falciparum and E.tenella
Protein (Multi-)Location Prediction: Using Location Inter-Dependencies in a Probabilistic Framework
Knowing the location of a protein within the cell is important for
understanding its function, role in biological processes, and potential use as
a drug target. Much progress has been made in developing computational methods
that predict single locations for proteins, assuming that proteins localize to
a single location. However, it has been shown that proteins localize to
multiple locations. While a few recent systems have attempted to predict
multiple locations of proteins, they typically treat locations as independent
or capture inter-dependencies by treating each locations-combination present in
the training set as an individual location-class. We present a new method and a
preliminary system we have developed that directly incorporates
inter-dependencies among locations into the multiple-location-prediction
process, using a collection of Bayesian network classifiers. We evaluate our
system on a dataset of single- and multi-localized proteins. Our results,
obtained by incorporating inter-dependencies are significantly higher than
those obtained by classifiers that do not use inter-dependencies. The
performance of our system on multi-localized proteins is comparable to a top
performing system (YLoc+), without restricting predictions to be based only on
location-combinations present in the training set.Comment: Peer-reviewed and presented as part of the 13th Workshop on
Algorithms in Bioinformatics (WABI2013
A combined approach for comparative exoproteome analysis of Corynebacterium pseudotuberculosis
Background: Bacterial exported proteins represent key components of the host-pathogen interplay. Hence, we
sought to implement a combined approach for characterizing the entire exoproteome of the pathogenic
bacterium Corynebacterium pseudotuberculosis, the etiological agent of caseous lymphadenitis (CLA) in sheep and
goats.
Results: An optimized protocol of three-phase partitioning (TPP) was used to obtain the C. pseudotuberculosis
exoproteins, and a newly introduced method of data-independent MS acquisition (LC-MSE) was employed for
protein identification and label-free quantification. Additionally, the recently developed tool SurfG+ was used for in
silico prediction of sub-cellular localization of the identified proteins. In total, 93 different extracellular proteins of
C. pseudotuberculosis were identified with high confidence by this strategy; 44 proteins were commonly identified
in two different strains, isolated from distinct hosts, then composing a core C. pseudotuberculosis exoproteome.
Analysis with the SurfG+ tool showed that more than 75% (70/93) of the identified proteins could be predicted as
containing signals for active exportation. Moreover, evidence could be found for probable non-classical export of
most of the remaining proteins.
Conclusions: Comparative analyses of the exoproteomes of two C. pseudotuberculosis strains, in addition to
comparison with other experimentally determined corynebacterial exoproteomes, were helpful to gain novel
insights into the contribution of the exported proteins in the virulence of this bacterium. The results presented
here compose the most comprehensive coverage of the exoproteome of a corynebacterial species so far
Synuclein expression in the lizard Anolis carolinensis
The synuclein (syn) family comprises three proteins: alpha-, beta- and gamma-syns. In humans, they are involved in neurodegenerative diseases such as Parkinson's disease and in tumors. Members of the syn family were sequenced in representative species of all vertebrates and the comparative analysis of amino acid sequences suggests that syns are evolutionarily conserved, but information about their expression in vertebrate lineages is still scarce and completely lacking in reptiles. In this study, the expression of genes coding for alpha-, beta- and gamma-syns was analyzed in the green lizard Anolis carolinensis by semiquantitative RT-PCR and Western blot. Results demonstrate good expression levels of the three syns in the lizard nervous system, similarly to human syns. This, together with the high identity between lizard and human syns, suggests that these proteins fulfill evolutionarily conserved functions. However, differences between lizard and humans in the expression of syn variants (two different variants of gamma-syn were detected in A. carolinensis) and differences in some amino acids in key positions for the regulation of protein conformation and affinity for lipid and metal ions also suggest that these proteins may have acquired different functional specializations in the two lineages
Agents in Bioinformatics
The scope of the Technical Forum Group (TFG) on Agents in Bioinformatics (BIOAGENTS) was to inspire collaboration between the agent and bioinformatics communities with the aim of creating an opportunity to propose a different (agent-based) approach to the development of computational frameworks both for data analysis in bioinformatics and for system modelling in computational biology. During the day, the participants examined the future of research on agents in bioinformatics primarily through 12 invited talks selected to cover the most relevant topics. From the discussions, it became clear that there are many perspectives to the field, ranging from bio-conceptual languages for agent-based simulation, to the definition of bio-ontology-based declarative languages for use by information agents, and to the use of Grid agents, each of which requires further exploration. The interactions between participants encouraged the development of applications that describe a way of creating agent-based simulation models of biological systems, starting from an hypothesis and inferring new knowledge (or relations) by mining and analysing the huge amount of public biological data. In this report we summarise and reflect on the presentations and discussions
The benefits of in silico modeling to identify possible small-molecule drugs and their off-target interactions
Accepted for publication in a future issue of Future Medicinal Chemistry.The research into the use of small molecules as drugs continues to be a key driver in the development of molecular databases, computer-aided drug design software and collaborative platforms. The evolution of computational approaches is driven by the essential criteria that a drug molecule has to fulfill, from the affinity to targets to minimal side effects while having adequate absorption, distribution, metabolism, and excretion (ADME) properties. A combination of ligand- and structure-based drug development approaches is already used to obtain consensus predictions of small molecule activities and their off-target interactions. Further integration of these methods into easy-to-use workflows informed by systems biology could realize the full potential of available data in the drug discovery and reduce the attrition of drug candidates.Peer reviewe
- âŠ