100 research outputs found

    Rampant Adaptive Evolution in Regions of Proteins with Unknown Function in Drosophila simulans

    Get PDF
    Adaptive protein evolution is pervasive in Drosophila. Genomic studies, thus far, have analyzed each protein as a single entity. However, the targets of adaptive events may be localized to particular parts of proteins, such as protein domains or regions involved in protein folding. We compared the population genetic mechanisms driving sequence polymorphism and divergence in defined protein domains and non-domain regions. Interestingly, we find that non-domain regions of proteins are more frequent targets of directional selection. Protein domains are also evolving under directional selection, but appear to be under stronger purifying selection than non-domain regions. Non-domain regions of proteins clearly play a major role in adaptive protein evolution on a genomic scale and merit future investigations of their functional properties

    The InterPro protein families and domains database: 20 years on

    Get PDF
    The InterPro database (https://www.ebi.ac.uk/interpro/) provides an integrative classification of protein sequences into families, and identifies functionally important domains and conserved sites. InterProScan is the underlying software that allows protein and nucleic acid sequences to be searched against InterPro's signatures. Signatures are predictive models which describe protein families, domains or sites, and are provided by multiple databases. InterPro combines signatures representing equivalent families, domains or sites, and provides additional information such as descriptions, literature references and Gene Ontology (GO) terms, to produce a comprehensive resource for protein classification. Founded in 1999, InterPro has become one of the most widely used resources for protein family annotation. Here, we report the status of InterPro (version 81.0) in its 20th year of operation, and its associated software, including updates to database content, the release of a new website and REST API, and performance improvements in InterProScan

    ComPath: comparative enzyme analysis and annotation in pathway/subsystem contexts

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Once a new genome is sequenced, one of the important questions is to determine the presence and absence of biological pathways. Analysis of biological pathways in a genome is a complicated task since a number of biological entities are involved in pathways and biological pathways in different organisms are not identical. Computational pathway identification and analysis thus involves a number of computational tools and databases and typically done in comparison with pathways in other organisms. This computational requirement is much beyond the capability of biologists, so information systems for reconstructing, annotating, and analyzing biological pathways are much needed. We introduce a new comparative pathway analysis workbench, ComPath, which integrates various resources and computational tools using an interactive spreadsheet-style web interface for reliable pathway analyses.</p> <p>Results</p> <p>ComPath allows users to compare biological pathways in multiple genomes using a spreadsheet style web interface where various sequence-based analysis can be performed either to compare enzymes (e.g. sequence clustering) and pathways (e.g. pathway hole identification), to search a genome for <it>de novo </it>prediction of enzymes, or to annotate a genome in comparison with reference genomes of choice. To fill in pathway holes or make <it>de novo </it>enzyme predictions, multiple computational methods such as FASTA, Whole-HMM, CSR-HMM (a method of our own introduced in this paper), and PDB-domain search are integrated in ComPath. Our experiments show that FASTA and CSR-HMM search methods generally outperform Whole-HMM and PDB-domain search methods in terms of sensitivity, but FASTA search performs poorly in terms of specificity, detecting more false positive as E-value cutoff increases. Overall, CSR-HMM search method performs best in terms of both sensitivity and specificity. Gene neighborhood and pathway neighborhood (global network) visualization tools can be used to get context information that is complementary to conventional KEGG map representation.</p> <p>Conclusion</p> <p>ComPath is an interactive workbench for pathway reconstruction, annotation, and analysis where experts can perform various sequence, domain, context analysis, using an intuitive and interactive spreadsheet-style interface. </p

    RASOnD - A comprehensive resource and search tool for RAS superfamily oncogenes from various species

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Ras superfamily plays an important role in the control of cell signalling and division. Mutations in the Ras genes convert them into active oncogenes. The Ras oncogenes form a major thrust of global cancer research as they are involved in the development and progression of tumors. This has resulted in the exponential growth of data on Ras superfamily across different public databases and in literature. However, no dedicated public resource is currently available for data mining and analysis on this family. The present database was developed to facilitate straightforward accession, retrieval and analysis of information available on Ras oncogenes from one particular site.</p> <p>Description</p> <p>We have developed the RAS Oncogene Database (RASOnD) as a comprehensive knowledgebase that provides integrated and curated information on a single platform for oncogenes of Ras superfamily. RASOnD encompasses exhaustive genomics and proteomics data existing across diverse publicly accessible databases. This resource presently includes overall 199,046 entries from 101 different species. It provides a search tool to generate information about their nucleotide and amino acid sequences, single nucleotide polymorphisms, chromosome positions, orthologies, motifs, structures, related pathways and associated diseases. We have implemented a number of user-friendly search interfaces and sequence analysis tools. At present the user can (i) browse the data (ii) search any field through a simple or advance search interface and (iii) perform a BLAST search and subsequently CLUSTALW multiple sequence alignment by selecting sequences of Ras oncogenes. The Generic gene browser, GBrowse, JMOL for structural visualization and TREEVIEW for phylograms have been integrated for clear perception of retrieved data. External links to related databases have been included in RASOnD.</p> <p>Conclusions</p> <p>This database is a resource and search tool dedicated to Ras oncogenes. It has utility to cancer biologists and cell molecular biologists as it is a ready source for research, identification and elucidation of the role of these oncogenes. The data generated can be used for understanding the relationship between the Ras oncogenes and their association with cancer. The database updated monthly is freely accessible online at <url>http://202.141.47.181/rasond/</url> and <url>http://www.aiims.edu/RAS.html</url>.</p

    Calbindin-D32k Is Localized to a Subpopulation of Neurons in the Nervous System of the Sea Cucumber Holothuria glaberrima (Echinodermata)

    Get PDF
    Members of the calbindin subfamily serve as markers of subpopulations of neurons within the vertebrate nervous system. Although markers of these proteins are widely available and used, their application to invertebrate nervous systems has been very limited. In this study we investigated the presence and distribution of members of the calbindin subfamily in the sea cucumber Holothuria glaberrima (Selenka, 1867). Immunohistological experiments with antibodies made against rat calbindin 1, parvalbumin, and calbindin 2, showed that these antibodies labeled cells and fibers within the nervous system of H. glaberrima. Most of the cells and fibers were co-labeled with the neural-specific marker RN1, showing their neural specificity. These were distributed throughout all of the nervous structures, including the connective tissue plexi of the body wall and podia. Bioinformatics analyses of the possible antigen recognized by these markers showed that a calbindin 2-like protein present in the sea urchin Strongylocentrotus purpuratus, corresponded to the calbindin-D32k previously identified in other invertebrates. Western blots with anti-calbindin 1 and anti-parvalbumin showed that these markers recognized an antigen of approximately 32 kDa in homogenates of radial nerve cords of H. glaberrima and Lytechinus variegatus. Furthermore, immunoreactivity with anti-calbindin 1 and anti-parvalbumin was obtained to a fragment of calbindin-D32k of H. glaberrima. Our findings suggest that calbindin-D32k is present in invertebrates and its sequence is more similar to the vertebrate calbindin 2 than to calbindin 1. Thus, characterization of calbindin-D32k in echinoderms provides an important view of the evolution of this protein family and represents a valuable marker to study the nervous system of invertebrates

    Plasmodium falciparum Hep1 is required to prevent the self aggregation of PfHsp70-3

    Get PDF
    The majority of mitochondrial proteins are encoded in the nucleus and need to be imported from the cytosol into the mitochondria, and molecular chaperones play a key role in the efficient translocation and proper folding of these proteins in the matrix. One such molecular chaperone is the eukaryotic mitochondrial heat shock protein 70 (Hsp70); however, it is prone to self-aggregation and requires the presence of an essential zinc-finger protein, Hsp70-escort protein 1 (Hep1), to maintain its structure and function. PfHsp70-3, the only Hsp70 predicted to localize in the mitochondria of P. falciparum, may also rely on a Hep1 orthologue to prevent self-aggregation. In this study, we identified a putative Hep1 orthologue in P. falciparum and co-expression of PfHsp70-3 and PfHep1 enhanced the solubility of PfHsp70-3. PfHep1 suppressed the thermally induced aggregation of PfHsp70-3 but not the aggregation of malate dehydrogenase or citrate synthase, thus showing specificity for PfHsp70-3. Zinc ions were indeed essential for maintaining the function of PfHep1, as EDTA chelation abrogated its abilities to suppress the aggregation of PfHsp70-3. Soluble and functional PfHsp70-3, acquired by co-expression with PfHep-1, will facilitate the biochemical characterisation of this particular Hsp70 protein and its evaluation as a drug target for the treatment of malaria

    AXY3 encodes a α-xylosidase that impacts the structure and accessibility of the hemicellulose xyloglucan in Arabidopsis plant cell walls

    Get PDF
    Xyloglucan is the most abundant hemicellulose in the walls of dicots such as Arabidopsis. It is part of the load-bearing structure of a plant cell and its metabolism is thought to play a major role in cell elongation. However, the molecular mechanism by which xyloglucan carries out this and other functions in planta is not well understood. We performed a forward genetic screen utilizing xyloglucan oligosaccharide mass profiling on chemically mutagenized Arabidopsis seedlings to identify mutants with altered xyloglucan structures termed axy-mutants. One of the identified mutants, axy3.1, contains xyloglucan with a higher proportion of non-fucosylated xyloglucan subunits. Mapping revealed that axy3.1 contains a point mutation in XYLOSIDASE1 (XYL1) known to encode for an apoplastic glycoside hydrolase releasing xylosyl residues from xyloglucan oligosaccharides at the non-reducing end. The data support the hypothesis that AXY3/XYL1 is an essential component of the apoplastic xyloglucan degradation machinery and as a result of the lack of function in the various axy3-alleles leads not only to an altered xyloglucan structure but also a xyloglucan that is less tightly associated with other wall components. However, the plant can cope with the excess xyloglucan relatively well as the mutant does not display any visible growth or morphological phenotypes with the notable exception of shorter siliques and reduced fitness. Taken together, these results demonstrate that plant apoplastic hydrolases have a larger impact on wall polymer structure and function than previously thought

    Characterization of a Novel Binding Protein for Fortilin/TCTP — Component of a Defense Mechanism against Viral Infection in Penaeus monodon

    Get PDF
    The Fortilin (also known as TCTP) in Penaeus monodon (PmFortilin) and Fortilin Binding Protein 1 (FBP1) have recently been shown to interact and to offer protection against the widespread White Spot Syndrome Virus infection. However, the mechanism is yet unknown. We investigated this interaction in detail by a number of in silico and in vitro analyses, including prediction of a binding site between PmFortilin/FBP1 and docking simulations. The basis of the modeling analyses was well-conserved PmFortilin orthologs, containing a Ca2+-binding domain at residues 76–110 representing a section of the helical domain, the translationally controlled tumor protein signature 1 and 2 (TCTP_1, TCTP_2) at residues 45–55 and 123–145, respectively. We found the pairs Cys59 and Cys76 formed a disulfide bond in the C-terminus of FBP1, which is a common structural feature in many exported proteins and the “x–G–K–K” pattern of the amidation site at the end of the C-terminus. This coincided with our previous work, where we found the “x–P–P–x” patterns of an antiviral peptide also to be located in the C-terminus of FBP1. The combined bioinformatics and in vitro results indicate that FBP1 is a transmembrane protein and FBP1 interact with N-terminal region of PmFortilin

    Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium

    Get PDF
    Clostridium autoethanogenum is an acetogenic bacterium capable of producing high value commodity chemicals and biofuels from the C1 gases present in synthesis gas. This common industrial waste gas can act as the sole energy and carbon source for the bacterium that converts the low value gaseous components into cellular building blocks and industrially relevant products via the action of the reductive acetyl-CoA (Wood-Ljungdahl) pathway. Current research efforts are focused on the enhancement and extension of product formation in this organism via synthetic biology approaches. However, crucial to metabolic modelling and directed pathway engineering is a reliable and comprehensively annotated genome sequence

    Characterization of Profilin Polymorphism in Pollen with a Focus on Multifunctionality

    Get PDF
    Profilin, a multigene family involved in actin dynamics, is a multiple partners-interacting protein, as regard of the presence of at least of three binding domains encompassing actin, phosphoinositide lipids, and poly-L-proline interacting patches. In addition, pollen profilins are important allergens in several species like Olea europaea L. (Ole e 2), Betula pendula (Bet v 2), Phleum pratense (Phl p 12), Zea mays (Zea m 12) and Corylus avellana (Cor a 2). In spite of the biological and clinical importance of these molecules, variability in pollen profilin sequences has been poorly pointed out up until now. In this work, a relatively high number of pollen profilin sequences have been cloned, with the aim of carrying out an extensive characterization of their polymorphism among 24 olive cultivars and the above mentioned plant species. Our results indicate a high level of variability in the sequences analyzed. Quantitative intra-specific/varietal polymorphism was higher in comparison to inter-specific/cultivars comparisons. Multi-optional posttranslational modifications, e.g. phosphorylation sites, physicochemical properties, and partners-interacting functional residues have been shown to be affected by profilin polymorphism. As a result of this variability, profilins yielded a clear taxonomic separation between the five plant species. Profilin family multifunctionality might be inferred by natural variation through profilin isovariants generated among olive germplasm, as a result of polymorphism. The high variability might result in both differential profilin properties and differences in the regulation of the interaction with natural partners, affecting the mechanisms underlying the transmission of signals throughout signaling pathways in response to different stress environments. Moreover, elucidating the effect of profilin polymorphism in adaptive responses like actin dynamics, and cellular behavior, represents an exciting research goal for the future
    corecore