58 research outputs found

    GeneViTo: Visualizing gene-product functional and structural features in genomic datasets

    Get PDF
    BACKGROUND: The availability of increasing amounts of sequence data from completely sequenced genomes boosts the development of new computational methods for automated genome annotation and comparative genomics. Therefore, there is a need for tools that facilitate the visualization of raw data and results produced by bioinformatics analysis, providing new means for interactive genome exploration. Visual inspection can be used as a basis to assess the quality of various analysis algorithms and to aid in-depth genomic studies. RESULTS: GeneViTo is a JAVA-based computer application that serves as a workbench for genome-wide analysis through visual interaction. The application deals with various experimental information concerning both DNA and protein sequences (derived from public sequence databases or proprietary data sources) and meta-data obtained by various prediction algorithms, classification schemes or user-defined features. Interaction with a Graphical User Interface (GUI) allows easy extraction of genomic and proteomic data referring to the sequence itself, sequence features, or general structural and functional features. Emphasis is laid on the potential comparison between annotation and prediction data in order to offer a supplement to the provided information, especially in cases of "poor" annotation, or an evaluation of available predictions. Moreover, desired information can be output in high quality JPEG image files for further elaboration and scientific use. A compilation of properly formatted GeneViTo input data for demonstration is available to interested readers for two completely sequenced prokaryotes, Chlamydia trachomatis and Methanococcus jannaschii. CONCLUSIONS: GeneViTo offers an inspectional view of genomic functional elements, concerning data stemming both from database annotation and analysis tools for an overall analysis of existing genomes. The application is compatible with Linux or Windows ME-2000-XP operating systems, provided that the appropriate Java Runtime Environment is already installed in the system

    Refinement of 2-Amino-6-(4-methyl-1-piperazinyl)-4-(tricyclo[3.3.1.1 3,7

    Full text link

    Prediction of peptide and protein propensity for amyloid formation

    Get PDF
    Understanding which peptides and proteins have the potential to undergo amyloid formation and what driving forces are responsible for amyloid-like fiber formation and stabilization remains limited. This is mainly because proteins that can undergo structural changes, which lead to amyloid formation, are quite diverse and share no obvious sequence or structural homology, despite the structural similarity found in the fibrils. To address these issues, a novel approach based on recursive feature selection and feed-forward neural networks was undertaken to identify key features highly correlated with the self-assembly problem. This approach allowed the identification of seven physicochemical and biochemical properties of the amino acids highly associated with the self-assembly of peptides and proteins into amyloid-like fibrils (normalized frequency of β-sheet, normalized frequency of β-sheet from LG, weights for β-sheet at the window position of 1, isoelectric point, atom-based hydrophobic moment, helix termination parameter at position j+1 and ΔGº values for peptides extrapolated in 0 M urea). Moreover, these features enabled the development of a new predictor (available at http://cran.r-project.org/web/packages/appnn/index.html) capable of accurately and reliably predicting the amyloidogenic propensity from the polypeptide sequence alone with a prediction accuracy of 84.9 % against an external validation dataset of sequences with experimental in vitro, evidence of amyloid formation

    The Distribution of GYR- and YLP-Like Motifs in Drosophila Suggests a General Role in Cuticle Assembly and Other Protein-Protein Interactions

    Get PDF
    Background: Arthropod cuticle is composed predominantly of a self-assembling matrix of chitin and protein. Genes encoding structural cuticular proteins are remarkably abundant in arthropod genomes, yet there has been no systematic survey of conserved motifs across cuticular protein families. Methodology/Principal Findings: Two short sequence motifs with conserved tyrosines were identified in Drosophila cuticular proteins that were similar to the GYR and YLP Interpro domains. These motifs were found in members of the CPR, Tweedle, CPF/CPFL, and (in Anopheles gambiae) CPLCG cuticular protein families, and the Dusky/Miniature family of cuticleassociated proteins. Tweedle proteins have a characteristic motif architecture that is shared with the Drosophila protein GCR1 and its orthologs in other species, suggesting that GCR1 is also cuticular. A resilin repeat, which has been shown to confer elasticity, matched one of the motifs; a number of other Drosophila proteins of unknown function exhibit a motif architecture similar to that of resilin. The motifs were also present in some proteins of the peritrophic matrix and the eggshell, suggesting molecular convergence among distinct extracellular matrices. More surprisingly, gene regulation, development, and proteolysis were statistically over-represented ontology terms for all non-cuticular matches in Drosophila. Searches against other arthropod genomes indicate that the motifs are taxonomically widespread. Conclusions: This survey suggests a more general definition for GYR and YLP motifs and reveals their contribution to severa

    Structure of a lectin from Canavalia gladiata seeds: new structural insights for old molecules

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Lectins are mainly described as simple carbohydrate-binding proteins. Previous studies have tried to identify other binding sites, which possible recognize plant hormones, secondary metabolites, and isolated amino acid residues. We report the crystal structure of a lectin isolated from <it>Canavalia gladiata </it>seeds (CGL), describing a new binding pocket, which may be related to pathogen resistance activity in ConA-like lectins; a site where a non-protein amino-acid, α-aminobutyric acid (Abu), is bound.</p> <p>Results</p> <p>The overall structure of native CGL and complexed with α-methyl-mannoside and Abu have been refined at 2.3 Å and 2.31 Å resolution, respectively. Analysis of the electron density maps of the CGL structure shows clearly the presence of Abu, which was confirmed by mass spectrometry.</p> <p>Conclusion</p> <p>The presence of Abu in a plant lectin structure strongly indicates the ability of lectins on carrying secondary metabolites. Comparison of the amino acids composing the site with other legume lectins revealed that this site is conserved, providing an evidence of the biological relevance of this site. This new action of lectins strengthens their role in defense mechanisms in plants.</p

    Cooperativity among Short Amyloid Stretches in Long Amyloidogenic Sequences

    Get PDF
    Amyloid fibrillar aggregates of polypeptides are associated with many neurodegenerative diseases. Short peptide segments in protein sequences may trigger aggregation. Identifying these stretches and examining their behavior in longer protein segments is critical for understanding these diseases and obtaining potential therapies. In this study, we combined machine learning and structure-based energy evaluation to examine and predict amyloidogenic segments. Our feature selection method discovered that windows consisting of long amino acid segments of ∼30 residues, instead of the commonly used short hexapeptides, provided the highest accuracy. Weighted contributions of an amino acid at each position in a 27 residue window revealed three cooperative regions of short stretch, resemble the β-strand-turn-β-strand motif in A-βpeptide amyloid and β-solenoid structure of HET-s(218–289) prion (C). Using an in-house energy evaluation algorithm, the interaction energy between two short stretches in long segment is computed and incorporated as an additional feature. The algorithm successfully predicted and classified amyloid segments with an overall accuracy of 75%. Our study revealed that genome-wide amyloid segments are not only dependent on short high propensity stretches, but also on nearby residues

    Transcriptomics of the Bed Bug (Cimex lectularius)

    Get PDF
    BACKGROUND: Bed bugs (Cimex lectularius) are blood-feeding insects poised to become one of the major pests in households throughout the United States. Resistance of C. lectularius to insecticides/pesticides is one factor thought to be involved in its sudden resurgence. Despite its high-impact status, scant knowledge exists at the genomic level for C. lectularius. Hence, we subjected the C. lectularius transcriptome to 454 pyrosequencing in order to identify potential genes involved in pesticide resistance. METHODOLOGY AND PRINCIPAL FINDINGS: Using 454 pyrosequencing, we obtained a total of 216,419 reads with 79,596,412 bp, which were assembled into 35,646 expressed sequence tags (3902 contigs and 31744 singletons). Nearly 85.9% of the C. lectularius sequences showed similarity to insect sequences, but 44.8% of the deduced proteins of C. lectularius did not show similarity with sequences in the GenBank non-redundant database. KEGG analysis revealed putative members of several detoxification pathways involved in pesticide resistance. Lamprin domains, Protein Kinase domains, Protein Tyrosine Kinase domains and cytochrome P450 domains were among the top Pfam domains predicted for the C. lectularius sequences. An initial assessment of putative defense genes, including a cytochrome P450 and a glutathione-S-transferase (GST), revealed high transcript levels for the cytochrome P450 (CYP9) in pesticide-exposed versus pesticide-susceptible C. lectularius populations. A significant number of single nucleotide polymorphisms (296) and microsatellite loci (370) were predicted in the C. lectularius sequences. Furthermore, 59 putative sequences of Wolbachia were retrieved from the database. CONCLUSIONS: To our knowledge this is the first study to elucidate the genetic makeup of C. lectularius. This pyrosequencing effort provides clues to the identification of potential detoxification genes involved in pesticide resistance of C. lectularius and lays the foundation for future functional genomics studies

    Bacterial β-barrel outer membrane proteins: A common structural theme implicated in a wide variety of functional roles

    No full text
    β-barrel outer membrane proteins constitute the second and less well-studied class of transmembrane proteins. They are present exclusively in the outer membrane of Gram-negative bacteria and presumably in the outer membrane of mitochondria and chloroplasts. During the last few years, remarkable advances have been made towards an understanding of their functional and structural features. It is now wellknown that β-barrels are performing a large variety of biologically important functions for the bacterial cell. Such functions include acting as specific or non-specific channels, receptors for various compounds, enzymes, translocation channels, structural proteins, and adhesion proteins. All these functional roles are of great importance for the survival of the bacterial cell under various environmental conditions or for the pathogenic properties expressed by these organisms. This chapter reviews the currently available literature regarding the structure and function of bacterial outer membrane proteins. We emphasize the functional diversity expressed by a common structural motif such as the β-barrel, and we provide evidence from the current literature for dozens of newly discovered families of transmembrane β-barrels. © 2009, IGI Global
    • …
    corecore