A gene expression atlas for different kinds of stress in the mouse brain
Stressful experiences are part of everyday life and animals have evolved physiological and behavioral responses aimed at coping with stress and maintaining homeostasis. However, repeated or intense stress can induce maladaptive reactions leading to behavioral disorders. Adaptations in the brain, mediated by changes in gene expression, have a crucial role in the stress response. Recent years have seen a tremendous increase in studies on the transcriptional effects of stress. The input raw data are freely available from public repositories and represent a wealth of information for further global and integrative retrospective analyses. We downloaded from the Sequence Read Archive 751 samples (SRA-experiments), from 18 independent BioProjects studying the effects of different stressors on the brain transcriptome in mice. We performed a massive bioinformatics re-analysis applying a single, standardized pipeline for computing differential gene expression. This data mining allowed the identification of novel candidate stress-related genes and specific signatures associated with different stress conditions. The large amount of computational results produced was systematized in the interactive “Stress Mice Portal”
HPC-REDItools: A novel HPC-aware tool for improved large scale RNA-editing analysis
Background: RNA editing is a widespread co-/post-transcriptional mechanism that alters primary RNA sequences through the modification of specific nucleotides, and it can increase both transcriptome and proteome diversity. The automatic detection of RNA editing from RNA-seq data is computationally intensive and has been limited to small data sets, which prevents a reliable genome-wide characterisation of this process. Results: In this work we introduce HPC-REDItools, an upgraded tool for accurate discovery of RNA-editing events in large dataset repositories. Availability: https://github.com/BioinfoUNIBA/REDItools2. Conclusions: HPC-REDItools is dramatically faster than the previous version, REDItools, enabling big-data analysis by means of an MPI-based implementation that scales almost linearly with the number of available cores.
ASPicDB: a database of annotated transcript and protein variants generated by alternative splicing
Alternative splicing is emerging as a major mechanism for the expansion of transcriptome and proteome diversity, particularly in humans and other vertebrates. However, the proportion of alternative transcripts and proteins actually endowed with functional activity is currently highly debated. We present here a new release of ASPicDB, which now provides a unique annotation resource of human protein variants generated by alternative splicing. A total of 256 939 protein variants from 17 191 multi-exon genes have been extensively annotated with state-of-the-art machine learning tools, providing information on the protein type (globular and transmembrane), localization, presence of PFAM domains, signal peptides, GPI-anchor propeptides, and transmembrane and coiled-coil segments. Furthermore, full-length variants can now be specifically selected based on the annotation of CAGE-tags and polyA signals and/or polyA sites, marking transcription initiation and termination sites, respectively. Retrieval can be carried out at the gene, transcript, exon, protein or splice site level, allowing the selection of data sets fulfilling one or more features set by the user. The retrieval interface also enables the selection of protein variants showing specific differences in the annotated features. ASPicDB is available at http://www.caspur.it/ASPicDB/
Model of the complex of Parathyroid hormone-2 receptor and Tuberoinfundibular peptide of 39 residues
Background: We aim to propose interactions between the parathyroid hormone-2 receptor (PTH2R) and its ligand, the tuberoinfundibular peptide of 39 residues (TIP39), by constructing a homology model of their complex. The two related peptides, parathyroid hormone (PTH) and parathyroid hormone related protein (PTHrP), are compared with the complex to examine their interactions. Findings: In the model, the hydrophobic N-terminus of TIP39 is buried in a hydrophobic part of the central cavity between helices 3 and 7. Comparison of the peptide sequences indicates that the main discriminator between the agonistic peptides TIP39 and PTH and the inactive PTHrP is a tryptophan-phenylalanine replacement. The model indicates that the smaller phenylalanine in PTHrP does not completely occupy the binding site of the larger tryptophan residue in the other peptides. Only TIP39 causes internalisation of the receptor, and the primary difference is an aspartic acid in position 7 of TIP39 that interacts with histidine 396 in the receptor, versus isoleucine/histidine residues in the related hormones; this interaction might therefore be a trigger for the events that cause internalisation. Conclusions: A model is constructed for the complex, and a trigger interaction for full agonistic activation between aspartic acid 7 of TIP39 and histidine 396 in the receptor is proposed.
The metagenomic approach and causality in virology
The metagenomic approach has become a very important tool for the discovery of new viruses in environmental and biological samples. Here we discuss how these discoveries may help to elucidate the etiology of diseases and the criteria necessary to establish a causal association between a virus and a disease.
Disulphide Bridges of Phospholipase C of Chlamydomonas reinhardtii Modulates Lipid Interaction and Dimer Stability
BACKGROUND: Phospholipase C (PLC) is an enzyme that plays a pivotal role in a number of signaling cascades. PLCs are active in the plasma membrane and trigger cellular responses by catalyzing the hydrolysis of membrane phospholipids, thereby generating secondary messengers. Phosphatidylinositol-PLC (PI-PLC) specifically interacts with phosphoinositide and/or phosphoinositol and catalyzes specific cleavage of the sn-3 phosphodiester bond. Several isoforms of PLC are known to form and function as dimers, but very little is known about the molecular basis of the dimerization and its importance in the lipid interaction. PRINCIPAL FINDINGS: We report here that the disruption of a disulphide bond of a novel PI-specific PLC of C. reinhardtii (CrPLC) can modulate its interaction affinity with a set of phospholipids and also the stability of its dimer. CrPLC was found to form a mixture of higher oligomeric states, with monomer and dimer as the major species. The dimer adduct of CrPLC disappeared in the presence of DTT, which suggested the involvement of disulphide bond(s) in CrPLC oligomerization. Dimer-monomer equilibrium studies with the isolated fractions of CrPLC monomer and dimer supported the involvement of covalent forces in the dimerization of CrPLC. A disulphide bridge was found to be responsible for the dimerization, and Cys7 seems to be involved in the formation of the disulphide bond. This crucial disulphide bond also modulated the lipid affinity of CrPLC. Oligomers of CrPLC were also captured under in vivo conditions. CrPLC was mainly found to be localized in the plasma membrane of the cell. The cell surface localization of CrPLC may have significant implications for the downstream regulatory function of CrPLC. SIGNIFICANCE: This study helps in establishing the role of CrPLC (or similar proteins) in the quaternary structure of the molecule and in its affinities during lipid interactions.
EasyCluster: a fast and efficient gene-oriented clustering tool for large-scale transcriptome data
Background: ESTs and full-length cDNAs represent an invaluable source of evidence for inferring reliable gene structures and discovering potential alternative splicing events. In newly sequenced genomes, these tasks may not be practicable owing to the lack of appropriate training sets. However, when expression data are available, they can be used to build EST clusters related to specific genomic transcribed loci. Common strategies recently employed to this end are based on sequence similarity between transcripts and can lead, in specific conditions, to inconsistent and erroneous clustering. In order to improve the cluster building and facilitate all downstream annotation analyses, we developed a simple genome-based methodology to generate gene-oriented clusters of ESTs when a genomic sequence and a pool of related expressed sequences are provided. Our procedure has been implemented in the software EasyCluster and takes into account the spliced nature of ESTs after an ad hoc genomic mapping. Methods: EasyCluster uses the well-known GMAP program to perform a very quick EST-to-genome mapping in addition to the detection of reliable splice sites. Given a genomic sequence and a pool of ESTs/FL-cDNAs, EasyCluster starts by building genomic and EST local databases and runs GMAP. Subsequently, it parses the results, creating an initial collection of pseudo-clusters by grouping ESTs according to the overlap of their genomic coordinates on the same strand. In the final step, EasyCluster refines the clustering by running GMAP again on each pseudo-cluster and groups together ESTs sharing at least one splice site. Results: The higher accuracy of EasyCluster with respect to other clustering tools has been verified by means of a manually curated benchmark of human EST clusters. Additional datasets, including the Unigene cluster Hs.122986 and ESTs related to the human HOXA gene family, have also been used to demonstrate the better clustering capability of EasyCluster over current genome-based web service tools such as ASmodeler and BIPASS. EasyCluster has also been used to provide a first compilation of gene-oriented clusters in the Ricinus communis oilseed plant, for which no Unigene clusters are yet available, as well as an evaluation of alternative splicing in this plant species.
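The two clustering steps described in the EasyCluster abstract can be illustrated with a minimal Python sketch. This is not the EasyCluster code: it assumes the EST-to-genome alignments are already available as simple records, it skips the second GMAP run on each pseudo-cluster and reuses the same coordinates, and the names `Alignment`, `pseudo_clusters`, `refine_by_splice_site` and `gene_oriented_clusters` are hypothetical. How the real tool treats unspliced ESTs is not stated in the abstract; here they simply remain singletons.

```python
from collections import defaultdict
from typing import NamedTuple


class Alignment(NamedTuple):
    """Hypothetical record for one EST aligned to the genome."""
    est_id: str
    strand: str              # "+" or "-"
    start: int               # genomic start of the alignment
    end: int                 # genomic end of the alignment
    splice_sites: frozenset  # genomic positions of detected splice sites


def pseudo_clusters(alignments):
    """Step 1: group ESTs whose genomic intervals overlap on the same strand."""
    by_strand = defaultdict(list)
    for aln in alignments:
        by_strand[aln.strand].append(aln)

    clusters = []
    for strand in sorted(by_strand):
        current, current_end = [], None
        # Sweep left to right, extending the open cluster while intervals overlap.
        for aln in sorted(by_strand[strand], key=lambda a: (a.start, a.end)):
            if current and aln.start <= current_end:
                current.append(aln)
                current_end = max(current_end, aln.end)
            else:
                if current:
                    clusters.append(current)
                current, current_end = [aln], aln.end
        if current:
            clusters.append(current)
    return clusters


def refine_by_splice_site(cluster):
    """Step 2: inside one pseudo-cluster, join ESTs sharing at least one splice site."""
    parent = {aln.est_id: aln.est_id for aln in cluster}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    first_with_site = {}
    for aln in cluster:
        for site in aln.splice_sites:
            if site in first_with_site:
                parent[find(aln.est_id)] = find(first_with_site[site])
            else:
                first_with_site[site] = aln.est_id

    groups = defaultdict(set)
    for aln in cluster:
        groups[find(aln.est_id)].add(aln.est_id)
    return sorted(groups.values(), key=lambda g: sorted(g))


def gene_oriented_clusters(alignments):
    """Run both steps and return a list of sets of EST identifiers."""
    result = []
    for pseudo in pseudo_clusters(alignments):
        result.extend(refine_by_splice_site(pseudo))
    return result


if __name__ == "__main__":
    example = [
        Alignment("e1", "+", 100, 500, frozenset({200, 300})),
        Alignment("e2", "+", 150, 600, frozenset({300, 450})),
        Alignment("e3", "+", 400, 700, frozenset({520, 650})),
    ]
    print(gene_oriented_clusters(example))
```

In this sketch, overlap alone only produces candidate groups; two ESTs end up in the same final cluster only when they are linked, directly or through other ESTs, by a shared splice site, which is what separates overlapping but distinct transcripts.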
Assessment of orthologous splicing isoforms in human and mouse orthologous genes
Background: Recent discoveries have highlighted the fact that alternative splicing and alternative transcripts are the rule, rather than the exception, in metazoan genes. Since multiple transcript and protein variants expressed by the same gene are, by definition, structurally distinct and need not be functionally equivalent, the concept of gene orthology should be extended to the transcript level in order to describe evolutionary relationships between structurally similar transcript variants. In other words, the identification of true orthology relationships between gene products should now progress beyond primary sequence, and "splicing orthology", consisting of ancestrally shared exon-intron structures, is required to define orthologous isoforms at the transcript level. Results: As a starting step in this direction, in this work we performed a large-scale human-mouse gene comparison with a twofold goal: first, to assess whether and to what extent traditional gene annotations such as RefSeq capture genuine splicing orthology; second, to provide a more detailed annotation and quantification of true human-mouse orthologous transcripts, defined as transcripts of orthologous genes exhibiting the same splicing patterns. Conclusions: We observed an identical exon/intron structure for 32% of human and mouse orthologous genes. This figure increases to 87% using less stringent criteria for gene structure similarity, thus implying that for about 13% of the human RefSeq annotated genes (and about 25% of the corresponding transcripts) we could not identify any mouse transcript showing sufficient similarity to be confidently assigned as a splicing ortholog. Our data suggest that current gene and transcript data may still be rather incomplete, with several splicing variants still unknown. The observation that alternative splicing produces large numbers of alternative transcripts and proteins, some of them conserved across species and others truly species-specific, suggests that, while still maintaining the conventional definition of gene orthology, a new concept of "splicing orthology" can be defined at the transcript level.
Common and Distant Structural Characteristics of Feruloyl Esterase Families from Aspergillus oryzae
Background: Feruloyl esterases (FAEs) are important biomass-degrading accessory enzymes due to their capability of cleaving the ester links between hemicellulose and pectin to aromatic compounds of lignin, thus enhancing the accessibility of plant tissues to cellulolytic and hemicellulolytic enzymes. FAEs have gained increased attention in the area of biocatalytic transformations for the synthesis of value-added compounds with medicinal and nutritional applications. Following the increasing attention on these enzymes, a novel descriptor-based classification system has been proposed for FAEs, resulting in 12 distinct families, and pharmacophore models for three FAE sub-families have been developed. Methodology/Principal Findings: The feruloylome of Aspergillus oryzae contains 13 predicted FAEs belonging to six sub-families based on our recently developed descriptor-based classification system. The three-dimensional structures of the 13 FAEs were modeled for structural analysis of the feruloylome. The three genes coding for three enzymes, viz., A.O.2, A.O.8 and A.O.10 from the feruloylome of A. oryzae, representing sub-families with unknown functional features, were heterologously expressed in Pichia pastoris, characterized for substrate specificity, and structurally characterized by CD spectroscopy. Common feature-based pharmacophore models were developed according to the substrate specificity characteristics of the three enzymes. The active site residues were identified for the three expressed FAEs by determining the titration curves of amino acid residues as a function of pH using molecular simulations. Conclusions/Significance: Our findings on the structure-function relationships and substrate specificity of the FAEs of A. oryzae will be instrumental for further understanding of the FAE families in the novel classification system. The developed pharmacophore models could be applied for virtual screening of compound databases to shortlist putative substrates prior to docking studies, or for post-processing docking results to remove false positives. Our study exemplifies how computational predictions can complement the information obtained through experimental methods. © 2012 Udatha et al.
Differentiating Protein-Coding and Noncoding RNA: Challenges and Ambiguities
The assumption that RNA can be readily classified into either protein-coding or non-protein–coding categories has pervaded biology for close to 50 years. Until recently, discrimination between these two categories was relatively straightforward: most transcripts were clearly identifiable as protein-coding messenger RNAs (mRNAs), and readily distinguished from the small number of well-characterized non-protein–coding RNAs (ncRNAs), such as transfer, ribosomal, and spliceosomal RNAs. Recent genome-wide studies have revealed the existence of thousands of noncoding transcripts, whose function and significance are unclear. The discovery of this hidden transcriptome and the implicit challenge it presents to our understanding of the expression and regulation of genetic information has made the need to distinguish between mRNAs and ncRNAs both more pressing and more complicated. In this Review, we consider the diverse strategies employed to discriminate between protein-coding and noncoding transcripts and the fundamental difficulties that are inherent in what may superficially appear to be a simple problem. Misannotations can also run in both directions: some ncRNAs may actually encode peptides, and some of those currently thought to do so may not. Moreover, recent studies have shown that some RNAs can function both as mRNAs and intrinsically as functional ncRNAs, which may be a relatively widespread phenomenon. We conclude that it is difficult to annotate an RNA unequivocally as protein-coding or noncoding, with overlapping protein-coding and noncoding transcripts further confounding this distinction. In addition, the finding that some transcripts can function both intrinsically at the RNA level and to encode proteins suggests a false dichotomy between mRNAs and ncRNAs. Therefore, the functionality of any transcript at the RNA level should not be discounted