20 research outputs found

    Transcriptome map of mouse isochores

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The availability of fully sequenced genomes and the implementation of transcriptome technologies have increased the studies investigating the expression profiles for a variety of tissues, conditions, and species. In this study, using RNA-seq data for three distinct tissues (brain, liver, and muscle), we investigate how base composition affects mammalian gene expression, an issue of prime practical and evolutionary interest.</p> <p>Results</p> <p>We present the transcriptome map of the mouse isochores (DNA segments with a fairly homogeneous base composition) for the three different tissues and the effects of isochores' base composition on their expression activity. Our analyses also cover the relations between the genes' expression activity and their localization in the isochore families.</p> <p>Conclusions</p> <p>This study is the first where next-generation sequencing data are used to associate the effects of both genomic and genic compositional properties to their corresponding expression activity. Our findings confirm previous results, and further support the existence of a relationship between isochores and gene expression. This relationship corroborates that isochores are primarily a product of evolutionary adaptation rather than a simple by-product of neutral evolutionary processes.</p

    Prediction of peptide and protein propensity for amyloid formation

    Get PDF
    Understanding which peptides and proteins have the potential to undergo amyloid formation and what driving forces are responsible for amyloid-like fiber formation and stabilization remains limited. This is mainly because proteins that can undergo structural changes, which lead to amyloid formation, are quite diverse and share no obvious sequence or structural homology, despite the structural similarity found in the fibrils. To address these issues, a novel approach based on recursive feature selection and feed-forward neural networks was undertaken to identify key features highly correlated with the self-assembly problem. This approach allowed the identification of seven physicochemical and biochemical properties of the amino acids highly associated with the self-assembly of peptides and proteins into amyloid-like fibrils (normalized frequency of ÎČ-sheet, normalized frequency of ÎČ-sheet from LG, weights for ÎČ-sheet at the window position of 1, isoelectric point, atom-based hydrophobic moment, helix termination parameter at position j+1 and ΔGÂș values for peptides extrapolated in 0 M urea). Moreover, these features enabled the development of a new predictor (available at http://cran.r-project.org/web/packages/appnn/index.html) capable of accurately and reliably predicting the amyloidogenic propensity from the polypeptide sequence alone with a prediction accuracy of 84.9 % against an external validation dataset of sequences with experimental in vitro, evidence of amyloid formation

    REAL: REad ALigner is an efficient short-read mapping tool

    No full text
    International audienc

    REAL: An efficient REad ALigner for next generation sequencing reads

    No full text
    International audienc

    Amyloidogenic determinants are usually not buried

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Amyloidoses are a group of usually fatal diseases, probably caused by protein misfolding and subsequent aggregation into amyloid fibrillar deposits. The mechanisms involved in amyloid fibril formation are largely unknown and are the subject of current, intensive research. In an attempt to identify possible amyloidogenic regions in proteins for further experimental investigation, we have developed and present here a publicly available online tool that utilizes five different and independently published methods, to form a consensus prediction of amyloidogenic regions in proteins, using only protein primary structure data.</p> <p>Results</p> <p>It appears that the consensus prediction tool is slightly more objective than individual prediction methods alone and suggests several previously not identified amino acid stretches as potential amyloidogenic determinants, which (although several of them may be overpredictions) require further experimental studies. The tool is available at: <url>http://biophysics.biol.uoa.gr/AMYLPRED</url>. Utilizing molecular graphics programs, like O and PyMOL, as well as the algorithm DSSP, it was found that nearly all experimentally verified amyloidogenic determinants (short peptide stretches favouring aggregation and subsequent amyloid formation), and several predicted, with the aid of the tool AMYLPRED, but not experimentally verified amyloidogenic determinants, are located on the surface of the relevant amyloidogenic proteins. This finding may be important in efforts directed towards inhibiting amyloid fibril formation.</p> <p>Conclusion</p> <p>The most significant result of this work is the observation that virtually all, to date, experimentally determined amyloidogenic determinants and the majority of predicted, but not yet experimentally verified short amyloidogenic stretches, lie 'exposed' on the surface of the relevant amyloidogenic proteins, and also several of them have the ability to act as conformational 'switches'. Experiments, focused on these fragments, should be performed to test this idea.</p

    GapMis-OMP: Pairwise Short-Read Alignment on Multi-core Architectures

    No full text
    Part 8: First Workshop on Algorithms for Data and Text Mining in Bioinformatics (WADTMB 2012)International audiencePairwise sequence alignment has received a new motivation due to the advent of next-generation sequencing technologies, particularly so for the application of re-sequencing—the assembly of a genome directed by a reference sequence. After the fast alignment between a factor of the reference sequence and a high-quality fragment of a short read by a short-read alignment programme, an important problem is to find the alignment between a relatively short succeeding factor of the reference sequence and the remaining low-quality fragment of the read allowing a number of mismatches and the insertion of a single gap in the alignment. In this article, we present GapMis-OMP, a tool for pairwise short-read alignment that works on multi-core architectures. It is designed to compute the alignments between all the sequences in a first set of sequences and all those from a second one in parallel. The presented experimental results demonstrate that GapMis-OMP is more efficient than most popular tools
    corecore