253 research outputs found

    Detection of splice junctions from paired-end RNA-seq data by SpliceMap

    Alternative splicing is a prevalent post-transcriptional process, which is not only important to normal cellular function but is also involved in human diseases. The newly developed second-generation sequencing technique provides high-throughput data (RNA-seq data) to study alternative splicing events in different types of cells. Here, we present a computational method, SpliceMap, to detect splice junctions from RNA-seq data. This method does not depend on any existing annotation of gene structures and is capable of finding novel splice junctions with high sensitivity and specificity. It can handle long reads (50–100 nt) and can exploit paired-read information to improve mapping accuracy. Several parameters are included in the output to indicate the reliability of the predicted junction and help filter out false predictions. We applied SpliceMap to analyze 23 million paired 50-nt reads from human brain tissue. The results show that, at this depth of sequencing, RNA-seq can support reliable detection of splice junctions except for those that are present at very low levels. Compared to current methods, SpliceMap can achieve 12% higher sensitivity without sacrificing specificity.

    Alpha-particle-induced complex chromosome exchanges transmitted through extra-thymic lymphopoiesis in vitro show evidence of emerging genomic instability

    Human exposure to high-linear energy transfer (high-LET) α-particles includes environmental (e.g. radon gas and its decay progeny), medical (e.g. radiopharmaceuticals) and occupational (nuclear industry) sources. The associated health risks of α-particle exposure for lung cancer are well documented; however, the risk estimates for leukaemia remain uncertain. To further our understanding of α-particle effects in target cells for leukaemogenesis and also to seek general markers of individual exposure to α-particles, this study assessed the transmission of chromosomal damage initially induced in human haemopoietic stem and progenitor cells after exposure to high-LET α-particles. Cells surviving exposure were differentiated into mature T-cells by extra-thymic T-cell differentiation in vitro. Multiplex fluorescence in situ hybridisation (M-FISH) analysis of naïve T-cell populations showed the occurrence of stable (clonal) complex chromosome aberrations consistent with those that are characteristically induced in spherical cells by the traversal of a single α-particle track. Additionally, complex chromosome exchanges were observed in the progeny of irradiated mature T-cell populations. Furthermore, de novo chromosome aberrations were detected in cells which possessed clonal markers of α-particle exposure and also in cells which did not show any evidence of previous exposure, suggesting ongoing genomic instability in these populations. Our findings support the usefulness and reliability of employing complex chromosome exchanges as indicators of past or ongoing exposure to high-LET radiation and demonstrate the potential applicability to evaluate health risks associated with α-particle exposure. This work was supported by the Department of Health, UK. Contract RRX95 (RMA NSDTG).

    Characterisation of Inactivation Domains and Evolutionary Strata in Human X Chromosome through Markov Segmentation

    Markov segmentation is a method of identifying compositionally different subsequences in a given symbolic sequence. We have applied this technique to the DNA sequence of the human X chromosome to analyze its compositional structure. The human X chromosome is known to have acquired DNA through distinct evolutionary events and is believed to be composed of five evolutionary strata. In addition, in female mammals all copies of the X chromosome in excess of one are transcriptionally inactivated. The location of a gene is correlated with its ability to undergo inactivation, but correlations between evolutionary strata and inactivation domains are less clear. Our analysis provides an accurate estimate of the location of stratum boundaries and gives a high-resolution map of compositionally different regions on the X chromosome. This leads to the identification of a novel stratum, as well as segments wherein a group of genes either undergo inactivation or escape inactivation in toto. We identify oligomers that appear to be unique to inactivation domains alone.

    Improved annotation of 3' untranslated regions and complex loci by combination of strand-specific direct RNA sequencing, RNA-seq and ESTs

    The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct annotation is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3-prime untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences, which locates 3-prime polyadenylation sites to within +/- 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1) gene and 3-prime UTR re-annotation (including extension of one 3-prime UTR by 5.9 kb); (2) disentangling of gene expression in complex regions; (3) clearer interpretation of small RNA expression and (4) identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publicly available annotations in the context of their own experimental data. Comment: 44 pages, 9 figures

    Insights gained from the reverse engineering of gene networks in keloid fibroblasts

    Background: Keloids are protrusive claw-like scars that have a propensity to recur even after surgery, and their molecular etiology remains elusive. The goal of reverse engineering is to infer gene networks from observational data, thus providing insight into the inner workings of a cell. However, most attempts at modeling biological networks have been done using simulated data. This study aims to highlight some of the issues involved in working with experimental data, and at the same time gain some insights into the transcriptional regulatory mechanism present in keloid fibroblasts. Methods: Microarray data from our previous study were combined with microarray data obtained from the literature as well as new microarray data generated by our group. For the physical approach, we used the fREDUCE algorithm for correlating expression values to binding motifs. For the influence approach, we compared the Bayesian algorithm BANJO with the information theoretic method ARACNE in terms of performance in recovering known influence networks obtained from the KEGG database. In addition, we also compared the performance of different normalization methods as well as different types of gene networks. Results: Using the physical approach, we found consensus sequences that were active in the keloid condition, as well as some sequences that were responsive to steroids, a commonly used treatment for keloids. From the influence approach, we found that BANJO was better at recovering the gene networks compared to ARACNE and that transcriptional networks were better suited for network recovery compared to cytokine-receptor interaction networks and intracellular signaling networks. We also found that the NFKB transcriptional network that was inferred from normal fibroblast data was more accurate compared to that inferred from keloid data, suggesting a more robust network in the keloid condition. Conclusions: Consensus sequences that were found from this study are possible transcription factor binding sites and could be explored for developing future keloid treatments or for improving the efficacy of current steroid treatments. We also found that the combination of the Bayesian algorithm, RMA normalization and transcriptional networks gave the best reconstruction results and this could serve as a guide for future influence approaches dealing with experimental data.

    Supported and valued? A survey of Early Career Researchers’ experiences and perceptions of youth and adult involvement in mental health, self-harm and suicide research

    Background: Patient and public involvement (PPI) in mental health research, including self-harm and suicide research, is desirable (as with other health topics) but may involve specific challenges given the perceived sensitivity of the topic. This is particularly so when involving young people. We explore the experiences and perceptions of Early Career Researchers (ECRs) undertaking youth and adult involvement work in mental health, self-harm and/or suicide research. We consider current practice, barriers and facilitators. Methods: An online survey of a convenience sample of ECRs (N = 41) undertaking research on mental health, self-harm and/or suicide. Questions examined the perceived value of involvement work, involvement methods used, funding availability and the extent to which researchers felt knowledgeable, supported and confident in their involvement activities. Descriptive statistics are presented with appropriate tests. Open-ended questions, related to barriers and facilitators for involvement work, were subjected to an inductive thematic analysis. Results: Youth and adult involvement work were valued to a similar extent, though institutions were reported to value youth involvement to a lesser extent. Researchers’ knowledge, confidence and support ratings were comparable for youth and adult involvement. The involvement methods used with young people and adults were also similar, with analysing data being the least popular method used and developing resources (e.g. information sheets) being the most popular method used. Less than a third of participants reported that funding was available for their research involvement activities. Barriers to involvement in research on mental health, self-harm and suicide were: ethical issues and perceived risk; real costs (in terms of money/time) versus perceived value; and the challenge of recruiting people. Facilitators to involvement work were: expert examples, expertise and guidelines; and investment in involvement work. Conclusions: ECRs in the fields of mental health, self-harm and suicide are engaged in youth and adult involvement work. They value (find worthwhile) youth and adult involvement work to a similarly high extent, but feel their institutions may regard youth involvement slightly less highly than adult involvement. ECRs rate themselves as feeling similarly knowledgeable, confident and supported when doing involvement activities with both age groups. Nonetheless, significant barriers to involvement work on these topics are reported and are generally issues that need to be tackled at an institutional level (ethical/governance issues and lack of funding).

    Normalized long read RNA sequencing in chicken reveals transcriptome complexity similar to human

    Background: Despite the significance of chicken as a model organism, our understanding of the chicken transcriptome is limited compared to human. This issue is common to all non-human vertebrate annotations due to the difficulty in transcript identification from short read RNAseq data. While previous studies have used single molecule long read sequencing for transcript discovery, they did not perform RNA normalization and 5'-cap selection, which may have resulted in lower transcriptome coverage and truncated transcript sequences. Results: We sequenced normalised chicken brain and embryo RNA libraries with Pacific Biosciences Iso-Seq. 5' cap selection was performed on the embryo library to provide methodological comparison. From these Iso-Seq sequencing projects, we have identified 60 k transcripts and 29 k genes within the chicken transcriptome. Of these, more than 20 k are novel lncRNA transcripts with ~3 k classified as sense exonic overlapping lncRNA, which is a class that is underrepresented in many vertebrate annotations. The relative proportion of alternative transcription events revealed striking similarities between the chicken and human transcriptomes while also providing explanations for previously observed genomic differences. Conclusions: Our results indicate that the chicken transcriptome is similar in complexity to that of human, and provide insights into other vertebrate biology. Our methodology demonstrates the potential of Iso-Seq sequencing to rapidly expand our knowledge of transcriptomics.

    Bovine Genome Database: integrated tools for genome annotation and discovery

    The Bovine Genome Database (BGD; http://BovineGenome.org) strives to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data. BGD includes GBrowse genome browsers, the Apollo Annotation Editor, a quantitative trait loci (QTL) viewer, BLAST databases and gene pages. Genome browsers, available for both scaffold and chromosome coordinate systems, display the bovine Official Gene Set (OGS), RefSeq and Ensembl gene models, non-coding RNA, repeats, pseudogenes, single-nucleotide polymorphisms, markers, QTL and alignments to complementary DNAs, ESTs and protein homologs. The Bovine QTL viewer is connected to the BGD Chromosome GBrowse, allowing for the identification of candidate genes underlying QTL. The Apollo Annotation Editor connects directly to the BGD Chado database to provide researchers with remote access to gene evidence in a graphical interface that allows editing and creating new gene models. Researchers may upload their annotations to the BGD server for review and integration into the subsequent release of the OGS. Gene pages display information for individual OGS gene models, including gene structure, transcript variants, functional descriptions, gene symbols, Gene Ontology terms, annotator comments and links to National Center for Biotechnology Information and Ensembl. Each gene page is linked to a wiki page to allow input from the research community.