66 research outputs found

    HMMSplicer: A Tool for Efficient and Sensitive Discovery of Known and Novel Splice Junctions in RNA-Seq Data

    Get PDF
    Background: High-throughput sequencing of an organism’s transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown. Methodology/Principal Findings: Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity. Conclusions/Significance: HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on prebuilt gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6 % of 39 splice sites and 1.4% of 59 splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available a

    Two-pion Bose-Einstein correlations in central Pb-Pb collisions at sNN\sqrt{s_{\rm NN}} = 2.76 TeV

    Get PDF
    The first measurement of two-pion Bose-Einstein correlations in central Pb-Pb collisions at sNN=2.76\sqrt{s_{\rm NN}} = 2.76 TeV at the Large Hadron Collider is presented. We observe a growing trend with energy now not only for the longitudinal and the outward but also for the sideward pion source radius. The pion homogeneity volume and the decoupling time are significantly larger than those measured at RHIC.Comment: 17 pages, 5 captioned figures, 1 table, authors from page 12, published version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/388

    Suppression of charged particle production at large transverse momentum in central Pb-Pb collisions at sNN=2.76\sqrt{s_{\rm NN}} = 2.76 TeV

    Get PDF
    Inclusive transverse momentum spectra of primary charged particles in Pb-Pb collisions at sNN\sqrt{s_{_{\rm NN}}} = 2.76 TeV have been measured by the ALICE Collaboration at the LHC. The data are presented for central and peripheral collisions, corresponding to 0-5% and 70-80% of the hadronic Pb-Pb cross section. The measured charged particle spectra in η<0.8|\eta|<0.8 and 0.3<pT<200.3 < p_T < 20 GeV/cc are compared to the expectation in pp collisions at the same sNN\sqrt{s_{\rm NN}}, scaled by the number of underlying nucleon-nucleon collisions. The comparison is expressed in terms of the nuclear modification factor RAAR_{\rm AA}. The result indicates only weak medium effects (RAAR_{\rm AA} \approx 0.7) in peripheral collisions. In central collisions, RAAR_{\rm AA} reaches a minimum of about 0.14 at pT=6p_{\rm T}=6-7GeV/cc and increases significantly at larger pTp_{\rm T}. The measured suppression of high-pTp_{\rm T} particles is stronger than that observed at lower collision energies, indicating that a very dense medium is formed in central Pb-Pb collisions at the LHC.Comment: 15 pages, 5 captioned figures, 3 tables, authors from page 10, published version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/98

    A Classification Method Based on Principal Components of SELDI Spectra to Diagnose of Lung Adenocarcinoma

    Get PDF
    Lung cancer is the leading cause of cancer death worldwide, but techniques for effective early diagnosis are still lacking. Proteomics technology has been applied extensively to the study of the proteins involved in carcinogenesis. In this paper, a classification method was developed based on principal components of surface-enhanced laser desorption/ionization (SELDI) spectral data. This method was applied to SELDI spectral data from 71 lung adenocarcinoma patients and 24 healthy individuals. Unlike other peak-selection-based methods, this method takes each spectrum as a unity. The aim of this paper was to demonstrate that this unity-based classification method is more robust and powerful as a method of diagnosis than peak-selection-based methods.The results showed that this classification method, which is based on principal components, has outstanding performance with respect to distinguishing lung adenocarcinoma patients from normal individuals. Through leaving-one-out, 19-fold, 5-fold and 2-fold cross-validation studies, we found that this classification method based on principal components completely outperforms peak-selection-based methods, such as decision tree, classification and regression tree, support vector machine, and linear discriminant analysis.The classification method based on principal components of SELDI spectral data is a robust and powerful means of diagnosing lung adenocarcinoma. We assert that the high efficiency of this classification method renders it feasible for large-scale clinical use

    Enteropathy-associated T cell lymphoma subtypes are characterized by loss of function of SETD2

    Get PDF
    Enteropathy-associated T cell lymphoma (EATL) is a lethal, and the most common, neoplastic complication of celiac disease. Here, we defined the genetic landscape of EATL through whole-exome sequencing of 69 EATL tumors. SETD2 was the most frequently silenced gene in EATL (32% of cases). The JAK-STAT pathway was the most frequently mutated pathway, with frequent mutations in STAT5B as well as JAK1 , JAK3 , STAT3 , and SOCS1 . We also identified mutations in KRAS , TP53 , and TERT . Type I EATL and type II EATL (monomorphic epitheliotropic intestinal T cell lymphoma) had highly overlapping genetic alterations indicating shared mechanisms underlying their pathogenesis. We modeled the effects of SETD2 loss in vivo by developing a T cell–specific knockout mouse. These mice manifested an expansion of γδ T cells, indicating novel roles for SETD2 in T cell development and lymphomagenesis. Our data render the most comprehensive genetic portrait yet of this uncommon but lethal disease and may inform future classification schemes

    DNA glycosylases: in DNA repair and beyond

    Get PDF
    The base excision repair machinery protects DNA in cells from the damaging effects of oxidation, alkylation, and deamination; it is specialized to fix single-base damage in the form of small chemical modifications. Base modifications can be mutagenic and/or cytotoxic, depending on how they interfere with the template function of the DNA during replication and transcription. DNA glycosylases play a key role in the elimination of such DNA lesions; they recognize and excise damaged bases, thereby initiating a repair process that restores the regular DNA structure with high accuracy. All glycosylases share a common mode of action for damage recognition; they flip bases out of the DNA helix into a selective active site pocket, the architecture of which permits a sensitive detection of even minor base irregularities. Within the past few years, it has become clear that nature has exploited this ability to read the chemical structure of DNA bases for purposes other than canonical DNA repair. DNA glycosylases have been brought into context with molecular processes relating to innate and adaptive immunity as well as to the control of DNA methylation and epigenetic stability. Here, we summarize the key structural and mechanistic features of DNA glycosylases with a special focus on the mammalian enzymes, and then review the evidence for the newly emerging biological functions beyond the protection of genome integrity
    corecore