Search CORE

45 research outputs found

RNA-Seq analysis of splicing in Plasmodium falciparum uncovers new splice junctions, alternative splicing and splicing of antisense transcripts.

Author: DeRisi Joseph L
Dimon Michelle T
Sorber Katherine
Publication venue: eScholarship, University of California
Publication date: 17/01/2011
Field of study

Over 50% of genes in Plasmodium falciparum, the deadliest human malaria parasite, contain predicted introns, yet experimental characterization of splicing in this organism remains incomplete. We present here a transcriptome-wide characterization of intraerythrocytic splicing events, as captured by RNA-Seq data from four timepoints of a single highly synchronous culture. Gene model-independent analysis of these data in conjunction with publically available RNA-Seq data with HMMSplicer, an in-house developed splice site detection algorithm, revealed a total of 977 new 5' GU-AG 3' and 5 new 5' GC-AG 3' junctions absent from gene models and ESTs (11% increase to the current annotation). In addition, 310 alternative splicing events were detected in 254 (4.5%) genes, most of which truncate open reading frames. Splicing events antisense to gene models were also detected, revealing complex transcriptional arrangements within the parasite's transcriptome. Interestingly, antisense introns overlap sense introns more than would be expected by chance, perhaps indicating a functional relationship between overlapping transcripts or an inherent organizational property of the transcriptome. Independent experimental validation confirmed over 30 new antisense and alternative junctions. Thus, this largest assemblage of new and alternative splicing events to date in Plasmodium falciparum provides a more precise, dynamic view of the parasite's transcriptome

PubMed Central

eScholarship - University of California

Recommended from our members

The lincRNA MIRAT binds to IQGAP1 and modulates the MAPK pathway in NRAS mutant melanoma.

Author: Burlingame Alma
Daud Adil
Dimon Michelle
Esteve-Puig Rosaura
Gajjala Abhinay
Gho Deborah
Ho Wilson
Johnston Katia
Lai Kevin
Lin Kevin
Moy Adrian
Ortiz-Urda Susana
Oses Prieto Juan
Posch Christian
Rappersberger Klemens
Sanlorenzo Martina
Vujic Igor
Vujic Marin
Zekhtser Mitchell
Publication venue: eScholarship, University of California
Publication date: 01/07/2018
Field of study

Despite major advances in targeted melanoma therapies, drug resistance limits their efficacy. Long noncoding RNAs (lncRNAs) are transcriptome elements that do not encode proteins but are important regulatory molecules. LncRNAs have been implicated in cancer development and response to different therapeutics and are thus potential treatment targets; however, the majority of their functions and molecular interactions remain unexplored. In this study, we identify a novel cytoplasmic intergenic lincRNA (MIRAT), which is upregulated following prolonged MAPK inhibition in NRAS mutant melanoma and modulates MAPK signaling by binding to the MEK scaffold protein IQGAP1. Collectively, our results present MIRAT's direct modulatory effect on the MAPK pathway and highlight the relevance of cytoplasmic lncRNAs as potential targets in drug resistant cancer

eScholarship - University of California

Lehigh Valley Health Network: LVHN Scholarly Works

Automated detection and staging of malaria parasites from cytological smears using convolutional neural networks

Author: Ando Dale Michael
Andradi-Brown Clare
Andrew Dean W.
Ashdown George W.
Baum Jake
Boyle Michelle J.
Chmielewski Jill
Cunningham Kane A.
Davidson Mira S.
Dimon Michelle
Dvorin Jeffrey D.
Gurung Pratima
Jeninga Myriam D.
O’Donnell Aidan J.
Petter Michaela
Prommana Parichat
Reece Sarah E.
Uthaipibull Chairat
Wilson Danny W.
Yahiya Sabrina
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 02/08/2021
Field of study

Microscopic examination of blood smears remains the gold standard for laboratory inspection and diagnosis of malaria. Smear inspection is, however, time-consuming and dependent on trained microscopists with results varying in accuracy. We sought to develop an automated image analysis method to improve accuracy and standardization of smear inspection that retains capacity for expert confirmation and image archiving. Here, we present a machine learning method that achieves red blood cell (RBC) detection, differentiation between infected/uninfected cells, and parasite life stage categorization from unprocessed, heterogeneous smear images. Based on a pretrained Faster Region-Based Convolutional Neural Networks (R-CNN) model for RBC detection, our model performs accurately, with an average precision of 0.99 at an intersection-over-union threshold of 0.5. Application of a residual neural network-50 model to infected cells also performs accurately, with an area under the receiver operating characteristic curve of 0.98. Finally, combining our method with a regression model successfully recapitulates intraerythrocytic developmental cycle with accurate lifecycle stage categorization. Combined with a mobile-friendly web-based interface, called PlasmoCount, our method permits rapid navigation through and review of results for quality assurance. By standardizing assessment of Giemsa smears, our method markedly improves inspection reproducibility and presents a realistic route to both routine lab and future field-based automated malaria diagnosis

PubMed Central

Edinburgh Research Explorer

ReCombine: A Suite of Programs for Detection and Analysis of Meiotic Recombination in Whole-Genome Datasets

Author: Ashwini Oke
B Langmead
Carol M. Anderson
DD Perkins
E Mancera
E Mancera
E Martini
EA Winzeler
FW Stahl
GA Cromie
H Li
H Zhao
HP Papazian
Illumina
J Qi
J van Oeveren
Jennifer C. Fung
JH McCusker
JM Cherry
Joseph L. DeRisi
JW Szostak
K Sorber
Michael Lichten
Michelle T. Dimon
MS McPeek
Q Zhao
R Bourgon
R Li
R Li
S Kurtz
Stacy Y. Chen
SY Chen
T de los Santos
T Hassold
W Wei
Z Ning
Publication venue: Public Library of Science
Publication date: 25/10/2011
Field of study

In meiosis, the exchange of DNA between chromosomes by homologous recombination is a critical step that ensures proper chromosome segregation and increases genetic diversity. Products of recombination include reciprocal exchanges, known as crossovers, and non-reciprocal gene conversions or non-crossovers. The mechanisms underlying meiotic recombination remain elusive, largely because of the difficulty of analyzing large numbers of recombination events by traditional genetic methods. These traditional methods are increasingly being superseded by high-throughput techniques capable of surveying meiotic recombination on a genome-wide basis. Next-generation sequencing or microarray hybridization is used to genotype thousands of polymorphic markers in the progeny of hybrid yeast strains. New computational tools are needed to perform this genotyping and to find and analyze recombination events. We have developed a suite of programs, ReCombine, for using short sequence reads from next-generation sequencing experiments to genotype yeast meiotic progeny. Upon genotyping, the program CrossOver, a component of ReCombine, then detects recombination products and classifies them into categories based on the features found at each location and their distribution among the various chromatids. CrossOver is also capable of analyzing segregation data from microarray experiments or other sources. This package of programs is designed to allow even researchers without computational expertise to use high-throughput, whole-genome methods to study the molecular mechanisms of meiotic recombination

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Long March: A Sample Preparation Technique that Enhances Contig Length and Coverage by High-Throughput Short-Read Sequencing

Author: A Janulaitis
AF Siegel
Armin Hekele
Charles Chiu
CJ Stoeckert Jr
CT Wai
Dale Webster
ER Mardis
F Mashayekhi
F Mathieu-Daude
F Sanger
H Okamoto
H Wakaguri
J Shendure
J. Graham Ruby
JO Korbel
Joseph L. DeRisi
Katherine Sorber
KE Wommack
M Chaisson
M Hafner
M Petrusyte
M Pop
M Ronaghi
Mark A. Batzer
Michelle Dimon
MJ Gardner
N Whiteford
O Salas-Solano
R Knight
RA Holt
RJ Roberts
RL Warren
SF Altschul
SF Altschul
SM Hadi
TS Seo
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

High-throughput short-read technologies have revolutionized DNA sequencing by drastically reducing the cost per base of sequencing information. Despite producing gigabases of sequence per run, these technologies still present obstacles in resequencing and de novo assembly applications due to biased or insufficient target sequence coverage. We present here a simple sample preparation method termed the “long march” that increases both contig lengths and target sequence coverage using high-throughput short-read technologies. By incorporating a Type IIS restriction enzyme recognition motif into the sequencing primer adapter, successive rounds of restriction enzyme cleavage and adapter ligation produce a set of nested sub-libraries from the initial amplicon library. Sequence reads from these sub-libraries are offset from each other with enough overlap to aid assembly and contig extension. We demonstrate the utility of the long march in resequencing of the Plasmodium falciparum transcriptome, where the number of genomic bases covered was increased by 39%, as well as in metagenomic analysis of a serum sample from a patient with hepatitis B virus (HBV)-related acute liver failure, where the number of HBV bases covered was increased by 42%. We also offer a theoretical optimization of the long march for de novo sequence assembly

CiteSeerX

Public Library of Science (PLOS)

Crossref

PubMed Central

eScholarship - University of California

HMMSplicer: A Tool for Efficient and Sensitive Discovery of Known and Novel Splice Junctions in RNA-Seq Data

Author: A Ameur
A Mortazavi
B Langmead
BT Wilhelm
C Sidrauski
C Trapnell
C Trapnell
Cynthia Gibas
D Ramsköld
DA Benson
DW Bryant
ET Wang
F De Bona
F Lu
GA Heap
GE Crooks
H Li
H Li
H Nagasaki
H Richard
H Yoshida
JC Dohm
Joseph L. DeRisi
JS Cox
K Sorber
Katherine Sorber
KD Pruitt
KF Au
L Baum
M Deutsch
M Yano
MC Wahl
Michelle T. Dimon
MJ Gardner
PJ Shepard
Q Pan
R Li
R Lister
S Sen
S Stamm
TW Nilsen
U Nagalakshmi
WJ Kent
WJ Kent
Z Wang
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Background: High-throughput sequencing of an organism’s transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown. Methodology/Principal Findings: Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity. Conclusions/Significance: HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on prebuilt gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6 % of 39 splice sites and 1.4% of 59 splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available a

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

RNA-seq analyses of blood-induced changes in gene expression in the mosquito vector species, Aedes aegypti

Abstract Background Hematophagy is a common trait of insect vectors of disease. Extensive genome-wide transcriptional changes occur in mosquitoes after blood meals, and these are related to digestive and reproductive processes, among others. Studies of these changes are expected to reveal molecular targets for novel vector control and pathogen transmission-blocking strategies. The mosquito <it>Aedes aegypti </it>(Diptera, Culicidae), a vector of Dengue viruses, Yellow Fever Virus (YFV) and Chikungunya virus (CV), is the subject of this study to look at genome-wide changes in gene expression following a blood meal. Results Transcriptional changes that follow a blood meal in <it>Ae. aegypti </it>females were explored using RNA-seq technology. Over 30% of more than 18,000 investigated transcripts accumulate differentially in mosquitoes at five hours after a blood meal when compared to those fed only on sugar. Forty transcripts accumulate only in blood-fed mosquitoes. The list of regulated transcripts correlates with an enhancement of digestive activity and a suppression of environmental stimuli perception and innate immunity. The alignment of more than 65 million high-quality short reads to the <it>Ae. aegypti </it>reference genome permitted the refinement of the current annotation of transcript boundaries, as well as the discovery of novel transcripts, exons and splicing variants. <it>Cis</it>-regulatory elements (CRE) and <it>cis</it>-regulatory modules (CRM) enriched significantly at the 5'end flanking sequences of blood meal-regulated genes were identified. Conclusions This study provides the first global view of the changes in transcript accumulation elicited by a blood meal in <it>Ae. aegypti </it>females. This information permitted the identification of classes of potentially co-regulated genes and a description of biochemical and physiological events that occur immediately after blood feeding. The data presented here serve as a basis for novel vector control and pathogen transmission-blocking strategies including those in which the vectors are modified genetically to express anti-pathogen effector molecules.</p

Crossref

Archivio Istituzionale della Ricerca - Università degli Studi di Pavia

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

IMSA: Integrated metagenomic sequence analysis for identification of exogenous reads in a host genomic background

Author: AA Pragman
AD Kostic
AJ Saldanha
AL Kistler
B Langmead
C Conway
C Runckel
D Hernandez
DA Benson
G Grard
H Feng
H Feng
Henry M. Wood
J Cheval
J Handelsman
J Yang
JC Lagier
Mark R. Liles
MB Eisen
MD Stenglein
Michelle T. Dimon
Pamela H. Rabbitts
Sarah T. Arron
SF Altschul
Z Lin
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Metagenomics, the study of microbial genomes within diverse environments, is a rapidly developing field. The identification of microbial sequences within a host organism enables the study of human intestinal, respiratory, and skin microbiota, and has allowed the identification of novel viruses in diseases such as Merkel cell carcinoma. There are few publicly available tools for metagenomic high throughput sequence analysis. We present Integrated Metagenomic Sequence Analysis (IMSA), a flexible, fast, and robust computational analysis pipeline that is available for public use. IMSA takes input sequence from high throughput datasets and uses a user-defined host database to filter out host sequence. IMSA then aligns the filtered reads to a user-defined universal database to characterize exogenous reads within the host background. IMSA assigns a score to each node of the taxonomy based on read frequency, and can output this as a taxonomy report suitable for cluster analysis or as a taxonomy map (TaxMap). IMSA also outputs the specific sequence reads assigned to a taxon of interest for downstream analysis. We demonstrate the use of IMSA to detect pathogens and normal flora within sequence data from a primary human cervical cancer carrying HPV16, a primary human cutaneous squamous cell carcinoma carrying HPV 16, the CaSki cell line carrying HPV16, and the HeLa cell line carrying HPV18

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

White Rose Research Online

Recommended from our members

Genomic approaches to the study of splicing in Plasmodium falciparum and other organisms using high throughput sequencing

Author: Dimon Michelle Therese
Publication venue: eScholarship, University of California
Publication date: 01/01/2010
Field of study

In the last five years, high throughput sequencing has revolutionized biological research. The ability to quickly generate millions of short sequence reads enables studies that would have been inconceivable even 10 years ago. This work focuses on RNA-Seq, the application of high throughput sequencing to an organism's transcriptome. We describe a method of library preparation that improves sequence coverage, a new algorithm for detecting splice junctions in the datasets, and finally, application of these techniques to the study of splicing in Plasmodium falciparum.The long march is a technique for Solexa library preparation that increases contig length and target sequence coverage. The long march incorporates a Type IIS restriction enzyme into the sequencing primer adapter. Each round of marching cuts off the initial part of the read and ligates a new adapter downstream, creating overlapping reads. Validation on P. falciparum genomic and human hepatitis B virus positive samples showed 39% and 42%, respectively, increases in numbers of bases covered.Next we developed an algorithm to detect spliced reads crossing exon-exon junctions in RNA-Seq datasets. Our algorithm uses an unbiased approach, relying only on the read dataset and a reference genome, detecting canonical and noncanonical splice junctions. This works by dividing reads in half for initial seeding in the reference genome then using an HMM, trained on the input data, to determine the optimal splice position. Our algorithm provides a score for each splice junction, which allows researchers to tune the false positive rate to the requirements of their experiment. This approach identifies more splice junctions than currently available algorithms, without a reduction in specificity, when tested on publicly available datasets for Arabidopsis thaliana, Plasmodium falciparum, and Homo sapiens.Finally, our library preparation technique and splice detection algorithm were used to study splicing in P. falciparum. Both our data and publicly available datasets were used to identify splicing events in the blood stages of the parasite. We confirmed 6,678 previously known introns and identified 977 novel introns with canonical splice edges. In addition, we detected 310 alternative slicing events as well as splicing events antisense to known transcripts

eScholarship - University of California