Search CORE

FindPeaks 3.1: a tool for identifying areas of enrichment from massively parallel short-read sequencing technology

Author: Anthony P. Fejes
Gordon Robertson
Matthew Bainbridge
Mikhail Bilenky
Prof Alfonso Valencia
Richard Varhol
Steven J. M. Jones
Vz S
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

Summary: Next-generation sequencing can provide insight into protein–DNA association events on a genome-wide scale, and is being applied in an increasing number of applications in genomics and meta-genomics research. However, few software applications are available for interpreting these experiments. We present here an efficient application for use with chromatin-immunoprecipitation (ChIP-Seq) experimental data that includes novel functionality for identifying areas of gene enrichment and transcription factor binding site locations, as well as for estimating DNA fragment size distributions in enriched areas. The FindPeaks application can generate UCSC compatible custom ‘WIG’ track files from aligned-read files for short-read sequencing technology. The software application can be executed on any platform capable of running a Java Runtime Environment. Memory requirements are proportional to the number of sequencing reads analyzed; typically 4 GB permits processing of up to 40 million reads

CiteSeerX

Genomic analysis of a rare human tumor

Author: An Jianghong
Bilenky Mikhail
Birol Inanc
Butterfield Yaron S
Cezard Timothee
Chuah Eric
Corbett Richard
Fejes Anthony
Griffith Malachi
Griffith Obi L
Hirst Martin
Holt Robert A
Huntsman David G
Jones Steven JM
Laskin Janessa
Li Yvonne Y
Marra Marco A
Martin Montgomery
Mayo Michael
Melnyk Nataliya
Moore Richard A
Morin Ryan D
Pugh Trevor J
Severson Tesa
Shah Sohrab P
Sutcliffe Margaret
Tam Angela
Terry Jefferson
Thiessen Nina
Thomson Thomas
Varhol Richard
Yee John
Zeng Thomas
Zhao Yongjun
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Springer - Publisher Connector

Comprehensive molecular portraits of human breast tumours

Author: Ally A.
Ardlie K.
Auman J.
Balasundaram M.
Beroukhim R.
Birol I.
Butterfield Y.
Carlsen R.
Carter C.
Carter S.
Chao H.
Cherniack A.
Chin L.
Chu A.
Chuah E.
Chun H.
Coope R.
Crenshaw A.
Dhalla N.
Ding L.
Dooling D.
Fan C.
Fulton L.
Fulton R.
Gabriel S.
Gentry J.
Getz G.
Guin R.
He X.
Hernandez B.
Hirst C.
Hirst M.
Hoadley K.
Holt R.
Iglesia M.
Jones S.
Kalicki-Veizer J.
Koboldt D.
Kucherlapati R.
Lee D.
Li H.
Li L.
Mardis E.
Marra M.
Mayo M.
McLellan M.
McMichael J.
Meyerson M.
Moore R.
Mungall A.
Nguyen H.
Onofrio R.
Pho N.
Pleasance E.
Prat A.
Robertson A.
Saksena G.
Schein J.
Schmidt H.
Schumacher S.
Shafiei A.
Shi Y.
Silva G.
Sipahimalani P.
Slobodan J.
Stoll D.
Tabak B.
Tam A.
Thiessen N.
Topal M.
Turman Y.
Varhol Richard
Wilson R.
Winckler W.
Wye N.
Zeng T.
Zhao Y.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

We analysed primary breast cancers by genomic DNA copy number arrays, DNA methylation, exome sequencing, messenger RNA arrays, microRNA sequencing and reverse-phase protein arrays. Our ability to integrate information across platforms provided key insights into previously defined gene expression subtypes and demonstrated the existence of four main breast cancer classes when combining data from five platforms, each of which shows significant molecular heterogeneity. Somatic mutations in only three genes (TP53, PIK3CA and GATA3) occurred at.10% incidence across all breast cancers; however, there were numerous subtype-associated and novel gene mutations including the enrichment of specific mutations in GATA3, PIK3CA and MAP3K1 with the luminal A subtype. We identified two novel protein-expression-defined subgroups, possibly produced by stromal/microenvironmental elements, and integrated analyses identified specific signalling pathways dominant in each molecular subtype including a HER2/phosphorylated HER2/EGFR/phosphorylated EGFR signature within the HER2-enriched expression subtype. Comparison of basal-like breast tumours with high-grade serous ovarian tumours showed many molecular commonalities, indicating a related aetiology and similar therapeutic opportunities. The biological finding of the four main breast cancer subtypes caused by different subsets of genetic and epigenetic abnormalities raises the hypothesis that much of the clinically observable plasticity and heterogeneity occurs within, and not across, these major biological subtypes of breast cancer. © 2012 Macmillan Publishers Limited. All rights reserved

Diposit Digital de la Universitat de Barcelona

Carolina Digital Repository

Multiplatform Analysis of 12 Cancer Types Reveals Molecular Classification within and across Tissues of Origin

Author: Abbott Rachel
Abbott Scott
Akbani Rehan
Aksoy B. Arman
Aldape Kenneth
Ally Adrian
Amin Samirkumar
Anastassiou Dimitris
Auman J. Todd
Baggerly Keith A.
Balasundaram Miruna
Balu Saianand
Baylin Stephen B.
Benz Christopher C.
Benz Stephen C.
Berman Benjamin P.
Bernard Brady
Bhatt Ami S.
Birol Inanc
Black Aaron D.
Bodenheimer Tom
Bootwalla Moiz S.
Bowen Jay
Bressler Ryan
Bristow Christopher A.
Brooks Angela N.
Broom Bradley
Buda Elizabeth
Burton Robert
Butterfield Yaron S.N.
Byers Lauren A.
Carlin Daniel
Carter Scott L.
Casasent Tod D.
Chang Kyle
Chanock Stephen
Chen Zhong
Cherniack Andrew D.
Chin Lynda
Cho Dong Yeon
Cho Juok
Chu Andy
Chuah Eric
Chun Hye Jung E.
Cibulskis Kristian
Ciriello Giovanni
Cleland James
Cline Melisssa
Collisson Eric A.
Craft Brian
Creighton Chad J.
Danilova Ludmila
Davidsen Tanja
Davis Caleb
Dees Nathan D.
Delehaunty Kim
Demchok John A.
Dhalla Noreen
DiCara Daniel
Ding Li
Dinh Huyen
Dobson Jason R.
Dodda Deepti
Doddapaneni Harsha Vardhan
Donehower Lawrence
Dooling David J.
Dresdner Gideon
Drummond Jennifer
Eakin Andrea
Edgerton Mary
Eldred Jim M.
Eley Greg
Ellrott Kyle
Fan Cheng
Fei Suzanne
Felau Ina
Frazer Scott
Freeman Samuel S.
Frick Jessica
Fronick Catrina C.
Fulton Lucinda L.
Fulton Robert
Gabriel Stacey B.
Gao Jianjiong
Gastier-Foster Julie M.
Gehlenborg Nils
George Myra
Getz Gad
Gibbs Richard
Goldman Mary
Gonzalez-Perez Abel
Gross Benjamin
Guin Ranabir
Gunaratne Preethi
Hadjipanayis Angela
Hamilton Mark P.
Hamilton Stanley R.
Han Leng
Han Yi
Harper Hollie A.
Haseley Psalm
Haussler David
Hayes D. Neil
Heiman David I.
Helman Elena
Helsel Carmen
Herbrich Shelley M.
Herman James G.
Hinoue Toshinori
Hirst Carrie
Hirst Martin
Hoadley Katherine A.
Holt Robert A.
Hoyle Alan P.
Iype Lisa
Jacobsen Anders
Jeffreys Stuart R.
Jensen Mark A.
Jones Corbin D.
Jones Steven J.M.
Ju Zhenlin
Jung Joonil
Kahles Andre
Kahn Ari
Kalicki-Veizer Joelle
Kalra Divya
Kanchi Krishna Latha
Kandoth Cyriac
Kane David W.
Kim Hoon
Kim Jaegil
Knijnenburg Theo
Koboldt Daniel C.
Kovar Christie
Kramer Roger
Kreisberg Richard
Kucherlapati Raju
Ladanyi Marc
Laird Peter W.
Lander Eric S.
Larson David E.
Lawrence Michael S.
Lee Darlene
Lee Eunjung
Lee Semin
Lee William
Lehmann Kjong Van
Leinonen Kalle
Leiserson Max D.M.
Leraas Kristen M.
Lerner Seth
Levine Douglas A.
Lewis Lora
Ley Timothy J.
Li Haiyan I.
Li Jun
Li Wei
Liang Han
Lichtenberg Tara M.
Lin Jake
Lin Ling
Lin Pei
Liu Wenbin
Liu Yingchun
Liu Yuexin
Lopez-Bigas Nuria
Lorenzi Philip L.
Lu Charles
Lu Yiling
Luquette Lovelace J.
Ma Singer
Magrini Vincent J.
Mahadeshwar Harshad S.
Mardis Elaine R.
Margolin Adam A.
Marra Marco A.
Mayo Michael
McAllister Cynthia
McGuire Sean E.
McLellan Michael D.
McMichael Joshua F.
Melott James
Meng Shaowu
Meyerson Matthew
Mieczkowski Piotr A.
Miller Christopher A.
Miller Martin L.
Miller Michael
Mills Gordon B.
Moore Richard A.
Morgan Margaret
Morton Donna
Mose Lisle E.
Mungall Andrew J.
Muzny Donna
Ng Sam
Nguyen Lam
Niu Beifang
Noble Michael S.
Noushmehr Houtan
O'Laughlin Michelle
Ojesina Akinyemi I.
Omberg Larsson
Ozenberger Brad
Pantazi Angeliki
Parfenov Michael
Park Peter J.
Parker Joel S.
Paull Evan
Pedamallu Chandra Sekhar
Perou Charles M.
Pihl Todd
Pohl Craig
Pot David
Protopopov Alexei
Przytycka Teresa
Radenbaugh Amie
Ramirez Nilsa C.
Ramirez Ricardo
Raphael Benjamin J.
Reid Jeffrey
Ren Xiaojia
Reva Boris
Reynolds Sheila M.
Rhie Suhn K.
Roach Jeffrey
Robertson A. Gordon
Rovira Hector
Ryan Michael
Rätsch Gunnar
Saksena Gordon
Salama Sofie
Sander Chris
Santoso Netty
Schein Jacqueline E.
Schmidt Heather
Schultz Nikolaus
Schumacher Steven E.
Seidman Jonathan
Senbabaoglu Yasin
Seth Sahil
Sharpe Samantha
Shen Hui
Shen Ronglai
Sheth Margi
Shi Yan
Shmulevich Ilya
Silva Grace O.
Simons Janae V.
Sinha Rileen
Sipahimalani Payal
Smith Scott M.
Sofia Heidi J.
Sokolov Artem
Soloway Mathew G.
Song Xingzhi
Sougnez Carrie
Spellman Paul
Staudt Louis
Stewart Chip
Stojanov Petar
Stuart Joshua M.
Su Xiaoping
Sumer S. Onur
Sun Yichao
Swatloski Teresa
Tabak Barbara
Tam Angela
Tamborero David
Tan Donghui
Tang Jiabin
Tarnuzzer Roy
Taylor Barry S.
Thiessen Nina
Thorsson Vesteinn
Triche Timothy
Uzunangelov Vladislav
Van Den Berg David J.
Van Waes Carter
Van't Veer Laura J.
Vandin Fabio
Varhol Richard J.
Vaske Charles J.
Veluvolu Umadevi
Verhaak Roeland
Voet Doug
Walker Jason
Wallis John W.
Waltman Peter
Wan Yunhu
Wang Min
Wang Wenyi
Wang Zhining
Waring Scot
Weinhold Nils
Weinstein John N.
Weisenberger Daniel J.
Wendl Michael C.
Wheeler David
Wilkerson Matthew D.
Wilson Richard K.
Wise Lisa
Wolf Denise M.
Wong Andrew
Wu Chang Jiun
Wu Chia Chin
Wu Hsin Ta
Wu Junyuan
Wylie Todd
Xi Liu
Xi Ruibin
Xia Zheng
Xu Andrew W.
Yang Da
Yang Liming
Yang Lixing
Yang Tai Hsien Ou
Yang Yang
Yao Jun
Yao Rong
Yau Christina
Ye Kai
Yoshihara Kosuke
Yuan Yuan
Yung Alfred K.
Zack Travis
Zeng Dong
Zenklusen Jean Claude
Zhang Hailei
Zhang Jianhua
Zhang Jiashan
Zhang Nianxiang
Zhang Qunyuan
Zhang Wei
Zhao Wei
Zheng Siyuan
Zhu Jing
Zmuda Erik
Zou Lihua
Publication venue
Publication date: 01/01/2014
Field of study

Recent genomic analyses of pathologically-defined tumor types identify “within-a-tissue” disease subtypes. However, the extent to which genomic signatures are shared across tissues is still unclear. We performed an integrative analysis using five genome-wide platforms and one proteomic platform on 3,527 specimens from 12 cancer types, revealing a unified classification into 11 major subtypes. Five subtypes were nearly identical to their tissue-of-origin counterparts, but several distinct cancer types were found to converge into common subtypes. Lung squamous, head & neck, and a subset of bladder cancers coalesced into one subtype typified by TP53 alterations, TP63 amplifications, and high expression of immune and proliferation pathway genes. Of note, bladder cancers split into three pan-cancer subtypes. The multi-platform classification, while correlated with tissue-of-origin, provides independent information for predicting clinical outcomes. All datasets are available for data-mining from a unified resource to support further biological discoveries and insights into novel therapeutic strategies

The expression level of small non-coding RNAs derived from the first exon of protein-coding genes is predictive of cancer status

Author: Andrew J Mungall
Andy Chu
Athanasios Zovoilis
Gingeras T
Marco Marra
Richard Moore
Richard Varhol
Steven JM Jones
Tina Wong
Publication venue: 'Wiley'
Publication date: 01/01/2014
Field of study

Small non-coding RNAs (smRNAs) are known to be significantly enriched near the transcriptional start sites of genes. However, the functional relevance of these smRNAs remains unclear, and they have not been associated with human disease. Within the cancer genome atlas project (TCGA), we have generated small RNA datasets for many tumor types. In prior cancer studies, these RNAs have been regarded as transcriptional "noise," due to their apparent chaotic distribution. In contrast, we demonstrate their striking potential to distinguish efficiently between cancer and normal tissues and classify patients with cancer to subgroups of distinct survival outcomes. This potential to predict cancer status is restricted to a subset of these smRNAs, which is encoded within the first exon of genes, highly enriched within CpG islands and negatively correlated with DNA methylation levels. Thus, our data show that genome-wide changes in the expression levels of small non-coding RNAs within first exons are associated with cancer. Synopsis The expression of small non-coding RNAs encoded within the first exon of genes can be used to efficiently identify cancer samples and classify patients into subgroups of different survival. Such pan-cancer association is the first link between these RNAs and disease. Exon 1 small non-coding RNAs (smRNAs) can distinguish between cancer and normal tissues. The prediction potential of exon 1 smRNAs differs from that of other smRNAs around transcriptional start sites (TSS). smRNA locations around TSS are conserved between different individuals. smRNA locations are enriched within CpG islands and their levels negatively correlated with DNA methylation. The expression of small non-coding RNAs encoded within the first exon of genes can be used to efficiently identify cancer samples and classify patients into subgroups of different survival. Such pan-cancer association is the first link between these RNAs and disease. © 2014 The Authors

De novo transcriptome assembly with ABySS

Author: Birol I.
Connors J.
Gascoyne R.
Hirst M.
Horsman D.
Jackman S.
Jones S.
Marra M.
Morin R.
Nielsen C.
Qian J.
Schein J.
Stazyk G.
Varhol Richard
Zhao Y.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2009
Field of study

Motivation: Whole transcriptome shotgun sequencing data from non-normalized samples offer unique opportunities to study the metabolic states of organisms. One can deduce gene expression levels using sequence coverage as a surrogate, identify coding changes or discover novel isoforms or transcripts. Especially for discovery of novel events, de novo assembly of transcriptomes is desirable. Results: Transcriptome from tumor tissue of a patient with follicular lymphoma was sequenced with 36 base pair (bp) single- and paired-end reads on the Illumina Genome Analyzer II platform. We assembled ~194 million reads using ABySS into 66 921 contigs 100 bp or longer, with a maximum contig length of 10 951 bp, representing over 30 million base pairs of unique transcriptome sequence, or roughly 1% of the genome. © The Author 2009. Published by Oxford University Press. All rights reserved

Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing

Author: Anthony Fejes
Helen McDonald
Marco A. Marra
Martin Hirst
Martin Krzywinski
Matthew Bainbridge
Richard Varhol
Ryan D. Morin
Steven J.M. Jones
Thierry-Mieg D.
Trevor J. Pugh
Publication venue: 'Future Science Ltd'
Publication date: 01/01/2008
Field of study

Sequence-based methods for transcriptome characterization have typically relied on generation of either serial analysis of gene expression tags or expressed sequence tags. Although such approaches have the potential to enumerate transcripts by counting sequence tags derived from them, they typically do not robustly survey the majority of transcripts along their entire length. Here we show that massively parallel sequencing of randomly primed cDNAs, using a next-generation sequencing-by-synthesis technology, offers the potential to generate relative measures of mRNA and individual exon abundance while simultaneously profiling the prevalence of both annotated and novel exons and exon-splicing events. This technique identifies known single nucleotide polymorphisms (SNPs) as well as novel single-base variants. Analysis of these variants, and previously unannotated splicing events in the HeLa S3 cell line, reveals an overrepresentation of gene categories including those previously implicated in cancer

The Lung Microbiome In COPD

Author: Adam S.
Birol I.
Dimitriu P.
Elliott M.
Friedman J.
Gosselink J.
Hayashi S.
He A.
Hogg J.
McDonough J.
Miller D.
Mohn W.
Moore R.
Sin D.
Sze M.
Varhol Richard
Zhao Y.
Publication venue
Publication date: 01/01/2011
Field of study