
376 research outputs found

MGMR: leveraging RNA-Seq population data to optimize expression estimation

Author: A Oshlack
B Li
B Li
B Pasaniuc
C Trapnell
Eran Halperin
JH Bullard
JK Pickrell
KA Frazer
L Pachter
MD Robinson
Ron Shamir
Roye Rozov
SB Montgomery
TP Minka
Publication venue: BioMed Central
Publication date: 01/04/2012

Background: RNA-Seq is a technique that uses Next Generation Sequencing to identify transcripts and estimate transcription levels. When applying this technique for quantification, one must contend with reads that align to multiple positions in the genome (multireads). Previous efforts to resolve multireads have shown that RNA-Seq expression estimation can be improved using probabilistic allocation of reads to genes. These methods use a probabilistic generative model for data generation and resolve ambiguity using likelihood-based approaches. In many instances, RNA-seq experiments are performed in the context of a population. The generative models of current methods do not take into account such population information, and it is an open question whether this information can improve quantification of the individual samples. Results: In order to explore the contribution of population level information in RNA-seq quantification, we apply a hierarchical probabilistic generative model, which assumes that expression levels of different individuals are sampled from a Dirichlet distribution with parameters specific to the population, and reads are sampled from the distribution of expression levels. We introduce an optimization procedure for the estimation of the model parameters, and use HapMap data and simulated data to demonstrate that the model yields a significant improvement in the accuracy of expression levels of paralogous genes. Conclusions: We provide a proof of principle of the benefit of drawing on population commonalities to estimate expression. The results of our experiments demonstrate this approach can be beneficial, primarily for estimation at the gene level.

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

FusionFinder: A Software Tool to Identify Expressed Gene Fusion Candidates from RNA-Seq Data

Author: A McPherson
A Sboner
Alex H. Beesley
B Escobar
B Langmead
C Trapnell
C Trapnell
CA Maher
CA Maher
CB Lozzio
D Kim
Denise Anderson
E Shtivelman
FJ Novo
H Edgren
H Ge
H Li
JD Rowley
JE Stajich
JK Maranchie
JL Byrne
JZ Levin
K Inaki
Katherine Thompson-Wicking
Kim W. Carter
MA Quail
MD Robinson
P Flicek
PC Nowell
Q Zhao
Richard W. Francis
S Nacu
SA Forbes
Steve Horvath
T Maniatis
T Takahashi
Ursula R. Kees
W Wuyts
Y Li
Z Wang
Publication venue: Public Library of Science
Publication date: 27/06/2012

The hallmarks of many haematological malignancies and solid tumours are chromosomal translocations, which may lead to gene fusions. Recently, next-generation sequencing techniques at the transcriptome level (RNA-Seq) have been used to verify known and discover novel transcribed gene fusions. We present FusionFinder, a Perl-based software designed to automate the discovery of candidate gene fusion partners from single-end (SE) or paired-end (PE) RNA-Seq read data. FusionFinder was applied to data from a previously published analysis of the K562 chronic myeloid leukaemia (CML) cell line. Using FusionFinder we successfully replicated the findings of this study and detected additional previously unreported fusion genes in their dataset, which were confirmed experimentally. These included two isoforms of a fusion involving the genes BRK1 and VHL, whose co-deletion has previously been associated with the prevalence and severity of renal-cell carcinoma. FusionFinder is made freely available for non-commercial use and can be downloaded from the project website (http://bioinformatics.childhealthresearch.org.au/software/fusionfinder/)

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

The Echinococcus canadensis (G7) genome: A key knowledge of parasitic platyhelminth human diseases

Author: A Bankevich
A Gurevich
A Lomsadze
A Lomsadze
Adolfo Fox
AM Bolger
Anna C. M. Salim
B Hendrich
B Langmead
C Bermudez-Santana
C Hahn
C Holt
C Jiang
C Trapnell
CA Alvarez Rojas
CCM Budke
D Kim
D Takai
DP McManus
DR Zerbino
E Elkayam
E Keibler
E Quevillon
F Jeanmougin
F Kiefer
F Mohn
Federico Camicia
Flávio M. Gomes Araújo
G Abrusán
G Parra
GSC Slater
Guilherme Oliveira
H Li
H Zheng
I Korf
IJ Tsai
IJ Tsai
J Eckert
JK Nono
JM Bart
JP Hewitson
Juliana Assis
K Arnold
K Matsuo
K Thivierge
K Wasik
KJ Fryxell
KK Geyer
KK Geyer
L Han
L Han
L Kamenetzky
L Kamenetzky
L Kamenetzky
L Li
LA Kelley
Laura Kamenetzky
LD Moore
Lucas L. Maldonado
M Ashburner
M Biasini
M Cucher
M Cucher
M Krzywinski
M Marín
M Nakao
M Nakao
M Nakao
M Nakao
M Nakao
M Rosenzvit
M Sajid
M Stanke
MA Cucher
Mara Rosenzvit
Marcela Cucher
MC Rosenzvit
MW Robinson
N Guex
N Macchiaroli
N Schürmann
Natalia Macchiaroli
ND Young
O Bogdanović
P Carninci
P Cingolani
P Danecek
PM Muzulin
PM Schantz
PS Craig
R Luo
R Schneider
RD Finn
RJ Klose
S Assefa
S Maillard
S Saxonov
S Yi
SF Altschul
SM Sadjjadi
TD Otto
TD Otto
TM Lowe
U Koziol
U Saarma
W Pan
Y Moriya
Y Safonova
YA Medvedeva
Z Zhao
Publication venue: Springer Science and Business Media LLC
Publication date: 01/02/2017

Background: The parasite Echinococcus canadensis (G7) (phylum Platyhelminthes, class Cestoda) is one of the causative agents of echinococcosis. Echinococcosis is a worldwide chronic zoonosis affecting humans as well as domestic and wild mammals, which has been reported as a prioritized neglected disease by the World Health Organisation. No genomic data, comparative genomic analyses or efficient therapeutic and diagnostic tools are available for this severe disease. The information presented in this study will help to understand the peculiar biological characters and to design species-specific control tools. Results: We sequenced, assembled and annotated the 115-Mb genome of E. canadensis (G7). Comparative genomic analyses using whole genome data of three Echinococcus species not only confirmed the status of E. canadensis (G7) as a separate species but also demonstrated high nucleotide sequence divergence in relation to E. granulosus (G1). The E. canadensis (G7) genome contains 11,449 genes with a core set of 881 orthologs shared among five cestode species. Comparative genomics revealed that there are more single nucleotide polymorphisms (SNPs) between E. canadensis (G7) and E. granulosus (G1) than between E. canadensis (G7) and E. multilocularis. This result was unexpected since E. canadensis (G7) and E. granulosus (G1) were considered to belong to the species complex E. granulosus sensu lato. We described SNPs in known drug targets and metabolism genes in the E. canadensis (G7) genome. Regarding gene regulation, we analysed three particular features: CpG island distribution along the three Echinococcus genomes, DNA methylation system and small RNA pathway. The results suggest the occurrence of yet unknown gene regulation mechanisms in Echinococcus. Conclusions: This is the first work that addresses Echinococcus comparative genomics. The resources presented here will promote the study of mechanisms of parasite development as well as new tools for drug discovery.
The availability of a high-quality genome assembly is critical for fully exploring the biology of a pathogenic organism. The E. canadensis (G7) genome presented in this study provides a unique opportunity to address the genetic diversity among the genus Echinococcus and its particular developmental features. At present, there is no unequivocal taxonomic classification of Echinococcus species; however, the genome-wide SNP analysis performed here revealed the phylogenetic distance among these three Echinococcus species. Additional cestode genomes need to be sequenced to be able to resolve their phylogeny.

Affiliation: Maldonado, Lucas Luciano. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Investigaciones en Microbiología y Parasitología Médica. Universidad de Buenos Aires. Facultad de Medicina. Instituto de Investigaciones en Microbiología y Parasitología Médica; Argentina
Affiliation: Assis, Juliana. Fundación Oswaldo Cruz; Brazil
Affiliation: Gomes Araújo, Flávio M. Fundación Oswaldo Cruz; Brazil
Affiliation: Salim, Anna C. M. Fundación Oswaldo Cruz; Brazil
Affiliation: Macchiaroli, Natalia. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Investigaciones en Microbiología y Parasitología Médica. Universidad de Buenos Aires. Facultad de Medicina. Instituto de Investigaciones en Microbiología y Parasitología Médica; Argentina
Affiliation: Cucher, Marcela Alejandra. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Investigaciones en Microbiología y Parasitología Médica. Universidad de Buenos Aires. Facultad de Medicina. Instituto de Investigaciones en Microbiología y Parasitología Médica; Argentina
Affiliation: Camicia, Federico. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Investigaciones en Microbiología y Parasitología Médica. Universidad de Buenos Aires. Facultad de Medicina. Instituto de Investigaciones en Microbiología y Parasitología Médica; Argentina
Affiliation: Fox, Adolfo. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Investigaciones en Microbiología y Parasitología Médica. Universidad de Buenos Aires. Facultad de Medicina. Instituto de Investigaciones en Microbiología y Parasitología Médica; Argentina
Affiliation: Rosenzvit, Mara Cecilia. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Investigaciones en Microbiología y Parasitología Médica. Universidad de Buenos Aires. Facultad de Medicina. Instituto de Investigaciones en Microbiología y Parasitología Médica; Argentina
Affiliation: Oliveira, Guilherme. Instituto Tecnológico Vale; Brazil. Fundación Oswaldo Cruz; Brazil
Affiliation: Kamenetzky, Laura. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Investigaciones en Microbiología y Parasitología Médica. Universidad de Buenos Aires. Facultad de Medicina. Instituto de Investigaciones en Microbiología y Parasitología Médica; Argentina

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

CONICET Digital

PubMed Central

FigShare

Improving gene-set enrichment analysis of RNA-Seq data with small replicates

Author: A Liberzon
A Subramanian
BR Zeeberg
BS Carver
C Lee
C Trapnell
CW Law
CW Law
D Eddelbuettel
D Nam
D Nam
D Nam
D Nam
D Nam
D Wu
DC Koboldt
Dongmei Li
Dougu Nam
F Rapaport
GK Smyth
H Jiang
H Li
HL Li
J Li
J Li
JC Marioni
JH Bullard
JJ Goeman
JK Pickrell
JK Schwarz
JX Feng
KA Gray
MA Dillies
MA Newton
MD Robinson
MD Robinson
MD Robinson
MD Young
ME Ritchie
MI Love
Q Xiong
Q Xiong
S Anders
S Song
Seon-Young Kim
Sora Yoon
U Nagalakshmi
V Saxena
W Huang da
WT Barry
X Wang
X Wang
Z Wang
Publication venue: Public Library of Science (PLoS)
Publication date: 09/11/2016

Deregulated pathways identified from transcriptome data of two sample groups have played a key role in many genomic studies. Gene-set enrichment analysis (GSEA) has been commonly used for pathway or functional analysis of microarray data, and it is also being applied to RNA-seq data. However, most RNA-seq data so far have only a small number of replicates. This forces the use of the gene-permuting GSEA method (or preranked GSEA), which results in a great number of false positives due to the inter-gene correlation in each gene-set. We demonstrate that incorporating the absolute gene statistic in one-tailed GSEA considerably improves the false-positive control and the overall discriminatory ability of the gene-permuting GSEA methods for RNA-seq data. To test the performance, a simulation method to generate correlated read counts within a gene-set was newly developed, and a dozen currently available RNA-seq enrichment analysis methods were compared, where the proposed methods outperformed others that do not account for the inter-gene correlation. Analysis of real RNA-seq data also supported the proposed methods in terms of false positive control, ranks of true positives and biological relevance. An efficient R package (AbsFilterGSEA) coded with C++ (Rcpp) is available from CRAN.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

ScholarWorks@UNIST

FigShare

CodingQuarry: Highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts

Author: A Guida
A Kumar
A Lomsadze
AD Neverov
Alison C Testa
AM McGuire
AV Lukashin
BJ Haas
BJ Haas
BJ Haas
BJ Loftus
BL Cantarel
C Camacho
C Holt
C Trapnell
C Trapnell
C Zhao
D Cullen
D Kim
D Martinez
DHD Kulp
DM Kupfer
GC Cerqueira
I Korf
I Reid
J Liu
James K Hane
JE Galagan
JK Hane
KJ Hoff
KR Christie
L Wang
M Berg Van Den
M Burset
M Dashtban
M Kozak
M Marcet-Houben
M Martin
M Stanke
M Stanke
M Stanke
MG Grabherr
N Rhind
NR Coordinators
R Dean
R Leinonen
RD Finn
Richard P Oliver
RP Oliver
RY Eberhardt
SB Hedges
Simon R Ellwood
SL Forsburg
SR Ellwood
T Steijger
TL Friesen
TU Consortium
V Ter-Hovhannisyan
VM Bruno
WM Vos de
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

Background: The impact of gene annotation quality on functional and comparative genomics makes gene prediction an important process, particularly in non-model species, including many fungi. Sets of homologous protein sequences are rarely complete with respect to the fungal species of interest and are often small or unreliable, especially when closely related species have not been sequenced or annotated in detail. In these cases, protein homology-based evidence fails to correctly annotate many genes, or significantly improve ab initio predictions. Generalised hidden Markov models (GHMM) have proven to be invaluable tools in gene annotation and, recently, RNA-seq has emerged as a cost-effective means to significantly improve the quality of automated gene annotation. As these methods do not require sets of homologous proteins, improving gene prediction from these resources is of benefit to fungal researchers. While many pipelines now incorporate RNA-seq data in training GHMMs, there has been relatively little investigation into additionally combining RNA-seq data at the point of prediction, and room for improvement in this area motivates this study. Results: CodingQuarry is a highly accurate, self-training GHMM fungal gene predictor designed to work with assembled, aligned RNA-seq transcripts. RNA-seq data informs annotations both during gene-model training and in prediction. Our approach capitalises on the high quality of fungal transcript assemblies by incorporating predictions made directly from transcript sequences. Correct predictions are made despite transcript assembly problems, including those caused by overlap between the transcripts of adjacent gene loci. Stringent benchmarking against high-confidence annotation subsets showed CodingQuarry predicted 91.3% of Schizosaccharomyces pombe genes and 90.4% of Saccharomyces cerevisiae genes perfectly. These results are 4-5% better than those of AUGUSTUS, the next best performing RNA-seq driven gene predictor tested. 
Comparisons against whole genome Sc. pombe and S. cerevisiae annotations further substantiate a 4-5% improvement in the number of correctly predicted genes. Conclusions: We demonstrate the success of a novel method of incorporating RNA-seq data into GHMM fungal gene prediction. This shows that a high quality annotation can be achieved without relying on protein homology or a training set of genes. CodingQuarry is freely available (https://sourceforge.net/projects/codingquarry/), and suitable for incorporation into genome annotation pipelines

Crossref

Springer - Publisher Connector

PubMed Central

espace@Curtin

SeqGene: a comprehensive software solution for mining exome- and transcriptome- sequencing data

Author: A Mortazavi
AL Dixon
B Langmead
BE Stranger
BT Wilhelm
C Trapnell
DB Johnson
DC Koboldt
ER Mardis
ES Venkatraman
GA Heap
GK Smyth
H Li
H Li
J Wang
JC Marioni
JI Kim
JK Pickrell
JT Robinson
KA Frazer
L Wang
M Kanehisa
MF Moffatt
N Cloonan
PA Fujita
Q Zhao
R Goya
R Li
R Lister
RM Durbin
S Sherry
SB Montgomery
SB Ng
SD Nimer
TJP Hubbard
V Ramensky
W Cookson
X Yi
XJ Yan
Xutao Deng
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: The popularity of massively parallel exome and transcriptome sequencing projects demands new data mining tools with a comprehensive set of features to support a wide range of analysis tasks. Results: SeqGene, a new data mining tool, supports mutation detection and annotation, dbSNP and 1000 Genome data integration, RNA-Seq expression quantification, mutation and coverage visualization, allele specific expression (ASE), differentially expressed genes (DEGs) identification, copy number variation (CNV) analysis, and gene expression quantitative trait loci (eQTLs) detection. We also developed novel methods for testing the association between SNP and expression and identifying genotype-controlled DEGs. We showed that the results generated from SeqGene compare favourably to other existing methods in our case studies. Conclusion: SeqGene is designed as a general-purpose software package. It supports both paired-end reads and single reads generated on most sequencing platforms; it runs on all major types of computers; it supports arbitrary genome assemblies for arbitrary organisms; and it scales well to support both large and small scale sequencing projects. The software homepage is http://seqgene.sourceforge.net.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Field pathogenomics reveals the emergence of a diverse wheat yellow rust population

Author: A Gross
A Stamatakis
A Untergasser
AJ Westermann
AM Zaki
Amelia Hubbard
B Langmead
BA McDonald
C Firth
C Trapnell
CC Linde
Clare M Lewis
Claude de Vallavieille-Pope
Cristobal Uauy
D Cantu
D Cantu
Diane GO Saunders
H Goyeau
H Li
H Li
I Letunic
J Popp
JA Kolmer
Jane Thomas
JD Jones
JJ Burdon
JK Pritchard
JK Taubenberger
JPRE Dimmock
K Tamura
Kentaro Yoshida
M Mboup
M Trick
MD Bennett
MD Robinson
MS Hovmoller
MS Hovmoller
P Cingolani
P Librado
PA Wilkinson
R Park
R Rodriguez-Guerra
RA Edwards
RH Priestley
Ricardo H Ramirez-Gonzalez
Rosemary Bayles
RP Singh
S Ali
S Anders
S Raffaele
S Wang
S Wright
SD Atkins
SN Naccache
Sophien Kamoun
TR Sharma
W Chen
W Zheng
X Didelot
XM Chen
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

BACKGROUND: Emerging and re-emerging pathogens imperil public health and global food security. Responding to these threats requires improved surveillance and diagnostic systems. Despite their potential, genomic tools have not been readily applied to emerging or re-emerging plant pathogens such as the wheat yellow (stripe) rust pathogen Puccinia striiformis f. sp. tritici (PST). This is due largely to the obligate parasitic nature of PST, as culturing PST isolates for DNA extraction remains slow and tedious. RESULTS: To counteract the limitations associated with culturing PST, we developed and applied a field pathogenomics approach by transcriptome sequencing infected wheat leaves collected from the field in 2013. This enabled us to rapidly gain insights into this emerging pathogen population. We found that the PST population across the United Kingdom (UK) underwent a major shift in recent years. Population genetic structure analyses revealed four distinct lineages that correlated to the phenotypic groups determined through traditional pathology-based virulence assays. Furthermore, the genetic diversity between members of a single population cluster for all 2013 PST field samples was much higher than that displayed by historical UK isolates, revealing a more diverse population of PST. CONCLUSIONS: Our field pathogenomics approach uncovered a dramatic shift in the PST population in the UK, likely due to a recent introduction of a diverse set of exotic PST lineages. The methodology described herein accelerates genetic analysis of pathogen populations and circumvents the difficulties associated with obligate plant pathogens. In principle, this strategy can be widely applied to a variety of plant pathogens

Crossref

Springer - Publisher Connector

PubMed Central

University of East Anglia digital repository

ProdInra

Noisy Splicing Drives mRNA Isoform Diversity in Human Cells

Author: A Ameur
A Mortazavi
AB Carvalho
AJ Matlin
Athma A. Pai
B Modrek
B Modrek
C Trapnell
C Trapnell
C Zhang
C Zhang
CI Castillo-Davis
D Baek
DA Benson
DL Black
E Kim
E Melamud
Emmanouil T. Dermitzakis
ET l Wang
F Hsu
GW Yeo
H Li
H Tilgner
HB Fraser
JC Marioni
JK Pickrell
JL Parmley
Jonathan K. Pritchard
Joseph K. Pickrell
JQ Wu
KD Pruitt
KD Pruitt
KF Au
KL Fox-Walsh
KS Pollard
LD Hurst
LD Hurst
M Guttman
M Hiller
M Lynch
M Lynch
M Lynch
M Roy
M Sultan
M Yassour
M Zavolan
N Spies
O Jaillon
P Kolasinska-Zwierz
Q Pan
R Andersson
R Sorek
RF Luco
S Schwartz
SB Montgomery
T Kwan
TJP Hubbard
TM Chern
WG Fairbrother
XHF Zhang
Y Barash
Y Dou
Y Yu
Yoav Gilad
Z Wang
Z Wang
Publication venue: Public Library of Science
Publication date: 01/01/2010

While the majority of multiexonic human genes show some evidence of alternative splicing, it is unclear what fraction of observed splice forms is functionally relevant. In this study, we examine the extent of alternative splicing in human cells using deep RNA sequencing and de novo identification of splice junctions. We demonstrate the existence of a large class of low abundance isoforms, encompassing approximately 150,000 previously unannotated splice junctions in our data. Newly-identified splice sites show little evidence of evolutionary conservation, suggesting that the majority are due to erroneous splice site choice. We show that sequence motifs involved in the recognition of exons are enriched in the vicinity of unconserved splice sites. We estimate that the average intron has a splicing error rate of approximately 0.7% and show that introns in highly expressed genes are spliced more accurately, likely due to their shorter length. These results implicate noisy splicing as an important property of genome evolution

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Comparative analysis of neural transcriptomes and functional implication of unannotated intronic expression

Author: A Mortazavi
A Smit
AI Saeed
AM Khalil
B Rhead
C Trapnell
C Trapnell
CA Thomas Jr
CC Cheung
D Klevebring
DH Geschwind
E Birney
ER Graf
ES Lander
F Hsu
F Polleux
GJ Faulkner
Gong Chen
H Jaaro-Peled
H Li
H van Bakel
Hong Ma
J Cheng
J Paysan
J Yu
JB Kim
JK Pickrell
JQ Wu
K Fejes-Toth
K Kuhlbrodt
K Okita
K Takahashi
K Takahashi
KA Kenyon
LJ Core
M Guttman
M Kanehisa
M Mangone
M Missler
M Safran
M Sun
MA Faghihi
MA Faghihi
MC Marchetto
N Dong
O Nenadic
P Bertone
P Carninci
P Kapranov
PP Amaral
PP Chan
RH Waterston
RJ Taft
S Griffiths-Jones
S Katayama
SF Altschul
T Aoi
TM Chern
U Nagalakshmi
UA Orom
X Han
Y Okazaki
Yaqiong Wang
Yazhou Sun
Yi Hu
Z Du
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: The transcriptome and its regulation bridge the genome and the phenome. Recent RNA-seq studies unveiled complex transcriptomes with previously unknown transcripts and functions. To investigate the characteristics of neural transcriptomes and possible functions of previously unknown transcripts, we analyzed and compared nine recent RNA-seq datasets corresponding to tissues/organs ranging from stem cell, embryonic brain cortex to adult whole brain. Results: We found that the neural and stem cell transcriptomes share global similarity in both gene and chromosomal expression, but are quite different from those of liver or muscle. We also found an unusually high level of unannotated expression in mouse embryonic brains. The intronic unannotated expression was found to be strongly associated with genes annotated for neurogenesis, axon guidance, negative regulation of transcription, and neural transmission. These functions are the hallmarks of the late embryonic stage cortex, and crucial for synaptogenesis and neural circuit formation. Conclusions: Our results revealed unique global and local landscapes of neural transcriptomes. They also suggested potential functional roles for previously unknown transcripts actively expressed in the developing brain cortex. Our findings provide new insights into potentially novel genes, gene functions and regulatory mechanisms in early brain development.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Population Differences in Transcript-Regulator Expression Quantitative Trait Loci

Author: A Mortazavi
A Schwartzman
A Siepel
A Subramanian
A Vinuela
Ahsan Huda
AL Price
B Langmead
BE Stranger
BE Stranger
BH McArdle
C Trapnell
C Trapnell
C Ye
D Lv
Daniel J. Kliebenstein
DC Guo
DJ Kliebenstein
DL Nicolae
DM Ruden
DO Kennedy
E Choy
E Grundberg
E Wingender
E Wingender
EE Schadt
EE Schadt
ER Gamazon
ER Gamazon
G Yvert
GA Heap
GJ Bates
J Coulombe-Huntington
J Ding
JC Schisler
JE Wigginton
JK Pickrell
JL McCauley
JL Min
JM Akey
JM Bhasin
Jun Lu
L Liu
L Liu
L Parts
L Raskin
LA Hindorff
Liwen Liu
M Holden
M Krull
M Morley
MA Zapala
MG Naylor
N Hubner
Oliver Hofmann
PC Bennetta
Pierre R. Bushel
Q Jiang
R Breitling
R Edgar
RA Irizarry
Ray McGovern
RE Tiedemann
RS Spielman
S Duan
S Kim
S Li
SB Montgomery
SK Sarkar
T Barrett
T Breslin
T Kwan
T Zuo
W Jin
W Zhang
W Zou
Winston Hide
Xihong Lin
Y Benjamini
Y Idaghdour
Y Xu
Publication venue: Public Library of Science
Publication date: 27/03/2012

Gene expression quantitative trait loci (eQTL) are useful for identifying single nucleotide polymorphisms (SNPs) associated with diseases. At times, a genetic variant may be associated with a master regulator involved in the manifestation of a disease. The downstream target genes of the master regulator are typically co-expressed and share biological function. Therefore, it is practical to screen for eQTLs by identifying SNPs associated with the targets of a transcript-regulator (TR). We used a multivariate regression with the gene expression of known targets of TRs and SNPs to identify TReQTLs in European (CEU) and African (YRI) HapMap populations. A nominal p-value of <1×10⁻⁶ revealed 234 SNPs in CEU and 154 in YRI as TReQTLs. These represent 36 independent (tag) SNPs in CEU and 39 in YRI affecting the downstream targets of 25 and 36 TRs respectively. At a false discovery rate (FDR) = 45%, one cis-acting tag SNP (within 1 kb of a gene) in each population was identified as a TReQTL. In CEU, the SNP (rs16858621) in Pcnxl2 was found to be associated with the genes regulated by CREM whereas in YRI, the SNP (rs16909324) was linked to the targets of miRNA hsa-miR-125a. To infer the pathways that regulate expression, we ranked TReQTLs by connectivity within the structure of biological process subtrees. One TReQTL SNP (rs3790904) in CEU maps to Lphn2 and is associated (nominal p-value = 8.1×10⁻⁷) with the targets of the X-linked breast cancer suppressor Foxp3. The structure of the biological process subtree and a gene interaction network of the TReQTL revealed that tumor necrosis factor, NF-kappaB and variants in G-protein coupled receptors signaling may play a central role as communicators in Foxp3 functional regulation. The potential pleiotropic effect of the Foxp3 TReQTLs was gleaned from integrating mRNA-Seq data and SNP-set enrichment into the analysis.

Public Library of Science (PLOS)

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

FigShare