Search CORE

628 research outputs found

A whole-genome assembly of the domestic cow, Bos taurus

Author: Delcher A. L.
Florea L.
Hanrahan F.
Kelley D. R.
Marçais G.
Pertea G.
Puiu D.
Roberts M.
Salzberg S. L.
Schatz M. C.
Sonstegard T. S.
Subramanian P.
Van Tassell C. P.
Yorke J. A.
Zimin A. V.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Background: The genome of the domestic cow, Bos taurus, was sequenced using a mixture of hierarchical and whole-genome shotgun sequencing methods. Results: We have assembled the 35 million sequence reads and applied a variety of assembly improvement techniques, creating an assembly of 2.86 billion base pairs that has multiple improvements over previous assemblies: it is more complete, covering more of the genome; thousands of gaps have been closed; many erroneous inversions, deletions, and translocations have been corrected; and thousands of single-nucleotide errors have been corrected. Our evaluation using independent metrics demonstrates that the resulting assembly is substantially more accurate and complete than alternative versions. Conclusions: By using independent mapping data and conserved synteny between the cow and human genomes, we were able to construct an assembly with excellent large-scale contiguity in which a large majority (approximately 91%) of the genome has been placed onto the 30 B. taurus chromosomes. We constructed a new cow-human synteny map that expands upon previous maps. We also identified for the first time a portion of the B. taurus Y chromosome. © 2009 Zimin et al.; licensee BioMed Central Ltd

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Springer - Publisher Connector

PubMed Central

Digital Repository at the University of Maryland

Identification of stable reference genes for quantitative PCR in koalas

Author: A Kappel
A Radonic
B Artegiani
CA Waugh
CL Andersen
D Kim
DG Ginzinger
F Almeida-Oliveira
F Bartz
G Bamias
GS Meers
GS Simmons
IE Maher
IE Maher
J Vandesompele
J Woinarski
J Yperman
JJ Hanger
K Ahn
KC Thomas
KM Morris
M Kubista
M Pertea
MA Valasek
MW Pfaffl
N Silver
OT Ong
P Brym
PD Lee
PJ Canfield
QL Zhang
R Tarlinton
R Tarlinton
RK Das
S Bages
SA Bustin
SA Bustin
T Burmeister
T Nolan
V Gonzalez-Astudillo
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

To better understand host and immune response to diseases, gene expression studies require identification of reference genes with stable expression for accurate normalisation. This study describes the identification and testing of reference genes with stable expression profiles in koala lymph node tissues across two genetically distinct koala populations. From the 25 most stable genes identified in transcriptome analysis, 11 genes were selected for verification using reverse transcription quantitative PCR, in addition to the commonly used ACTB and GAPDH genes. The expression data were analysed using stable genes statistical software - geNorm, BestKeeper, NormFinder, the comparative ΔCt method and RefFinder. All 13 genes showed relative stability in expression in koala lymph node tissues, however Tmem97 and Hmg20a were identified as the most stable genes across the two koala populations

Repository@Nottingham

Adelaide Research & Scholarship

ResearchOnline at James Cook University

Directory of Open Access Journals

UQ eSpace (University of Queensland)

De Novo Transcriptome Assembly and Comparative Analysis Elucidate Complicated Mechanism Regulating Astragalus chrysochlorus Response to Selenium Stimuli

Author: A Conesa
A Mortazavi
A Shrift
B Winkel-Shirley
Baohong Zhang
CY Hung
D Van Hoewyk
E Bedir
ER Alford
G Pertea
GR Valmonte
I Calis
IJ Pickering
J Kang
JJ Cappa
JL Freeman
K Kabała
L McHale
M Machado
M Schiavon
M Sura-de Jong
MG Grabherr
N Turgut-Kara
Neslihan Turgut-Kara
O Cakir
SR Strickler
X Li
XB Liu
Xianlong Zhang
Y Okushima
Ö Çakır
Özgür Çakır
Ş Arı
Şule Arı
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2015
Field of study

Astragalus species are medicinal plants that are used in the world for years. Some Astragalus species are known for selenium accumulation and tolerance and one of them is Astragalus chrysochlorus, a secondary selenium accumulator. In this study, we employed Illumina deep sequencing technology for the first time to de novo assemble A. chrysochlorus transcriptome and identify the differentially expressed genes after selenate treatment. Totally, 59,656 unigenes were annotated with different databases and 53,960 unigenes were detected in NR database. Transcriptome in A. chrysochlorus is closer to Glycine max than other plant species with 43,1 percentage of similarity. Annotated unigenes were also used for gene ontology enrichment and pathway enrichment analysis. The most significant genes and pathways were ABC transporters, plant pathogen interaction, biosynthesis of secondary metabolites and carbohydrate metabolism. Our results will help to enlighten the selenium accumulation and tolerance mechanisms, respectively in plants

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

İstanbul Üniversitesi Açık Erişim Sistemi

The ScholarShip (East Carolina University)

The Francis Crick Institute

Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database

Author: B Hendrix
B Zhang
B Zhang
Baohong Zhang
BH Zhang
BH Zhang
BH Zhang
C An
CE Pearson
Christos A. Ouzounis
DL Nicolae
DP Bartel
F Li
F Xie
FL Xie
Fuliang Xie
G Pertea
Guiling Sun
HC Wang
HS Guo
J Hattori
J Jurka
J Zhang
JA Udall
John W. Stiller
K Schneider
LS Venne
M Ashburner
M Bozhko
M Kanehisa
M Krawczak
M Seki
MJ Aukerman
MP Sanchez de la Hoz
O Voinnet
P Brodersen
R Schwab
R Sunkar
RK Varshney
S Griffiths-Jones
S Wang
S Zeng
SF Altschul
SK Kantartzi
W Rychlik
X Huang
YA Chen
YH Park
ZJ Chen
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011
Field of study

A total of 28,432 unique contigs (25,371 in consensus contigs and 3,061 as singletons) were assembled from all 268,786 cotton ESTs currently available. Several in silico approaches [comparative genomics, Blast, Gene Ontology (GO) analysis, and pathway enrichment by Kyoto Encyclopedia of Genes and Genomes (KEGG)] were employed to investigate global functions of the cotton transcriptome. Cotton EST contigs were clustered into 5,461 groups with a maximum cluster size of 196 members. A total of 27,956 indel mutants and 149,616 single nucleotide polymorphisms (SNPs) were identified from consensus contigs. Interestingly, many contigs with significantly high frequencies of indels or SNPs encode transcription factors and protein kinases. In a comparison with six model plant species, cotton ESTs show the highest overall similarity to grape. A total of 87 cotton miRNAs were identified; 59 of these have not been reported previously from experimental or bioinformatics investigations. We also predicted 3,260 genes as miRNAs targets, which are associated with multiple biological functions, including stress response, metabolism, hormone signal transduction and fiber development. We identified 151 and 4,214 EST-simple sequence repeats (SSRs) from contigs and raw ESTs respectively. To make these data widely available, and to facilitate access to EST-related genetic information, we integrated our results into a comprehensive, fully downloadable web-based cotton EST database (www.leonxie.com)

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute

The ScholarShip (East Carolina University)

CleanEST: a database of cleansed EST libraries

Author: B. Lee
Benson
Boguski
Cui
Ewing
G. Shin
Kelso
Mao
Negre
Pertea
Pruitt
Seluja
Smith
Sorek
Wang
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

The EST division of GenBank, dbEST, is widely used in many applications such as gene discovery and verification of exon–intron structure. However, the use of EST sequences in the dbEST libraries is often hampered by inconsistent terminology used to describe the library sources and by the presence of contaminated sequences. Here, we describe CleanEST, a novel database server that classified dbEST libraries and removes contaminants. We classified all dbEST libraries according to species and sequencing center. In addition, we further classified human EST libraries by anatomical and pathological systems according to eVOC ontologies. For each dbEST library, we provide two different cleansed sequences: ‘pre-cleansed’ and ‘user-cleansed’. To generate pre-cleansed sequences, we cleansed sequences in dbEST by alignment of EST sequences against well-known contamination sources: UniVec, Escherichia coli, mitochondria and chloroplast (for plant). To provide user-cleansed sequences, we built an automatic user-cleansing pipeline, in which sequences of a user-selected library are cleansed on-the-fly according to user-selected options. The server is available at http://cleanest.kobic.re.kr/ and the database is updated monthly

Crossref

KRIBB Open Access Repository

PubMed Central

Convergent recombination suppression suggests role of sexual selection in guppy sex chromosome formation.

Author: A Kortrschal
A Lindholm
A Loytynoja
A McKenna
A Rimmer
AE Houde
AE Houde
AE Wright
AE Wright
AE Wright
AE Wright
AP Lisachov
AR Quinlan
B Charlesworth
B Langmead
B Sandkam
B Vicoso
B Vicoso
B Vicoso
B Vicoso
BA Fraser
BJA Pollux
C Dufresnes
D Bachtrog
D Bachtrog
D Bachtrog
D Kim
DJ Kemp
DR Kelley
E Axelsson
E Eden
E Eden
E Paradis
G Lunter
H Li
H Li
H Skaletsky
H Skaletsky
I Nanda
J Haldane
J Hough
J Kitano
JA Endler
JA Endler
JA Endler
JA Endler
JE Mank
JE Mank
JE Mank
JE Mank
JE Mank
JP Masly
JRW Russell
K Reichwald
KP Arunkumar
M Lohse
M Pertea
M Stock
M White
M Winge
MD Robinson
MD Robinson
N Tripathi
P Flicek
PW Harrison
Q Zhou
R Bergero
R Chikhi
RA Fisher
RB Luo
RH Devlin
RP Meisel
RP Meisel
RW Meredith
S Anders
SF Altschul
SP Gordon
SP Gordon
T Jombart
T Kamiya
T Lenormand
W Traut
WR Rice
ZH Yang
ZY Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Sex chromosomes evolve once recombination is halted between a homologous pair of chromosomes. The dominant model of sex chromosome evolution posits that recombination is suppressed between emerging X and Y chromosomes in order to resolve sexual conflict. Here we test this model using whole genome and transcriptome resequencing data in the guppy, a model for sexual selection with many Y-linked colour traits. We show that although the nascent Y chromosome encompasses nearly half of the linkage group, there has been no perceptible degradation of Y chromosome gene content or activity. Using replicate wild populations with differing levels of sexually antagonistic selection for colour, we also show that sexual selection leads to greater expansion of the non-recombining region and increased Y chromosome divergence. These results provide empirical support for longstanding models of sex chromosome catalysis, and suggest an important role for sexual selection and sexual conflict in genome evolution

ISTA Research Explorer (Institute of Science and Technology Austria)

Crossref

UCL Discovery

PubMed Central

IST Austria: PubRep (Institute of Science and Technology)

White Rose Research Online

Swepub

The TIGR Gene Indices: clustering and assembling EST and known genes and integration with eukaryotic genomes

Author: Antonescu V.
Chan A.
Cheung F.
Karamycheva S.
Lee Y.
Pertea G.
Quackenbush J.
Sultana R.
Sunkara S.
Tsai J.
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

Although the list of completed genome sequencing projects has expanded rapidly, sequencing and analysis of expressed sequence tags (ESTs) remain a primary tool for discovery of novel genes in many eukaryotes and a key element in genome annotation. The TIGR Gene Indices (http://www.tigr.org/tdb/tgi) are a collection of 77 species-specific databases that use a highly refined protocol to analyze gene and EST sequences in an attempt to identify and characterize expressed transcripts and to present them on the Web in a user-friendly, consistent fashion. A Gene Index database is constructed for each selected organism by first clustering, then assembling EST and annotated cDNA and gene sequences from GenBank. This process produces a set of unique, high-fidelity virtual transcripts, or tentative consensus (TC) sequences. The TC sequences can be used to provide putative genes with functional annotation, to link the transcripts to genetic and physical maps, to provide links to orthologous and paralogous genes, and as a resource for comparative and functional genomic analysis

Crossref

PubMed Central

The first whole genome and transcriptome of the cinereous vulture reveals adaptation in the gastric and immune defense systems and possible convergent evolution between the Old and New World vultures

Author: A Goncalves
A Löytynoja
A Mortazavi
Alvin Chon
Andrea Manica
B Eliotout
B Li
B Li
BJ Haas
CG Sibley
D Kim
DD Pollock
DL Ogada
DW Huang
E Quevillon
E Trompouki
E Videvall
ED Jarvis
F Ikeda
G Marçais
G Pertea
G Takaesu
G Zhang
GD Ruxton
GE Duke
GJ Krejs
H Li
H Li
H Li
Hak-Min Kim
HS Yim
HY Song
HyeJin Lee
Hyunho Kim
IA Adzhubei
International Chicken Genome Sequencing Consortium
J Ferguson-Lees
J Hoyo del
J Parra
J Puente de la
J Wang
J Xu
J Ye
J Zhang
JeHoon Jun
Jeongheui Lim
Jeremy Edwards
Jessica A. Weber
JG Teodoro
JM Lastra de la
Jong Bhak
Junsu Ko
K Arnold
K Tamura
Kyudong Han
M Kanehisa
M Loiarro
M Roggenbuck
M Tariq
M Wink
MD Robinson
MD Robinson
MG Grabherr
NIBR (National Institute of Biological Resources)
Oksung Chung
P Vijayakumar
R Medzhitov
R Nielsen
RA Dalloul
RA Goldstein
S Anders
S Sharma
Seondeok Jin
SF Altschul
Stephen J. O’Brien
Sungwoong Jho
T Schwede
T Yamazaki
W He
W Miller
WC Warren
Woon Kee Paek
X Ma
X Zhan
Y Choi
Y Huang
Y Moriya
YS Cho
Yun Sung Cho
Z Yang
Z Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Background: The cinereous vulture, Aegypius monachus, is the largest bird of prey and plays a key role in the ecosystem by removing carcasses, thus preventing the spread of diseases. Its feeding habits force it to cope with constant exposure to pathogens, making this species an interesting target for discovering functionally selected genetic variants. Furthermore, the presence of two independently evolved vulture groups, Old World and New World vultures, provides a natural experiment in which to investigate convergent evolution due to obligate scavenging. Results: We sequenced the genome of a cinereous vulture, and mapped it to the bald eagle reference genome, a close relative with a divergence time of 18 million years. By comparing the cinereous vulture to other avian genomes, we find positively selected genetic variations in this species associated with respiration, likely linked to their ability of immune defense responses and gastric acid secretion, consistent with their ability to digest carcasses. Comparisons between the Old World and New World vulture groups suggest convergent gene evolution. We assemble the cinereous vulture blood transcriptome from a second individual, and annotate genes. Finally, we infer the demographic history of the cinereous vulture which shows marked fluctuations in effective population size during the late Pleistocene. Conclusions: We present the first genome and transcriptome analyses of the cinereous vulture compared to other avian genomes and transcriptomes, revealing genetic signatures of dietary and environmental adaptations accompanied by possible convergent evolution between the Old World and New World vulturesopen

Crossref

Springer - Publisher Connector

PubMed Central

ScholarWorks@UNIST

NSU Works

BaRTv1.0:an improved barley reference transcript dataset to determine accurate changes in the barley transcriptome using RNA-seq

Author: A Ashoub
A Busch
A Dobin
A Dobin
A Janiak
Abdellah Barakate
AM Bolger
AM Mastrangelo
AS Reddy
AT Pham
B Panahi
BA Veeneman
BJ Haas
C Soneson
CG Simpson
Claire Halpin
Claus-Dieter Mayer
CPG Calixto
CPG Calixto
CPG Calixto
Craig G. Simpson
D Staiger
D Szakonyi
G Capovilla
G Guo
Gordon Stephen
GP Alamancos
H Liu
IK Dawson
International Barley Sequencing Consortium
J Bazin
J Russell
Jason Kam
Jenny Morris
John Fuller
John W. S. Brown
JWS Brown
K Mrízová
K Shirasu
KE Hayer
Linda Milne
LS Dahleen
M Kalyna
M Kintlová
M Mascher
M Pertea
M. Cristina Casao
Micha Bayer
Miriam Schreiber
Monika Zwirek
NL Bray
P Ren
Paulo Rapazote-Flores
Pete E. Hedley
PG Engström
Q Zhang
Q Zhang
R Patro
R Zhang
R Zhang
RF Carvalho
Robbie Waugh
RR Sokal
Runxuan Zhang
S Chamala
S Filichkin
S Schindler
S. Ouyang
Sarah M. McKim
SF Altschul
SH Kim
SR Thatcher
T Laloum
T Matsumoto
TD Wu
TW Nilsen
Wenbin Guo
X Gan
XN Zhang
Y Lee
Y Marquez
Y Shi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/12/2019
Field of study

Background: The time required to analyse RNA-seq data varies considerably, due to discrete steps for computational assembly, quantification of gene expression and splicing analysis. Recent fast non-alignment tools such as Kallisto and Salmon overcome these problems, but these tools require a high quality, comprehensive reference transcripts dataset (RTD), which are rarely available in plants.Results: A high-quality, non-redundant barley gene RTD and database (Barley Reference Transcripts - BaRTv1.0) has been generated. BaRTv1.0, was constructed from a range of tissues, cultivars and abiotic treatments and transcripts assembled and aligned to the barley cv. Morex reference genome (Mascher et al. Nature; 544: 427-433, 2017). Full-length cDNAs from the barley variety Haruna nijo (Matsumoto et al. Plant Physiol; 156: 20-28, 2011) determined transcript coverage, and high-resolution RT-PCR validated alternatively spliced (AS) transcripts of 86 genes in five different organs and tissue. These methods were used as benchmarks to select an optimal barley RTD. BaRTv1.0-Quantification of Alternatively Spliced Isoforms (QUASI) was also made to overcome inaccurate quantification due to variation in 5' and 3' UTR ends of transcripts. BaRTv1.0-QUASI was used for accurate transcript quantification of RNA-seq data of five barley organs/tissues. This analysis identified 20,972 significant differentially expressed genes, 2791 differentially alternatively spliced genes and 2768 transcripts with differential transcript usage.Conclusion: A high confidence barley reference transcript dataset consisting of 60,444 genes with 177,240 transcripts has been generated. Compared to current barley transcripts, BaRTv1.0 transcripts are generally longer, have less fragmentation and improved gene models that are well supported by splice junction reads. Precise transcript quantification using BaRTv1.0 allows routine analysis of gene expression and AS.</p

Crossref

Discovery Research Portal

In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae

Author: A Fleischmann
A Navrátilová
A Zuccolo
A Zuccolo
Andrea Koblížková
Andreas Houben
B Piegu
C Llorens
CA Thomas
D Aird
E Kejnovský
F Otto
F Ronquist
G García
G Pertea
H Schaefer
H Weiss-Schneeweiss
HJT Pagan
Ilia J. Leitch
Iva Fuková
J Doležel
J Doležel
J Doležel
J Doležel
J Greilhuber
J Ištvánek
J Macas
J Macas
J Macas
J Macas
J Macas
J Macas
J Pellicer
J Pellicer
JA Ågren
JA Ågren
Jana Čížková
Jaroslav Doležel
Jaume Pellicer
Jiří Macas
JL Bennetzen
JPM Camacho
JS Hawkins
JS Hawkins
KM Devos
KR Oliver
KR Oliver
Laura J. Kelly
LD Ingham
LJ Kelly
LJ Kelly
LJ Kelly
M El Baidouri
M Kidwell
M Lynch
M Nouzová
M Piednoël
MA Lysák
MC Estep
MI Tenaillon
P Neumann
P Neumann
P Novák
P Novák
P Novák
P Smýkal
P Trávníček
Pavel Neumann
Petr Novák
RB Flavell
RJ Britten
S Klemme
S Linquist
S Lockton
S Renny-Byfield
SF Altschul
T Hall
T Wicker
TP Michael
TR Gregory
V Hemleben
V Steinbauerová
V Steinbauerová
Z Cai
Z Gong
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 25/11/2015
Field of study

The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes

Public Library of Science (PLOS)

Crossref

British Library (BL) Shared Research Repository

Directory of Open Access Journals

PubMed Central

Queen Mary Research Online

The Francis Crick Institute