Search CORE

90 research outputs found

Directed sequencing and annotation of three Dicentrarchus labrax L. chromosomes by applying Sanger- and pyrosequencing technologies on pooled DNA of comparatively mapped BAC clones

Author: Beck Alfred
Kodira Chinnappa
Kuhl Heiner
Reinhardt Richard
Timmermann Bernd
Tine Mbaye
Publication venue: Elsevier Inc.
Publication date: 01/09/2011
Field of study

AbstractDicentrarchus labrax is one of the major marine aquaculture species in the European Union. In this study, we have developed a directed-sequencing strategy to sequence three sea bass chromosomes and compared results with other teleosts.Three BAC DNA pools were created from sea bass BAC clones that mapped to stickleback chromosomes/groups V, XVII and XXI. The pools were sequenced to 17–39x coverage by pyrosequencing. Data assembly was supported by Sanger reads and mate pair data and resulted in superscaffolds of 13.2Mb, 17.5Mb and 13.7Mb respectively. Annotation features of the superscaffolds include 1477 genes. We analyzed size change of exon, intron and intergenic sequence between teleost species and deduced a simple model for the evolution of genome composition in teleost lineage.Combination of second generation sequencing technologies, Sanger sequencing and genome partitioning strategies allows “high-quality draft assemblies” of chromosome-sized superscaffolds, which are crucial for the prediction and annotation of complete genes

Elsevier - Publisher Connector

MPG.PuRe

Leveraging Spatial Variation in Tumor Purity for Improved Somatic Variant Calling of Archival Tumor Only Samples

Author: Chinnappa Kodira
Chinnappa Kodira
Daniel Enriquez
Erica E. Tassone
James Newell
Jonathan Adkins
Michael E. Berens
Nhan L. Tran
Nicole C. Hank
Rebecca F. Halperin
Ronald Korn
Ronald Korn
Sara A. Byron
Seungchan Kim
Sidharth Kulkarni
Winnie S. Liang
Publication venue: 'Frontiers Media SA'
Publication date: 01/03/2019
Field of study

Archival tumor samples represent a rich resource of annotated specimens for translational genomics research. However, standard variant calling approaches require a matched normal sample from the same individual, which is often not available in the retrospective setting, making it difficult to distinguish between true somatic variants and individual-specific germline variants. Archival sections often contain adjacent normal tissue, but this tissue can include infiltrating tumor cells. As existing comparative somatic variant callers are designed to exclude variants present in the normal sample, a novel approach is required to leverage adjacent normal tissue with infiltrating tumor cells for somatic variant calling. Here we present lumosVar 2.0, a software package designed to jointly analyze multiple samples from the same patient, built upon our previous single sample tumor only variant caller lumosVar 1.0. The approach assumes that the allelic fraction of somatic variants and germline variants follow different patterns as tumor content and copy number state change. lumosVar 2.0 estimates allele specific copy number and tumor sample fractions from the data, and uses a to model to determine expected allelic fractions for somatic and germline variants and to classify variants accordingly. To evaluate the utility of lumosVar 2.0 to jointly call somatic variants with tumor and adjacent normal samples, we used a glioblastoma dataset with matched high and low tumor content and germline whole exome sequencing data (for true somatic variants) available for each patient. Both sensitivity and positive predictive value were improved when analyzing the high tumor and low tumor samples jointly compared to analyzing the samples individually or in-silico pooling of the two samples. Finally, we applied this approach to a set of breast and prostate archival tumor samples for which tumor blocks containing adjacent normal tissue were available for sequencing. Joint analysis using lumosVar 2.0 detected several variants, including known cancer hotspot mutations that were not detected by standard somatic variant calling tools using the adjacent tissue as presumed normal reference. Together, these results demonstrate the utility of leveraging paired tissue samples to improve somatic variant calling when a constitutional sample is not available

Directory of Open Access Journals

Recommended from our members

Towards a Library of Standard Operating Procedures (SOPs) for (meta)genomic annotation

Author: Angiuoli Samuel V.
Cochrane Guy
Field Dawn
Garrity George
Gussman Aaron
Klimke William
Kodira Chinnappa D.
Kyrpides Nikos
Kyrpides Nikos
Madupu Ramana
Markowitz Victor
Tatusova Tatiana
Thomson Nick
White Owen
Publication venue: Lawrence Berkeley National Laboratory
Publication date: 01/04/2008
Field of study

Genome annotations describe the features of genomes and accompany sequences in genome databases. The methodologies used to generate genome annotation are diverse and typically vary amongst groups. Descriptions of the annotation procedure are helpful in interpreting genome annotation data. Standard Operating Procedures (SOPs) for genome annotation describe the processes that generate genome annotations. Some groups are currently documenting procedures but standards are lacking for structure and content of annotation SOPs. In addition, there is no central repository to store and disseminate procedures and protocols for genome annotation. We highlight the importance of SOPs for genome annotation and endorse a central online repository of SOPs

UNT Digital Library

The Cassava Genome: Current Progress, Future Directions

Author: AA Raji
AC Roa
AP Chan
B Boher
BL Patil
Brian Desany
Chinnappa Kodira
Claude Fauquet
D Edwards
Daniel S. Rokhsar
F Awoleye
Fausto Rodriguez
GA Tuskan
H Ceballos
HM Lam
J Schmutz
Joseph Tohme
K Reilly
M Balat
Mohammed Mohiuddin
N Gill
NL Quinn
NM Springer
Pablo D. Rabinowicz
PM Schmitz
Pradeep Reddy Marri
RJ Elshire
RJ Hillocks
S Sraphet
S Tangphatsornruang
Simon Prochnik
Steve Rounsley
Timothy Harkins
VV Kapitonov
Publication venue: Springer-Verlag
Publication date: 01/01/2012
Field of study

The starchy swollen roots of cassava provide an essential food source for nearly a billion people, as well as possibilities for bioenergy, yet improvements to nutritional content and resistance to threatening diseases are currently impeded. A 454-based whole genome shotgun sequence has been assembled, which covers 69% of the predicted genome size and 96% of protein-coding gene space, with genome finishing underway. The predicted 30,666 genes and 3,485 alternate splice forms are supported by 1.4 M expressed sequence tags (ESTs). Maps based on simple sequence repeat (SSR)-, and EST-derived single nucleotide polymorphisms (SNPs) already exist. Thanks to the genome sequence, a high-density linkage map is currently being developed from a cross between two diverse cassava cultivars: one susceptible to cassava brown streak disease; the other resistant. An efficient genotyping-by-sequencing (GBS) approach is being developed to catalog SNPs both within the mapping population and among diverse African farmer-preferred varieties of cassava. These resources will accelerate marker-assisted breeding programs, allowing improvements in disease-resistance and nutrition, and will help us understand the genetic basis for disease resistance

Crossref

Springer - Publisher Connector

PubMed Central

CGSpace

Genome analysis of the necrotrophic fungal pathogens Sclerotinia sclerotiorum and Botrytis cinerea

Author: Amselem Joelle
Andrew Marion
Anthouard Véronique
Beever Ross E.
Beffa Rolland
Benito Ernesto P.
Benoit Isabelle
Bouzid Ourdia
Brault Baptiste
Chen Zehua
Choquer Mathias
Collémare Jérome
Cotton Pascale
Couloux Arnaud
Coutinho Pedro M.
Cuomo Christina A.
Da Silva Corinne
Danchin Etienne G.
de Vries Ronald P.
Dickman Marty
Dyer Paul S.
Fillinger Sabine
Fournier Elisabeth
Gautier Angélique
Giraud Corinne
Giraud Tatiana
Gonzalez Celedonio
Gout Lilian
Grossetete Sandrine
Güldener Ulrich
Hahn Matthias
Henrissat Bernard
Howlett Barbara J.
Kodira Chinnappa
Kohn Linda
Kretschmer Matthias
Lapalu Nicolas
Lappartient Anne
Lebrun Marc-Henri
Leroch Michaela
Levis Caroline
Mauceli Evan
Neuvéglise Cécile
Oeser Birgitt
Pearson Matthew
Plummer Kim M.
Poulain Julie
Poussereau Nathalie
Pradier Jean-Marc
Quesneville Hadi
Quévillon Emmanuel
Rascle Christine
Richardson Paul M.
Rollins Jeffrey A.
Schumacher Julia
Sexton Adrienne
Sharon Amir
Silva Evelyn
Simon Adeline
Sirven Catherine
Soanes Darren M.
Ségurens Béatrice
Talbot Nicholas J.
Templeton Matt
ten Have Arjen
Tudzynski Bettina
Tudzynski Paul
van Kan Jan A. L.
Viaud Muriel
Wincker Patrick
Yandava Chandri
Yarden Oded
Zeng Qiandong
Publication venue
Publication date: 01/01/2011
Field of study

Sclerotinia sclerotiorum and Botrytis cinerea are closely related necrotrophic plant pathogenic fungi notable for their wide host ranges and environmental persistence. These attributes have made these species models for understanding the complexity of necrotrophic, broad host-range pathogenicity. Despite their similarities, the two species differ in mating behaviour and the ability to produce asexual spores. We have sequenced the genomes of one strain of S. sclerotiorum and two strains of B. cinerea. The comparative analysis of these genomes relative to one another and to other sequenced fungal genomes is provided here. Their 38–39 Mb genomes include 11,860–14,270 predicted genes, which share 83% amino acid identity on average between the two species. We have mapped the S. sclerotiorum assembly to 16 chromosomes and found large-scale co-linearity with the B. cinerea genomes. Seven percent of the S. sclerotiorum genome comprises transposable elements compared t

HAL AMU

Directory of Open Access Journals

HAL Descartes

Wageningen University & Research Publications

University of Melbourne Institutional Repository

Genomic Analysis of the Basal Lineage Fungus Rhizopus oryzae Reveals a Whole-Genome Duplication

Author: Abe Ayumi
Birren Bruce W.
Burger Gertraud
Butler Margi
Calvo Sarah E.
Corrochano Luis M.
Cuomo Christina A.
Elias Marek
Engels Reinhard
Fu Jianmin
Galagan James
Grabherr Manfred G.
Hansberg Wilhelm
Ibrahim Ashraf S.
Idnurm Alexander
Kim Jung-Mi
Kodira Chinnappa D.
Koehrsen Michael J.
Lang B. Franz
Liu Bo
Ma Li-Jun
Miranda-Saavedra Diego
O'Leary Sinead
Ortiz-Castellanos Lucila
Poulter Russell
Rodriguez-Romero Julio
Ruiz-Herrera José
Shen Yao-Qing
Skory Christopher
Sone Teruo
Wickes Brian L.
Zeng Qiandong
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

Rhizopus oryzae is the primary cause of mucormycosis, an emerging, life-threatening infection characterized by rapid angioinvasive growth with an overall mortality rate that exceeds 50%. As a representative of the paraphyletic basal group of the fungal kingdom called “zygomycetes,” R. oryzae is also used as a model to study fungal evolution. Here we report the genome sequence of R. oryzae strain 99–880, isolated from a fatal case of mucormycosis. The highly repetitive 45.3 Mb genome assembly contains abundant transposable elements (TEs), comprising approximately 20% of the genome. We predicted 13,895 protein-coding genes not overlapping TEs, many of which are paralogous gene pairs. The order and genomic arrangement of the duplicated gene pairs and their common phylogenetic origin provide evidence for an ancestral whole-genome duplication (WGD) event. The WGD resulted in the duplication of nearly all subunits of the protein complexes associated with respiratory electron transport chains, the V-ATPase, and the ubiquitin–proteasome systems. The WGD, together with recent gene duplications, resulted in the expansion of multiple gene families related to cell growth and signal transduction, as well as secreted aspartic protease and subtilase protein families, which are known fungal virulence factors. The duplication of the ergosterol biosynthetic pathway, especially the major azole target, lanosterol 14α-demethylase (ERG11), could contribute to the variable responses of R. oryzae to different azole drugs, including voriconazole and posaconazole. Expanded families of cell-wall synthesis enzymes, essential for fungal cell integrity but absent in mammalian hosts, reveal potential targets for novel and R. oryzae-specific diagnostic and therapeutic treatments

CiteSeerX

ScholarWorks@UMass Amherst

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

idUS. Depósito de Investigación Universidad de Sevilla

Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication

Author: Alexandre Lomsadze
Amparo Herrero Ortega
Andrea Zuccolo
Arnaud Couloux
Brian Desany
Chinnappa Kodira
Chunxian Chen
Daniel Ram\uf3n
Daniel Rokhsar
DEL FABBRO Cristian
Dominique Brunel
Federica Cattonaro
Florent Murat
Fran\ue7ois Luro
Francis Quetier
Francisco R. Tadeo
Frederick Gmitter
G. Albert Wu
Giuseppe Reforgiato
Jane Grimwood
Jarrod Chapman
Javier Terol
Jeremy Schmutz
Jerome Salse
Jerry Jenkins
Juan V. Mu\uf1oz Sanz
Juli\ue1n P\ue9rez P\ue9rez
Juliana Freitas Ast\ufaa
Julie Poulain
Kamel Jabbari
Karin Fredrikson
Karine Labadie
Leandro H. Estornell
Luis Navarro
Manuel Ruiz
Manuel Talon
Marco Aur\ue9lio Takita
Marcos Antonio Machado
Mark Borodovsky
Mikeal Roose
Mohammed Mohiuddin
Morgante Michele
Olivier Jaillon
Pablo Aleza
Patrick Ollitrault
Patrick Wincker
Paul Burns
Sara Pinosio
Simon Prochnik
Simone Scalabrin
Tim Harkins
Uffe Hellsten
Victoria Ibanez
William G. Farmerie
Xavier Perrier
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes-a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-orange genomes-and show that cultivated types derive from two progenitor species. Although cultivated pummelos represent selections from one progenitor species, Citrus maxima, cultivated mandarins are introgressions of C. maxima into the ancestral mandarin species Citrus reticulata. The most widely cultivated citrus, sweet orange, is the offspring of previously admixed individuals, but sour orange is an F1 hybrid of pure C. maxima and C. reticulata parents, thus implying that wild mandarins were part of the early breeding germplasm. A Chinese wild 'mandarin' diverges substantially from C. reticulata, thus suggesting the possibility of other unrecognized wild citrus species. Understanding citrus phylogeny through genome analysis clarifies taxonomic relationships and facilitates sequence-directed genetic improvement

Archivio istituzionale della ricerca - Università degli Studi di Udine

Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)

Author: A Dijkhuizen
A Saveliev
AG Jones
C Jeffrey
C Schlötterer
C Schlötterer
Chinnappa D Kodira
CR Primmer
D Tautz
DE MacHugh
DJ Somers
Douglas A Senalik
EA Sia
FC Serquen
G Bates
G Toth
GA Tuskan
H Ellegren
H Ellegren
H Ellegren
H Innan
H Wang
HJ Price
IRGSP (International Rice Genome Sequencing Project)
J Quackenbush
JA Eisen
JC Garza
JE Bowers
JH Mun
JH Peng
JL Weber
JM Bradeen
K Arumuganathan
L Cardle
L Santi
LD Knerr
Luming Yang
M Morgante
M Morgante
M Wierdl
MD Robbins
MG Murray
N Huo
N Watcharawongpaiboon
NN Fitzsimmons
O Jaillon
P Sebastian
Pablo F Cavagnaro
Philipp W Simon
Q Kong
R Gur-Arie
R Lister
R Wooster
RK Varshney
S Huang
S Leclercq
S Rozen
S Subramanian
S Temnykh
Sanwen Huang
SG Guo
SP Zhang
SR Kruglyak
SR McCouch
SS Renner
T Horejsi
T Thiel
Timothy T Harkins
TW Whitaker
V Meglic
W Powell
WC Kennard
Y Danin-Poleg
Y Kashi
Y Ren
Y Weng
Y Weng
Y Weng
Y Weng
YC Li
YH Han
YH Park
Yiqun Weng
Z Li
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Cucumber, <it>Cucumis sativus </it>L. is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequences, which are frequently favored as genetic markers due to their high level of polymorphism and codominant inheritance. Data from previously characterized genomes has shown that these repeats vary in frequency, motif sequence, and genomic location across taxa. During the last year, the genomes of two cucumber genotypes were sequenced including the Chinese fresh market type inbred line '9930' and the North American pickling type inbred line 'Gy14'. These sequences provide a powerful tool for developing markers in a large scale. In this study, we surveyed and characterized the distribution and frequency of perfect microsatellites in 203 Mbp assembled Gy14 DNA sequences, representing 55% of its nuclear genome, and in cucumber EST sequences. Similar analyses were performed in genomic and EST data from seven other plant species, and the results were compared with those of cucumber. Results A total of 112,073 perfect repeats were detected in the Gy14 cucumber genome sequence, accounting for 0.9% of the assembled Gy14 genome, with an overall density of 551.9 SSRs/Mbp. While tetranucleotides were the most frequent microsatellites in genomic DNA sequence, dinucleotide repeats, which had more repeat units than any other SSR type, had the highest cumulative sequence length. Coding regions (ESTs) of the cucumber genome had fewer microsatellites compared to its genomic sequence, with trinucleotides predominating in EST sequences. AAG was the most frequent repeat in cucumber ESTs. Overall, AT-rich motifs prevailed in both genomic and EST data. Compared to the other species examined, cucumber genomic sequence had the highest density of SSRs (although comparable to the density of poplar, grapevine and rice), and was richest in AT dinucleotides. Using an electronic PCR strategy, we investigated the polymorphism between 9930 and Gy14 at 1,006 SSR loci, and found unexpectedly high degree of polymorphism (48.3%) between the two genotypes. The level of polymorphism seems to be positively associated with the number of repeat units in the microsatellite. The <it>in silico </it>PCR results were validated empirically in 660 of the 1,006 SSR loci. In addition, primer sequences for more than 83,000 newly-discovered cucumber microsatellites, and their exact positions in the Gy14 genome assembly were made publicly available. Conclusions The cucumber genome is rich in microsatellites; AT and AAG are the most abundant repeat motifs in genomic and EST sequences of cucumber, respectively. Considering all the species investigated, some commonalities were noted, especially within the monocot and dicot groups, although the distribution of motifs and the frequency of certain repeats were characteristic of the species examined. The large number of SSR markers developed from this study should be a significant contribution to the cucurbit research community.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central