Search CORE

Leveraging Spatial Variation in Tumor Purity for Improved Somatic Variant Calling of Archival Tumor Only Samples

Author: Chinnappa Kodira
Chinnappa Kodira
Daniel Enriquez
Erica E. Tassone
James Newell
Jonathan Adkins
Michael E. Berens
Nhan L. Tran
Nicole C. Hank
Rebecca F. Halperin
Ronald Korn
Ronald Korn
Sara A. Byron
Seungchan Kim
Sidharth Kulkarni
Winnie S. Liang
Publication venue: 'Frontiers Media SA'
Publication date: 01/03/2019
Field of study

Archival tumor samples represent a rich resource of annotated specimens for translational genomics research. However, standard variant calling approaches require a matched normal sample from the same individual, which is often not available in the retrospective setting, making it difficult to distinguish between true somatic variants and individual-specific germline variants. Archival sections often contain adjacent normal tissue, but this tissue can include infiltrating tumor cells. As existing comparative somatic variant callers are designed to exclude variants present in the normal sample, a novel approach is required to leverage adjacent normal tissue with infiltrating tumor cells for somatic variant calling. Here we present lumosVar 2.0, a software package designed to jointly analyze multiple samples from the same patient, built upon our previous single sample tumor only variant caller lumosVar 1.0. The approach assumes that the allelic fraction of somatic variants and germline variants follow different patterns as tumor content and copy number state change. lumosVar 2.0 estimates allele specific copy number and tumor sample fractions from the data, and uses a to model to determine expected allelic fractions for somatic and germline variants and to classify variants accordingly. To evaluate the utility of lumosVar 2.0 to jointly call somatic variants with tumor and adjacent normal samples, we used a glioblastoma dataset with matched high and low tumor content and germline whole exome sequencing data (for true somatic variants) available for each patient. Both sensitivity and positive predictive value were improved when analyzing the high tumor and low tumor samples jointly compared to analyzing the samples individually or in-silico pooling of the two samples. Finally, we applied this approach to a set of breast and prostate archival tumor samples for which tumor blocks containing adjacent normal tissue were available for sequencing. Joint analysis using lumosVar 2.0 detected several variants, including known cancer hotspot mutations that were not detected by standard somatic variant calling tools using the adjacent tissue as presumed normal reference. Together, these results demonstrate the utility of leveraging paired tissue samples to improve somatic variant calling when a constitutional sample is not available

Analysis of the genome and transcriptome of Cryptococcus neoformans var. grubii reveals complex RNA expression and microevolution leading to virulence attenuation.

Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence

Università degli Studi del Molise: IRIS

DukeSpace

ProdInra

HAL-Inserm

Open Access Repository of IISc Research Publications

eScholarship - University of California

University of Melbourne Institutional Repository

HAL-Pasteur

Genomic analysis of the necrotrophic fungal pathogens Sclerotinia sclerotiorum and Botrytis cinerea

Author: Amselem J
Andrew M
Anthouard V
Beever RE
Beffa R
Benito EP
Benoit I
Bouzid O
Brault B
Chen Z
Choquer M
Collémare J
Cotton P
Couloux A
Coutinho PM
Cuomo CA
Da Silva C
Danchin EG
de Vries RP
Dickman M
Dyer PS
Fillinger S
Fournier E
Gautier A
Giraud C
Giraud T
Gonzalez C
Gout L
Grossetete S
Güldener U
Hahn M
Henrissat B
Howlett BJ
Kodira C
Kohn L
Kretschmer M
Lapalu N
Lappartient A
Lebrun M-H
Leroch M
Levis C
Mauceli E
Neuvéglise C
Oeser B
Pearson M
Plummer KM
Poulain J
Poussereau N
Pradier J-M
Quesneville H
Quévillon E
Rascle C
Rollins JA
Schumacher J
Sexton A
Sharon A
Silva E
Simon A
Sirven C
Soanes DM
Ségurens B
Talbot NJ
Templeton M
ten Have A
Tudzynski B
Tudzynski P
van Kan JAL
Viaud M
Wincker P
Yandava C
Yarden O
Zeng Q
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 13/02/2017
Field of study

This is the final version of the article. Available from the publisher via the DOI in this record.Sclerotinia sclerotiorum and Botrytis cinerea are closely related necrotrophic plant pathogenic fungi notable for their wide host ranges and environmental persistence. These attributes have made these species models for understanding the complexity of necrotrophic, broad host-range pathogenicity. Despite their similarities, the two species differ in mating behaviour and the ability to produce asexual spores. We have sequenced the genomes of one strain of S. sclerotiorum and two strains of B. cinerea. The comparative analysis of these genomes relative to one another and to other sequenced fungal genomes is provided here. Their 38-39 Mb genomes include 11,860-14,270 predicted genes, which share 83% amino acid identity on average between the two species. We have mapped the S. sclerotiorum assembly to 16 chromosomes and found large-scale co-linearity with the B. cinerea genomes. Seven percent of the S. sclerotiorum genome comprises transposable elements compared to <1% of B. cinerea. The arsenal of genes associated with necrotrophic processes is similar between the species, including genes involved in plant cell wall degradation and oxalic acid production. Analysis of secondary metabolism gene clusters revealed an expansion in number and diversity of B. cinerea-specific secondary metabolites relative to S. sclerotiorum. The potential diversity in secondary metabolism might be involved in adaptation to specific ecological niches. Comparative genome analysis revealed the basis of differing sexual mating compatibility systems between S. sclerotiorum and B. cinerea. The organization of the mating-type loci differs, and their structures provide evidence for the evolution of heterothallism from homothallism. These data shed light on the evolutionary and mechanistic bases of the genetically complex traits of necrotrophic pathogenicity and sexual mating. This resource should facilitate the functional studies designed to better understand what makes these fungi such successful and persistent pathogens of agronomic crops.The Sclerotinia sclerotiorum genome project was supported by the USDA Cooperative State Research, Education and Extension Service (USDA-NRI 2004). Sclerotinia sclerotiorum ESTs were funded by a grant to JA Rollins from USDA specific cooperative agreement 58-5442-4-281. The genome sequence of Botrytis cinerea strain T4 was funded by Genoscope, CEA, France. M Viaud was funded by the “Projet INRA Jeune-Equipe”. PM Coutinho and B Henrissat were funded by the ANR to project E-Tricel (grant ANR-07-BIOE-006). The CAZy database is funded in part by GIS-IBiSA. DM Soanes and NJ Talbot were partly funded by the UK Biotechnology and Biological Sciences Research Council. KM Plummer was partially funded by the New Zealand Bio-Protection Research Centre, http://bioprotection.org.nz/. BJ Howlett and A Sexton were partially funded by the Australian Grains Research and Development Corporation, www.grdc.com.au. L Kohn was partially funded by NSERC Discovery Grant (Natural Sciences and Engineering Research Council of Canada) - Grant number 458078. M Dickman was supported by the NSF grant MCB-092391 and BARD grant US-4041-07C. O Yarden was supported by BARD grant US-4041-07C. EG Danchin obtained financial support from the European Commission (STREP FungWall grant, contract: LSHB - CT- 2004 - 511952). A Botrytis Genome Workshop (Kaiserslautern, Germany) was supported by a grant from the German Science Foundation (DFG; HA1486) to M Hahn

Open Research Exeter

Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)

Author: A Dijkhuizen
A Saveliev
AG Jones
C Jeffrey
C Schlötterer
C Schlötterer
Chinnappa D Kodira
CR Primmer
D Tautz
DE MacHugh
DJ Somers
Douglas A Senalik
EA Sia
FC Serquen
G Bates
G Toth
GA Tuskan
H Ellegren
H Ellegren
H Ellegren
H Innan
H Wang
HJ Price
IRGSP (International Rice Genome Sequencing Project)
J Quackenbush
JA Eisen
JC Garza
JE Bowers
JH Mun
JH Peng
JL Weber
JM Bradeen
K Arumuganathan
L Cardle
L Santi
LD Knerr
Luming Yang
M Morgante
M Morgante
M Wierdl
MD Robbins
MG Murray
N Huo
N Watcharawongpaiboon
NN Fitzsimmons
O Jaillon
P Sebastian
Pablo F Cavagnaro
Philipp W Simon
Q Kong
R Gur-Arie
R Lister
R Wooster
RK Varshney
S Huang
S Leclercq
S Rozen
S Subramanian
S Temnykh
Sanwen Huang
SG Guo
SP Zhang
SR Kruglyak
SR McCouch
SS Renner
T Horejsi
T Thiel
Timothy T Harkins
TW Whitaker
V Meglic
W Powell
WC Kennard
Y Danin-Poleg
Y Kashi
Y Ren
Y Weng
Y Weng
Y Weng
Y Weng
YC Li
YH Han
YH Park
Yiqun Weng
Z Li
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Cucumber, <it>Cucumis sativus </it>L. is an important vegetable crop worldwide. Until very recently, cucumber genetic and genomic resources, especially molecular markers, have been very limited, impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated DNA sequences, which are frequently favored as genetic markers due to their high level of polymorphism and codominant inheritance. Data from previously characterized genomes has shown that these repeats vary in frequency, motif sequence, and genomic location across taxa. During the last year, the genomes of two cucumber genotypes were sequenced including the Chinese fresh market type inbred line '9930' and the North American pickling type inbred line 'Gy14'. These sequences provide a powerful tool for developing markers in a large scale. In this study, we surveyed and characterized the distribution and frequency of perfect microsatellites in 203 Mbp assembled Gy14 DNA sequences, representing 55% of its nuclear genome, and in cucumber EST sequences. Similar analyses were performed in genomic and EST data from seven other plant species, and the results were compared with those of cucumber. Results A total of 112,073 perfect repeats were detected in the Gy14 cucumber genome sequence, accounting for 0.9% of the assembled Gy14 genome, with an overall density of 551.9 SSRs/Mbp. While tetranucleotides were the most frequent microsatellites in genomic DNA sequence, dinucleotide repeats, which had more repeat units than any other SSR type, had the highest cumulative sequence length. Coding regions (ESTs) of the cucumber genome had fewer microsatellites compared to its genomic sequence, with trinucleotides predominating in EST sequences. AAG was the most frequent repeat in cucumber ESTs. Overall, AT-rich motifs prevailed in both genomic and EST data. Compared to the other species examined, cucumber genomic sequence had the highest density of SSRs (although comparable to the density of poplar, grapevine and rice), and was richest in AT dinucleotides. Using an electronic PCR strategy, we investigated the polymorphism between 9930 and Gy14 at 1,006 SSR loci, and found unexpectedly high degree of polymorphism (48.3%) between the two genotypes. The level of polymorphism seems to be positively associated with the number of repeat units in the microsatellite. The <it>in silico </it>PCR results were validated empirically in 660 of the 1,006 SSR loci. In addition, primer sequences for more than 83,000 newly-discovered cucumber microsatellites, and their exact positions in the Gy14 genome assembly were made publicly available. Conclusions The cucumber genome is rich in microsatellites; AT and AAG are the most abundant repeat motifs in genomic and EST sequences of cucumber, respectively. Considering all the species investigated, some commonalities were noted, especially within the monocot and dicot groups, although the distribution of motifs and the frequency of certain repeats were characteristic of the species examined. The large number of SSR markers developed from this study should be a significant contribution to the cucurbit research community.</p

Springer - Publisher Connector

Diposit Digital de la Universitat de Barcelona

Sequencing of Culex quinquefasciatus establishes a platform for mosquito comparative genomics

Author: Abrudan Jenica
Amedeo Paolo
Antelo Beatriz
Arensburger Peter
Atkinson Peter W.
Bartholomay Lyric
Bidwell Shelby
Birren Bruce
Caler Elisabet
Camara Francisco
Campbell Corey L.
Campbell Kathryn S.
Casola Claudio
Castro Marta T.
Chandramouliswaran Ishwar
Chapman Sinéad B.
Christensen Bruce M.
Christley Scott
Collins Frank H.
Cornel Anthony
Costas Javier
Dimopoulos George
Eisenstadt Eric
Feschotte Cedric
Fraser-Liggett Claire
Guigó Serra Roderic
Haas Brian
Hammond Martin
Hannick Linda I.
Hansson Bill S.
Hemingway Janet
Higgs Stephen
Hill Sharon
Howarth Clint
Ignell Rickard
Kennedy Ryan C.
Kodira Chinnappa D.
Lanzaro Gregory C.
Lawson Daniel
Lee Norman H.
Liu Nannan
Lobo Neil F.
Mao Chunhong
Mayhew George
Megy Karine
Michel Kristin
Mori Akio
Muskavitch Marc A. T.
Naveira Horacio
Nene Vishvanath
Nguyen Nam
Pearson Matthew D.
Pritham Ellen J.
Puiu Daniela
Qi Yumin
Raikhel Alexander S.
Ranson Hilary
Ribeiro Jose M. C.
Roberston Hugh M.
Severson David W.
Shumway Martin
Stanke Mario
Strausberg Robert
Sun Cheng
Sutton Granger
Tu Zhijian (Jake)
Tubio Jose M. C.
Unger Maria F
Vanlandingham Dana L.
Vilella Albert J.
Waterhouse Robert M.
White Jared R.
White Owen
Wondji Charles S.
Wortman Jennifer
Zdobnov Evgeny M.
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 24/07/2018
Field of study

Culex quinquefasciatus (the southern house mosquito) is an important mosquito vector of viruses such as West Nile virus and St. Louis encephalitis virus, as well as of nematodes that cause lymphatic filariasis. C. quinquefasciatus is one species within the Culex pipiens species complex and can be found throughout tropical and temperate climates of the world. The ability of C. quinquefasciatus to take blood meals from birds, livestock, and humans contributes to its ability to vector pathogens between species. Here, we describe the genomic sequence of C. quinquefasciatus: Its repertoire of 18,883 protein-coding genes is 22% larger than that of Aedes aegypti and 52% larger than that of Anopheles gambiae with multiple gene-family expansions, including olfactory and gustatory receptors, salivary gland genes, and genes associated with xenobiotic detoxification

Insights into evolution of multicellular fungi from the assembled chromosomes of the mushroom Coprinopsis cinerea (Coprinus cinereus)

Author: Ahren D.
Au C. H.
Birren B. W.
Borodovsky M.
Burns C.
Canback B.
Casselton L. A.
Cheng C. K.
Deng J.
Dietrich F. S.
Fargo D. C.
Farman M. L.
Gathman A. C.
Goldberg J.
Guigo R.
Hoegger P. J.
Hooker J. B.
Huggins A.
James T. Y.
Kamada T.
Kilaru S.
Kodira C.
Kues U.
Kupfer D.
Kwan H. S.
Li W.
Lilly W. W.
Lomsadze A.
Ma L.-J.
Mackey A. J.
Stajich J. E.
Wilke S. K.
Publication venue
Publication date: 01/01/2010
Field of study

The mushroom Coprinopsis cinerea is a classic experimental model for multicellular development in fungi because it grows on defined media, completes its life cycle in 2 weeks, produces some 108 synchronized meiocytes, and can be manipulated at all stages in development by mutation and transformation. The 37-megabase genome of C. cinerea was sequenced and assembled into 13 chromosomes. Meiotic recombination rates vary greatly along the chromosomes, and retrotransposons are absent in large regions of the genome with low levels of meiotic recombination. Single-copy genes with identifiable orthologs in other basidiomycetes are predominant in low-recombination regions of the chromosome. In contrast, paralogous multicopy genes are found in the highly recombining regions, including a large family of protein kinases (FunK1) unique to multicellular fungi. Analyses of P450 and hydrophobin gene families confirmed that local gene duplications drive the expansions of paralogous copies and the expansions occur in independent lineages of Agaricomycotina fungi. Gene-expression patterns from microarrays were used to dissect the transcriptional program of dikaryon formation (mating). Several members of the FunK1 kinase family are differentially regulated during sexual morphogenesis, and coordinate regulation of adjacent duplications is rare. The genomes of C. cinerea and Laccaria bicolor, a symbiotic basidiomycete, share extensive regions of synteny. The largest syntenic blocks occur in regions with low meiotic recombination rates, no transposable elements, and tight gene spacing, where orthologous single-copy genes are overrepresented. The chromosome assembly of C. cinerea is an essential resource in understanding the evolution of multicellularity in the fungi

Carolina Digital Repository

Comparative and Functional Genomics of Rhodococcus opacus PD630 for Biofuels Development

Author: A Arakaki
A Argyrou
A Marchler-Bauer
A Pohlmann
A Stamatakis
AF Alvarez
AI Saeed
AJ Enright
AK Pandey
AL Delcher
AL Delcher
Alex L. B. Leach
AM Waterhouse
Anthony C. DeBono
Anthony J. Sinskey
AR Horswill
Brian Desany
Bruce W. Birren
C Kaddor
Chinnappa D. Kodira
Christine Dancel
Christopher A. Desjardins
D Jendrossek
D Portevin
D Post
DE Vance
Dirk Gevers
DL Rainwater
DL Rainwater
DP MacEachran
E Puglisi
E Schwartz
E Schweizer
E Severi
E Severi
E Vimr
ER Goncalves
F Abascal
F David
F-F Hsu
G Timmins
HM Alvarez
HM Alvarez
HM Alvarez
I Letunic
I Matsunaga
IB Lomakin
IC Sutcliffe
Ion Ghiviriga
J Hughes
J Rengarajan
Jason P. Affourtit
Jason W. Holder
Jeremy Zucker
Jil C. Ulrich
JM Mathieu
K Isono
K Katoh
K Kurosawa
K Lagesen
K Raman
KC Yam
KR Robrock
L Diacovich
L Li
M Brudno
M Green
M Hernandez
M Seto
M Wu
MA Larkin
MJ de Hoon
MP Mansour
MP McLeod
O Lenz
O Zimhony
OP Peoples
PA Lessard
Paul A. Godfrey
Paul M. Richardson
PD Karp
PR Romero
Qiandong Zeng
R Edgar
R Gande
R Gande
R Kalscheuer
R Van der Geize
RD Finn
RL Hunter
S Griffiths-Jones
S Guindon
S Kikuchi
S Rajakumari
SC Slater
SK Parker
T Chopra
T Lee
T Sirakova
TD Sirakova
TD Sirakova
Thomas Abeel
TM Lowe
U Grafe
X Yang
Y Hu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011
Field of study

The Actinomycetales bacteria Rhodococcus opacus PD630 and Rhodococcus jostii RHA1 bioconvert a diverse range of organic substrates through lipid biosynthesis into large quantities of energy-rich triacylglycerols (TAGs). To describe the genetic basis of the Rhodococcus oleaginous metabolism, we sequenced and performed comparative analysis of the 9.27 Mb R. opacus PD630 genome. Metabolic-reconstruction assigned 2017 enzymatic reactions to the 8632 R. opacus PD630 genes we identified. Of these, 261 genes were implicated in the R. opacus PD630 TAGs cycle by metabolic reconstruction and gene family analysis. Rhodococcus synthesizes uncommon straight-chain odd-carbon fatty acids in high abundance and stores them as TAGs. We have identified these to be pentadecanoic, heptadecanoic, and cis-heptadecenoic acids. To identify bioconversion pathways, we screened R. opacus PD630, R. jostii RHA1, Ralstonia eutropha H16, and C. glutamicum 13032 for growth on 190 compounds. The results of the catabolic screen, phylogenetic analysis of the TAGs cycle enzymes, and metabolic product characterizations were integrated into a working model of prokaryotic oleaginy.Cambridge-MIT InstituteMassachusetts Institute of Technology. (Seed Grant program)Shell Oil CompanyNational Institute of Allergy and Infectious Diseases (U.S.)United States. National Institutes of HealthNational Institutes of Health. Department of Health and Human Services (Contract No. HHSN272200900006C

CiteSeerX

Public Library of Science (PLOS)

DSpace@MIT

MURAL - Maynooth University Research Archive Library

Evolution of pathogenicity and sexual reproduction in eight Candida genomes

Author: A Forche
A Tavanti
Aaron M. Neiman
AE Tsong
AE Tsong
Alistair J. P. Brown
Anja Forche
AS Chau
B Dujon
B Slutsky
BB Tuch
Bernhard Hube
BR Braun
Bruce W. Birren
Carol A. Munro
Chinnappa Kodira
Christina A. Cuomo
CM Hull
David A. Fitzpatrick
David Harris
David Soll
DR Scannell
E Fabre
Elissavet Nikolaou
Esther Rheinbay
Florian F. Schmitzberger
Frans M. Klis
Gavin Sherlock
Geraldine Butler
Ian Stansfield
Ino Agrafioti
J Zhang
Janet Quinn
Jennifer L. Reedy
JL Argueso
Joseph Heitman
JP van der Walt
JP van der Walt
Judith Berman
K Nielsen
Kevin A. T. Silverstein
KJ Daniels
KJ Verstrepen
KM Yeater
KW Tzung
LL Hoyer
Lois L. Hoyer
M Legrand
M van het Hoog
MA Pfaller
MA Santos
Manfred Grabherr
Manolis Kellis
Manuel A. S. Santos
Marek S. Skrzypek
Maria C. Costanzo
Maria C. Santos
Martha B. Arnaud
Mary E. Logue
Matthew Berriman
Matthew D. Rasmussen
ME Logue
Michael A. Quail
Michael C. Lorenz
Michael F. Lin
Michael P. H. Stumpf
Neil A. R. Gow
Nicola Lennard
Peter E. Sudbery
Piet W. J. de Groot
Prachi Shah
Qiandong Zeng
QT Phan
R Kaur
RE Zordan
Rodney Staggs
Ronny Martin
RS Almeida
S Bates
Sascha Brunke
SE Massey
Sharadha Sakthikumar
SM Noble
SR Lockhart
SR Lockhart
Steven Bates
T de los Santos
T Jones
Thyagarajan Srikantha
TW Jeffries
WS Chu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Candida species are the most common cause of opportunistic fungal infection worldwide. Here we report the genome sequences of six Candida species and compare these and related pathogens and non-pathogens. There are significant expansions of cell wall, secreted and transporter gene families in pathogenic species, suggesting adaptations associated with virulence. Large genomic tracts are homozygous in three diploid species, possibly resulting from recent recombination events. Surprisingly, key components of the mating and meiosis pathways are missing from several species. These include major differences at the mating-type loci (MTL); Lodderomyces elongisporus lacks MTL, and components of the a1/2 cell identity determinant were lost in other species, raising questions about how mating and cell types are controlled. Analysis of the CUG leucine-to-serine genetic-code change reveals that 99% of ancestral CUG codons were erased and new ones arose elsewhere. Lastly, we revise the Candida albicans gene catalogue, identifying many new genes.publishe

Repositório Institucional da Universidade de Aveiro

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive