Search CORE

81 research outputs found

Dissecting Allele Architecture of Early Onset IBD Using High-Density Genotyping

Author: +24 additional authors
Baldassano R.
Cutler D. J.
Dubinsky M.
Guthery S. L.
Kugathasan S.
Markowitz J.
Okou D. T.
Prahalad S.
Walters T.
Zwick M. E.
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2015
Field of study

BACKGROUND: The inflammatory bowel diseases (IBD) are common, complex disorders in which genetic and environmental factors are believed to interact leading to chronic inflammatory responses against the gut microbiota. Earlier genetic studies performed in mostly adult population of European descent identified 163 loci affecting IBD risk, but most have relatively modest effect sizes, and altogether explain only ~20% of the genetic susceptibility. Pediatric onset represents about 25% of overall incident cases in IBD, characterized by distinct disease physiology, course and risks. The goal of this study is to compare the allelic architecture of early onset IBD with adult onset in population of European descent. METHODS: We performed a fine mapping association study of early onset IBD using high-density Immunochip genotyping on 1008 pediatric-onset IBD cases (801 Crohn\u27s disease; 121 ulcerative colitis and 86 IBD undetermined) and 1633 healthy controls. Of the 158 SNP genotypes obtained (out of the 163 identified in adult onset), this study replicated 4% (5 SNPs out of 136) of the SNPs identified in the Crohn\u27s disease (CD) cases and 0.8% (1 SNP out of 128) in the ulcerative colitis (UC) cases. Replicated SNPs implicated the well known NOD2 and IL23R. The point estimate for the odds ratio (ORs) for NOD2 was above and outside the confidence intervals reported in adult onset. A polygenic liability score weakly predicted the age of onset for a larger collection of CD cases (p\u3c 0.03, R2= 0.007), but not for the smaller number of UC cases. CONCLUSIONS: The allelic architecture of common susceptibility variants for early onset IBD is similar to that of adult onset. This immunochip genotyping study failed to identify additional common variants that may explain the distinct phenotype that characterize early onset IBD. A comprehensive dissection of genetic loci is necessary to further characterize the genetic architecture of early onset IBD

Hofstra Northwell Academic Works (Hofstra Northwell School of Medicine)

Efficient oligonucleotide probe selection for pan-genomic tiling arrays

Author: A Toledo-Arana
AB Olshen
Adam M Phillippy
AM Phillippy
C Zhang
D Johnson
D Medini
D Pinkel
D Volokhov
D Wang
DG Wang
DR Call
DT Okou
FR Pinto
G Ausiello
GJ Porreca
H Tettelin
H Tettelin
H Willenbrock
H Willenbrock
JM Farber
L Snipen
L Snipen
M Doumith
M Schena
M Wiedmann
MK Borucki
MR Garey
P Bertone
S Feng
S Graf
S Kurtz
Steven L Salzberg
T Slezak
TC Mockler
TG Ksiazek
TJ Albert
U Feige
W Tembe
Wei Zhang
WH Chung
Xiangyu Deng
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Background: Array comparative genomic hybridization is a fast and cost-effective method for detecting, genotyping, and comparing the genomic sequence of unknown bacterial isolates. This method, as with all microarray applications, requires adequate coverage of probes targeting the regions of interest. An unbiased tiling of probes across the entire length of the genome is the most flexible design approach. However, such a whole-genome tiling requires that the genome sequence is known in advance. For the accurate analysis of uncharacterized bacteria, an array must query a fully representative set of sequences from the species' pan-genome. Prior microarrays have included only a single strain per array or the conserved sequences of gene families. These arrays omit potentially important genes and sequence variants from the pan-genome. Results: This paper presents a new probe selection algorithm (PanArray) that can tile multiple whole genomes using a minimal number of probes. Unlike arrays built on clustered gene families, PanArray uses an unbiased, probe-centric approach that does not rely on annotations, gene clustering, or multi-alignments. Instead, probes are evenly tiled across all sequences of the pangenome at a consistent level of coverage. To minimize the required number of probes, probes conserved across multiple strains in the pan-genome are selected first, and additional probes are used only where necessary to span polymorphic regions of the genome. The viability of the algorithm is demonstrated by array designs for seven different bacterial pan-genomes and, in particular, the design of a 385,000 probe array that fully tiles the genomes of 20 different Listeria monocytogenes strains with overlapping probes at greater than twofold coverage. Conclusion: PanArray is an oligonucleotide probe selection algorithm for tiling multiple genome sequences using a minimal number of probes. It is capable of fully tiling all genomes of a species on a single microarray chip. These unique pan-genome tiling arrays provide maximum flexibility for the analysis of both known and uncharacterized strains.https://doi.org/10.1186/1471-2105-10-29

Crossref

Springer - Publisher Connector

PubMed Central

Digital Repository at the University of Maryland

Targeted next-generation sequencing of DNA regions proximal to a conserved GXGXXG signaling motif enables systematic discovery of tyrosine kinase fusions in cancer

Author: Agnes Viale
Albert
Bean
Ben-Neriah
Berger
Campbell
Ciampi
Ciampi
Druker
Gnirke
Gu
Hanks
Hantschel
Hodges
Horn
Jossart
Juliann Chmielecki
Katherine Hutchinson
Kent
Kent
Koivunen
Kumar-Sinha
Leary
Levin
Maher
Maher
Manning
Margulies
Martin Peifer
Nagar
Nicholas D. Socci
Nowell
Okou
Peilin Jia
Rabes
Rikova
Roman K. Thomas
Rowley
Santoro
Sawyers
Soda
Stephens
Takeuchi
Tracy
William Pao
Zhao
Zhongming Zhao
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

Tyrosine kinase (TK) fusions are attractive drug targets in cancers. However, rapid identification of these lesions has been hampered by experimental limitations. Our in silico analysis of known cancer-derived TK fusions revealed that most breakpoints occur within a defined region upstream of a conserved GXGXXG kinase motif. We therefore designed a novel DNA-based targeted sequencing approach to screen systematically for fusions within the 90 human TKs; it should detect 92% of known TK fusions. We deliberately paired ‘in-solution’ DNA capture with 454 sequencing to minimize starting material requirements, take advantage of long sequence reads, and facilitate mapping of fusions. To validate this platform, we analyzed genomic DNA from thyroid cancer cells (TPC-1) and leukemia cells (KG-1) with fusions known only at the mRNA level. We readily identified for the first time the genomic fusion sequences of CCDC6-RET in TPC-1 cells and FGFR1OP2-FGFR1 in KG-1 cells. These data demonstrate the feasibility of this approach to identify TK fusions across multiple human cancers in a high-throughput, unbiased manner. This method is distinct from other similar efforts, because it focuses specifically on targets with therapeutic potential, uses only 1.5 µg of DNA, and circumvents the need for complex computational sequence analysis

Crossref

Kölner UniversitätsPublikationsServer

PubMed Central

MPG.PuRe

Comprehensive assessment of sequence variation within the copy number variable defensin cluster on 8p23 by target enriched in-depth 454 sequencing

Author: A Gnirke
AJ Brookes
Andreas Petzold
AU Rehman
B Ewing
B Ewing
D Fredman
D Summerer
D Summerer
DA Wheeler
DR Bentley
DT Okou
DW Kim
E Hodges
EJ Hollox
Francesca Raffaelli
GJ Porreca
J Chmielecki
J Wang
JA Armour
JI Kim
Jochen Hampe
JR Conejo-Garcia
K Huse
Karol Szafranski
Klaus Huse
M Choi
M Groth
M Groth
M Meyer
Marco Groth
Marius Felder
Matthias Platzer
MN Bainbridge
N Schracke
Philip Rosenstiel
RI Lehrer
RJ Hardwick
S Taudien
S Taudien
SC Schuster
SM Ahn
Stefan Schreiber
Stefan Taudien
TJ Albert
WJ Kent
Xinmin Zhang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background In highly copy number variable (CNV) regions such as the human defensin gene locus, comprehensive assessment of sequence variations is challenging. PCR approaches are practically restricted to tiny fractions, and next-generation sequencing (NGS) approaches of whole individual genomes e.g. by the 1000 Genomes Project is confined by an affordable sequence depth. Combining target enrichment with NGS may represent a feasible approach. Results As a proof of principle, we enriched a ~850 kb section comprising the CNV defensin gene cluster DEFB, the invariable DEFA part and 11 control regions from two genomes by sequence capture and sequenced it by 454 technology. 6,651 differences to the human reference genome were found. Comparison to HapMap genotypes revealed sensitivities and specificities in the range of 94% to 99% for the identification of variations. Using error probabilities for rigorous filtering revealed 2,886 unique single nucleotide variations (SNVs) including 358 putative novel ones. DEFB CN determinations by haplotype ratios were in agreement with alternative methods. Conclusion Although currently labor extensive and having high costs, target enriched NGS provides a powerful tool for the comprehensive assessment of SNVs in highly polymorphic CNV regions of individual genomes. Furthermore, it reveals considerable amounts of putative novel variations and simultaneously allows CN estimation.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Hochschulbibliothekszentrum des Landes Nordrhein-Westfalen (hbz)

Transethnic analysis of the human leukocyte antigen region for ulcerative colitis reveals not only shared but also ethnicity-specific disease associations

Author: Abedian S.
Alizadeh B.
BK T.
Boucher G.
Brant S.
Cheon J.
Daryani N.
Datta L.
Degenhardt F.
ElAbd H.
Ellinghaus D.
Ellinghaus E.
Ellul P.
Esaki M.
Franke A.
Fuyuno Y.
Haritunians T.
Hong M.
Hübenthal M.
Jung E.
Juyal G.
Juzenas S.
Karlsen T.
Kubo M.
Kugathasan S.
Lenz T.
Leslie S.
Malekzadeh R.
Mayr G.
McGovern D.
Midha V.
Motyer A.
Ng S.
Okou D.
Raychaudhuri S.
Rioux J.
Rosati E.
Schembri J.
Schreiber S.
Song K.
Sood A.
Takahashi A.
Torres E.
Umeno J.
Vahedi H.
Weersma R.
Wendorff M.
Wong S.
Yamazaki K.
Yang S.
Ye B.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/03/2021
Field of study

Inflammatory bowel disease (IBD) is a chronic inflammatory disease of the gut. Genetic association studies have identified the highly variable human leukocyte antigen (HLA) region as the strongest susceptibility locus for IBD, and specifically DRB1*01:03 as a determining factor for ulcerative colitis (UC). However, for most of the association signal such a delineation could not be made due to tight structures of linkage disequilibrium within the HLA. The aim of this study was therefore to further characterize the HLA signal using a trans-ethnic approach. We performed a comprehensive fine mapping of single HLA alleles in UC in a cohort of 9,272 individuals with African American, East Asian, Puerto Rican, Indian and Iranian descent and 40,691 previously analyzed Caucasians, additionally analyzing whole HLA haplotypes. We computationally characterized the binding of associated HLA alleles to human self-peptides and analysed the physico-chemical properties of the HLA proteins and predicted self-peptidomes. Highlighting alleles of the HLA-DRB1*15 group and their correlated HLA-DQ-DR haplotypes, we identified consistent associations across different ethnicities but also identified population-specific signals. We observed that DRB1*01:03 is mostly present in individuals of Western European descent and hardly present in non-Caucasian individuals. We found peptides predicted to bind to risk HLA alleles to be rich in positively charged amino acids such. We conclude that the HLA plays an important role for UC susceptibility across different ethnicities. This research further implicates specific features of peptides that are predicted to bind risk and protective HLA proteins

MPG.PuRe

University of Melbourne Institutional Repository

High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species

Abstract Background High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera. Results We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of <it>Eucalyptus </it>from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for <it>E. grandis</it>. A systematic assessment of <it>in silico </it>SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous <it>in silico </it>constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine <it>Eucalyptus </it>species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions This study indicates that the GGGT performs well both within and across species of <it>Eucalyptus </it>notwithstanding its nucleotide diversity ≥2%. The development of a much larger array of informative SNPs across multiple <it>Eucalyptus </it>species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in <it>Eucalyptus</it>.</p

Repository Open Access to Scientific Information from Embrapa

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

RCAAP - Repositório Científico de Acesso Aberto de Portugal

Universidade de São Paulo

Identification of factors required for meristem function in Arabidopsis using a novel next generation sequencing fast forward genetics approach

Author: A Gnirke
AC Darby
Anja van Dijken
B Scheres
B Wightman
Ben Scheres
BJ Till
C Nusslein-Volhard
CA ten Hove
D Botstein
DT Okou
E Cuppen
Edwin Cuppen
H Li
IJ Nijman
Isaäc J Nijman
J Maple
J Shendure
JG Williams
JJ Doyle
JM Murray
JM Rommens
K Schneeberger
LE Jao
M Doitsidou
M Mokry
Michal Mokry
ML Goldberg
N Schracke
P Vos
PC Ng
RC Lee
Rene Benjamins
Renze Heidstra
RW Michelmore
S Bougourd
S Sabatini
S Sarin
S Sivasubbu
SB Ng
TJ Albert
Y Duverger
Y Kim
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Phenotype-driven forward genetic experiments are powerful approaches for linking phenotypes to genomic elements but they still involve a laborious positional cloning process. Although sequencing of complete genomes now becomes available, discriminating causal mutations from the enormous amounts of background variation remains a major challenge. Method To improve this, we developed a universal two-step approach, named 'fast forward genetics', which combines traditional bulk segregant techniques with targeted genomic enrichment and next-generation sequencing technology Results As a proof of principle we successfully applied this approach to two Arabidopsis mutants and identified a novel factor required for stem cell activity. Conclusion We demonstrated that the 'fast forward genetics' procedure efficiently identifies a small number of testable candidate mutations. As the approach is independent of genome size, it can be applied to any model system of interest. Furthermore, we show that experiments can be multiplexed and easily scaled for the identification of multiple individual mutants in a single sequencing run.</p

Crossref

Online Research @ Cardiff

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Systematic generation of in vivo G protein-coupled receptor mutants in the rat

Author: AM Geurts
B Vroling
BM Smits
BM Smits
C Baakman
D Huszar
D VanLeeuwen
DA Keays
DA Nickerson
DT Okou
E Cuppen
F Horn
F Horn
G Vriend
GB Mills
H van Roekel
HJ Jacob
HJ Jacob
I J Nijman
IS Farooqi
J Hines
J Mul
JA Ballesteros
JJ Contos
JK Noveroske
JM Amos-Landgraf
JR Homberg
K Kitada
K Palczewski
L Kruglyak
M Buehr
M Verheul
MS Cotroneo
MS Springer
MW Beukers
N Claij
NM Urs
O Fritze
P Li
P Ma
P Toonen
PC Ng
Q Liang
R van Boxtel
R van Boxtel
R van Boxtel
RJ Anney
S Rozen
SG Rasmussen
SM Harrison
T Mashimo
T Warne
V Guryev
WJ Ansorge
WL Russell
Y Gondo
Y Zan
Publication venue: Nature Publishing Group
Publication date: 01/01/2010
Field of study

G-protein-coupled receptors (GPCRs) constitute a large family of cell surface receptors that are involved in a wide range of physiological and pathological processes, and are targets for many therapeutic interventions. However, genetic models in the rat, one of the most widely used model organisms in physiological and pharmacological research, are largely lacking. Here, we applied N-ethyl-N-nitrosourea (ENU)-driven target-selected mutagenesis to generate an in vivo GPCR mutant collection in the rat. A pre-selected panel of 250 human GPCR homologs was screened for mutations in 813 rats, resulting in the identification of 131 non-synonymous mutations. From these, seven novel potential rat gene knockouts were established as well as 45 lines carrying missense mutations in various genes associated with or involved in human diseases. We provide extensive in silico modeling results of the missense mutations and show experimental data, suggesting loss-of-function phenotypes for several models, including Mc4r and Lpar1. Taken together, the approach used resulted not only in a set of novel gene knockouts, but also in allelic series of more subtle amino acid variants, similar as commonly observed in human disease. The mutants presented here may greatly benefit studies to understand specific GPCR function and support the development of novel therapeutic strategies

Targeted high throughput sequencing in clinical cancer Settings: formaldehyde fixed-paraffin embedded (FFPE) tumor tissues, input amount and tumor heterogeneity

Author: A Gnirke
A Weise
AW Briggs
B Timmermann
Berger MSL F Michael
Bernd Timmermann
BS Taylor
C Greenman
CA Macintosh
D Aird
DG Bostwick
DT Okou
DW Bell
DW Parsons
E Hodges
ED Pleasance
ED Pleasance
G Bartsch
Georg Bartsch
Georg Schaefer
GJ Porreca
Hans Lehrach
Helmut Klocker
HM Wood
Holger Sültmann
Irmgard Verdorfer
J Clark
J Yu
L Ding
LD Wood
M Aihara
M Barry
Martin Kerick
Melanie Isau
Michal R Schweiger
MR Schweiger
MR Schweiger
N Navin
PM Krawitz
R Mehra
R Mehra
R Yatani
Ralf Herwig
RB Shah
S Jones
SB Ng
SP Shah
Sylvia Krobitsch
T Shiraishi
T Sjoblom
TJ Albert
TJ Ley
W Horninger
W Lee
W Liu
WA Sakr
Z Kan
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Massively parallel sequencing technologies have brought an enormous increase in sequencing throughput. However, these technologies need to be further improved with regard to reproducibility and applicability to clinical samples and settings. Methods Using identification of genetic variations in prostate cancer as an example we address three crucial challenges in the field of targeted re-sequencing: Small nucleotide variation (SNV) detection in samples of formalin-fixed paraffin embedded (FFPE) tissue material, minimal amount of input sample and sampling in view of tissue heterogeneity. Results We show that FFPE tissue material can supplement for fresh frozen tissues for the detection of SNVs and that solution-based enrichment experiments can be accomplished with small amounts of DNA with only minimal effects on enrichment uniformity and data variance. Finally, we address the question whether the heterogeneity of a tumor is reflected by different genetic alterations, e.g. different foci of a tumor display different genomic patterns. We show that the tumor heterogeneity plays an important role for the detection of copy number variations. Conclusions The application of high throughput sequencing technologies in cancer genomics opens up a new dimension for the identification of disease mechanisms. In particular the ability to use small amounts of FFPE samples available from surgical tumor resections and histopathological examinations facilitates the collection of precious tissue materials. However, care needs to be taken in regard to the locations of the biopsies, which can have an influence on the prediction of copy number variations. Bearing these technological challenges in mind will significantly improve many large-scale sequencing studies and will - in the long term - result in a more reliable prediction of individual cancer therapies.</p

Crossref

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Mapping of the Disease Locus and Identification of ADAMTS10 As a Candidate Gene in a Canine Model of Primary Open Angle Glaucoma

Author: A Roy
B Coulombe
B Langmead
C Drogemuller
D Ahram
DA Samuelson
DE Brooks
DT Okou
EA Merritt
Edward O. MacKay
EK Karlsson
ER Tamm
F Ramirez
GR Abecasis
Gregory S. Barsh
H Li
HG Parker
J Morales
John Kuchtey
Jonathan L. Haines
K Lindblad-Toh
K Soejima
KE Keller
KE Keller
Kirk N. Gelatt
KM Smith
KN Gelatt
KN Gelatt
L Faivre
L Faivre
Lana M. Olson
M Akiyama
M Ali
M Johnson
M Silberstein
MP Fautsch
N Dagoneau
NJ Izquierdo
OY Tektas
P Duggal
P Dureau
P Kumar
P Yue
PJ Kraulis
Rachel W. Kuchtey
RC Tripathi
RL Peiffer Jr
RP Somerville
RR Allingham
S Porter
SS Apte
T. M. Iverson
TA Jones
TJ Albert
Tommy Rinkoski
TS Acott
XL Zheng
YH Kwon
Z Duan
Publication venue: Public Library of Science
Publication date: 01/02/2011
Field of study

Primary open angle glaucoma (POAG) is a leading cause of blindness worldwide, with elevated intraocular pressure as an important risk factor. Increased resistance to outflow of aqueous humor through the trabecular meshwork causes elevated intraocular pressure, but the specific mechanisms are unknown. In this study, we used genome-wide SNP arrays to map the disease gene in a colony of Beagle dogs with inherited POAG to within a single 4 Mb locus on canine chromosome 20. The Beagle POAG locus is syntenic to a previously mapped human quantitative trait locus for intraocular pressure on human chromosome 19. Sequence capture and next-generation sequencing of the entire canine POAG locus revealed a total of 2,692 SNPs segregating with disease. Of the disease-segregating SNPs, 54 were within exons, 8 of which result in amino acid substitutions. The strongest candidate variant causes a glycine to arginine substitution in a highly conserved region of the metalloproteinase ADAMTS10. Western blotting revealed ADAMTS10 protein is preferentially expressed in the trabecular meshwork, supporting an effect of the variant specific to aqueous humor outflow. The Gly661Arg variant in ADAMTS10 found in the POAG Beagles suggests that altered processing of extracellular matrix and/or defects in microfibril structure or function may be involved in raising intraocular pressure, offering specific biochemical targets for future research and treatment strategies

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central