Search CORE

Directory of Open Access Journals

Harvard University - DASH

FigShare

Recommended from our members

On Quality Control Measures in Genome-wide Association Studies: A Test to Assess the Genotyping Quality of Individual Probands in Family-based Association Studies and an Application to the HapMap Data

Author: Dermitzakis Emmanouil T.
Fardo David W.
Ionita-Laza Iuliana
Lange Christoph
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 12/02/2012
Field of study

Allele transmissions in pedigrees provide a natural way of evaluating the genotyping quality of a particular proband in a family-based, genome-wide association study. We propose a transmission test that is based on this feature and that can be used for quality control filtering of genome-wide genotype data for individual probands. The test has one degree of freedom and assesses the average genotyping error rate of the genotyped SNPs for a particular proband. As we show in simulation studies, the test is sufficiently powerful to identify probands with an unreliable genotyping quality that cannot be detected with standard quality control filters. This feature of the test is further exemplified by an application to the third release of the HapMap data. The test is ideally suited as the final layer of quality control filters in the cleaning process of genome-wide association studies. It identifies probands with insufficient genotyping quality that were not removed by standard quality control filtering

Constructing gene association networks for rheumatoid arthritis using the backward genotype-trait association (BGTA) algorithm

Author: Cong Lei
Ding Yuejing
Ionita-Laza Iuliana
Lo Shaw-Hwa
Zheng Tian
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Rheumatoid arthritis (RA, MIM 180300) is a common and complex inflammatory disorder. The North American Rheumatoid Arthritis Consortium (NARAC) data, as part of the Genetic Analysis Workshop 15 data, consists of both genome scan and candidate gene studies on RA patients. We applied the backward genotype-trait association (BGTA) algorithm to capture marginal and gene × gene interaction effects of multiple susceptibility loci on RA disease status. A two-stage screening approach was used for the genome scan, whereas a comprehensive study of all possible subsets was conducted for the candidate genes. For the genome scan, we constructed an association network among 39 genetic loci that demonstrated strong signals, 19 of which have been reported in the RA literature. For the candidate genes, we found strong signals for PTPN22 and SUMO4. Based on significant association evidence, we built an association network among the loci of PTPN22, PADI4, DLG5, SLC22A4, SUMO4, and CARD15. To control for false positives, we used permutation tests to constrain the family-wise type I error rate to 1%. Using the BGTA algorithm, we identified genetic loci and candidate genes that were associated with RA susceptibility and association networks among them. For the first time, we report possible interactions between single-nucleotide polymorphisms/genes, which may be useful for biological interpretation

Comparing the power of family-based association tests for sequence data with applications in the GAW18 simulated data

Author: Chen Yong
Huang Jing
Ionita-Laza Iuliana
Swartz Michael
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2014
Field of study

We apply a family-based extension of the sequence kernel association test (SKAT) to 93 trios extracted from the 20 pedigrees in the Genetic Analysis Workshop 18 simulated data. Each extracted trio includes a unique set of parents to ensure conditionally independent trios are sampled. We compare the empirical type I error and power between the family-based SKAT and the burden test under varying percentages of causal single-nucleotide polymorphisms included in the analysis. Our investigation using simulated data suggests that, under the setting used for Genetic Analysis Workshop 18 data, both the family-based SKAT and the burden test have limited power, and that there is no substantial impact of percentage of signal on the power of either test. The low power is partially a result of the small sample size. However, we find that both the family-based SKAT and the burden test are more powerful when we use only rare variants, rather than common variants, to test the association

Springer - Publisher Connector

Identification of Rare Causal Variants in Sequence-Based Studies: Methods and Applications to VPS13B, a Gene Involved in Cohen Syndrome and Autism

Author: Buxbaum Joseph D.
Capanu Marinela
De Rubeis Silvia
Ionita-Laza Iuliana
McCallum Kenneth
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2014
Field of study

Pinpointing the small number of causal variants among the abundant naturally occurring genetic variation is a difficult challenge, but a crucial one for understanding precise molecular mechanisms of disease and follow-up functional studies. We propose and investigate two complementary statistical approaches for identification of rare causal variants in sequencing studies: a backward elimination procedure based on groupwise association tests, and a hierarchical approach that can integrate sequencing data with diverse functional and evolutionary conservation annotations for individual variants. Using simulations, we show that incorporation of multiple bioinformatic predictors of deleteriousness, such as PolyPhen-2, SIFT and GERP++ scores, can improve the power to discover truly causal variants. As proof of principle, we apply the proposed methods to VPS13B, a gene mutated in the rare neurodevelopmental disorder called Cohen syndrome, and recently reported with recessive variants in autism. We identify a small set of promising candidates for causal variants, including two loss-of-function variants and a rare, homozygous probably-damaging variant that could contribute to autism risk

Directory of Open Access Journals

FigShare

Recommended from our members

Joint study of genetic regulators for expression traits related to breast cancer

Author: Cong Lei
Ding Yuejing
Ionita-Laza Iuliana
Lo Shaw-Hwa
Wang Shuang
Zheng Tian
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

The mRNA expression levels of genes have been shown to have discriminating power for the classification of breast cancer. Studying the heritability of gene expression levels on breast cancer related transcripts can lead to the identification of shared common regulators and inter-regulation patterns, which would be important for dissecting the etiology of breast cancer. We applied multilocus association genome-wide scans to 18 breast cancer related transcripts and combined the results with traditional linkage scans. Regulatory hotspots for these transcripts were identified and some inter-regulation patterns were observed. We also derived evidence on interacting genetic regulatory loci shared by a number of these transcripts. In this paper, by restricting to a set of related genes, we were able to employ a more detailed multilocus approach that evaluates both marginal and interaction association signals at each single-nucleotide polymorphism. Interesting inter-regulation patterns and significant overlaps of genetic regulators between transcripts were observed. Interaction association results returned more expression quantitative trait locus hotspots that are significant

Essential Tremor in aCharcot-Marie-Tooth Type 2C Kindred Does Not Segregate with the TRPV4 R269H Mutation

Author: Elan D. Louis
Iuliana Ionita-Laza
Lorraine N. Clark
Nora Hernandez
Ruth Ottman
Publication venue: 'S. Karger AG'
Publication date
Field of study

Copy number variation genotyping using family information

Author: Chu Jen-hwa
Darvishi Katayoon
Ionita-Laza Iuliana
Lee Charles
Mills Ryan E
Raby Benjamin A
Rogers Angela
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/08/2015
Field of study

Abstract Background In recent years there has been a growing interest in the role of copy number variations (CNV) in genetic diseases. Though there has been rapid development of technologies and statistical methods devoted to detection in CNVs from array data, the inherent challenges in data quality associated with most hybridization techniques remains a challenging problem in CNV association studies. Results To help address these data quality issues in the context of family-based association studies, we introduce a statistical framework for the intensity-based array data that takes into account the family information for copy-number assignment. The method is an adaptation of traditional methods for modeling SNP genotype data that assume Gaussian mixture model, whereby CNV calling is performed for all family members simultaneously and leveraging within family-data to reduce CNV calls that are incompatible with Mendelian inheritance while still allowing de-novo CNVs. Applying this method to simulation studies and a genome-wide association study in asthma, we find that our approach significantly improves CNV calls accuracy, and reduces the Mendelian inconsistency rates and false positive genotype calls. The results were validated using qPCR experiments. Conclusions In conclusion, we have demonstrated that the use of family information can improve the quality of CNV calling and hopefully give more powerful association test of CNVs.http://deepblue.lib.umich.edu/bitstream/2027.42/112374/1/12859_2012_Article_5896.pd

Deep Blue Documents

Small sample properties of rare variant analysis methods

Author: Ionita-Laza Iuliana
Kim Taebeom
Niu Jiangong
Shete Sanjay
Swartz Michael
Yu Robert
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2014
Field of study

We are now well into the sequencing era of genetic analysis, and methods to investigate rare variants associated with disease remain in high demand. Currently, the more common rare variant analysis methods are burden tests and variance component tests. This report introduces a burden test known as the modified replication based sum statistic and evaluates its performance, and the performance of other common burden and variance component tests under the setting of a small sample size (103 total cases and controls) using the Genetic Analysis Workshop 18 simulated data with complete knowledge of the simulation model. Specifically we look at the variable threshold sum statistic, replication-based sum statistics, the C-alpha, and sequence kernel association test. Using minor allele frequency thresholds of less than 0.05, we find that the modified replication based sum statistic is competitive with all methods and that using 103 individuals leads to all methods being vastly underpowered. Much larger sample sizes are needed to confidently find truly associated genes

Springer - Publisher Connector