76 research outputs found

    To publish or not to publish.......

    Get PDF

    SNPLims: a data management system for genome wide association studies

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Recent progresses in genotyping technologies allow the generation high-density genetic maps using hundreds of thousands of genetic markers for each DNA sample. The availability of this large amount of genotypic data facilitates the whole genome search for genetic basis of diseases.</p> <p>We need a suitable information management system to efficiently manage the data flow produced by whole genome genotyping and to make it available for further analyses.</p> <p>Results</p> <p>We have developed an information system mainly devoted to the storage and management of SNP genotype data produced by the Illumina platform from the raw outputs of genotyping into a relational database.</p> <p>The relational database can be accessed in order to import any existing data and export user-defined formats compatible with many different genetic analysis programs.</p> <p>After calculating family-based or case-control association study data, the results can be imported in SNPLims. One of the main features is to allow the user to rapidly identify and annotate statistically relevant polymorphisms from the large volume of data analyzed. Results can be easily visualized either graphically or creating ASCII comma separated format output files, which can be used as input to further analyses.</p> <p>Conclusions</p> <p>The proposed infrastructure allows to manage a relatively large amount of genotypes for each sample and an arbitrary number of samples and phenotypes. Moreover, it enables the users to control the quality of the data and to perform the most common screening analyses and identify genes that become β€œcandidate” for the disease under consideration.</p

    Variation in LPA Is Associated with Lp(a) Levels in Three Populations from the Third National Health and Nutrition Examination Survey

    Get PDF
    The distribution of lipoprotein(a) [Lp(a)] levels can differ dramatically across diverse racial/ethnic populations. The extent to which genetic variation in LPA can explain these differences is not fully understood. To explore this, 19 LPA tagSNPs were genotyped in 7,159 participants from the Third National Health and Nutrition Examination Survey (NHANES III). NHANES III is a diverse population-based survey with DNA samples linked to hundreds of quantitative traits, including serum Lp(a). Tests of association between LPA variants and transformed Lp(a) levels were performed across the three different NHANES subpopulations (non-Hispanic whites, non-Hispanic blacks, and Mexican Americans). At a significance threshold of p<0.0001, 15 of the 19 SNPs tested were strongly associated with Lp(a) levels in at least one subpopulation, six in at least two subpopulations, and none in all three subpopulations. In non-Hispanic whites, three variants were associated with Lp(a) levels, including previously known rs6919246 (pβ€Š=β€Š1.18Γ—10βˆ’30). Additionally, 12 and 6 variants had significant associations in non-Hispanic blacks and Mexican Americans, respectively. The additive effects of these associated alleles explained up to 11% of the variance observed for Lp(a) levels in the different racial/ethnic populations. The findings reported here replicate previous candidate gene and genome-wide association studies for Lp(a) levels in European-descent populations and extend these findings to other populations. While we demonstrate that LPA is an important contributor to Lp(a) levels regardless of race/ethnicity, the lack of generalization of associations across all subpopulations suggests that specific LPA variants may be contributing to the observed Lp(a) between-population variance

    Genotype at the P554L Variant of the Hexose-6 Phosphate Dehydrogenase Gene Is Associated with Carotid Intima-Medial Thickness

    Get PDF
    Objective: The combined thickness of the intima and media of the carotid artery (carotid intima-medial thickness, CIMT) is associated with cardiovascular disease and stroke. Previous studies indicate that carotid intima-medial thickness is a significantly heritable phenotype, but the responsible genes are largely unknown. Hexose-6 phosphate dehydrogenase (H6PDH) is a microsomal enzyme whose activity regulates corticosteroid metabolism in the liver and adipose tissue; variability in measures of corticosteroid metabolism within the normal range have been associated with risk factors for cardiovascular disease. We performed a genetic association study in 854 members of 224 families to assess the relationship between polymorphisms in the gene coding for hexose-6 phosphate dehydrogenase (H6PD) and carotid intima-medial thickness. Methods: Families were ascertained via a hypertensive proband. CIMT was measured using B-mode ultrasound. Single nucleotide polymorphisms (SNPs) tagging common variation in the H6PD gene were genotyped. Association was assessed following adjustment for significant covariates including "classical" cardiovascular risk factors. Functional studies to determine the effect of particular SNPs on H6PDH were performed. Results: There was evidence of association between the single nucleotide polymorphism rs17368528 in exon five of the H6PD gene, which encodes an amino-acid change from proline to leucine in the H6PDH protein, and mean carotid intima-medial thickness (p = 0.00065). Genotype was associated with a 5% (or 0.04 mm) higher mean carotid intima-medial thickness measurement per allele, and determined 2% of the population variability in the phenotype. Conclusions: Our results suggest a novel role for the H6PD gene in atherosclerosis susceptibility

    Discovery and replication of SNP-SNP interactions for quantitative lipid traits in over 60,000 individuals

    Get PDF
    Background The genetic etiology of human lipid quantitative traits is not fully elucidated, and interactions between variants may play a role. We performed a gene-centric interaction study for four different lipid traits: low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), total cholesterol (TC), and triglycerides (TG). Results Our analysis consisted of a discovery phase using a merged dataset of five different cohorts (n = 12,853 to n = 16,849 depending on lipid phenotype) and a replication phase with ten independent cohorts totaling up to 36,938 additional samples. Filters are often applied before interaction testing to correct for the burden of testing all pairwise interactions. We used two different filters: 1. A filter that tested only single nucleotide polymorphisms (SNPs) with a main effect of p < 0.001 in a previous association study. 2. A filter that only tested interactions identified by Biofilter 2.0. Pairwise models that reached an interaction significance level of p < 0.001 in the discovery dataset were tested for replication. We identified thirteen SNP-SNP models that were significant in more than one replication cohort after accounting for multiple testing. Conclusions These results may reveal novel insights into the genetic etiology of lipid levels. Furthermore, we developed a pipeline to perform a computationally efficient interaction analysis with multi-cohort replication

    Neural networks for modeling gene-gene interactions in association studies

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Our aim is to investigate the ability of neural networks to model different two-locus disease models. We conduct a simulation study to compare neural networks with two standard methods, namely logistic regression models and multifactor dimensionality reduction. One hundred data sets are generated for each of six two-locus disease models, which are considered in a low and in a high risk scenario. Two models represent independence, one is a multiplicative model, and three models are epistatic. For each data set, six neural networks (with up to five hidden neurons) and five logistic regression models (the null model, three main effect models, and the full model) with two different codings for the genotype information are fitted. Additionally, the multifactor dimensionality reduction approach is applied.</p> <p>Results</p> <p>The results show that neural networks are more successful in modeling the structure of the underlying disease model than logistic regression models in most of the investigated situations. In our simulation study, neither logistic regression nor multifactor dimensionality reduction are able to correctly identify biological interaction.</p> <p>Conclusions</p> <p>Neural networks are a promising tool to handle complex data situations. However, further research is necessary concerning the interpretation of their parameters.</p

    Genetic Profiling Using Genome-Wide Significant Coronary Artery Disease Risk Variants Does Not Improve the Prediction of Subclinical Atherosclerosis: The Cardiovascular Risk in Young Finns Study, the Bogalusa Heart Study and the Health 2000 Survey – A Meta-Analysis of Three Independent Studies

    Get PDF
    Background Genome-wide association studies (GWASs) have identified a large number of variants (SNPs) associating with an increased risk of coronary artery disease (CAD). Recently, the CARDIoGRAM consortium published a GWAS based on the largest study population so far. They successfully replicated twelve already known associations and discovered thirteen new SNPs associating with CAD. We examined whether the genetic profiling of these variants improves prediction of subclinical atherosclerosis – i.e., carotid intima-media thickness (CIMT) and carotid artery elasticity (CAE) – beyond classical risk factors. Subjects and Methods We genotyped 24 variants found in a population of European ancestry and measured CIMT and CAE in 2001 and 2007 from 2,081, and 2,015 subjects (aged 30–45 years in 2007) respectively, participating in the Cardiovascular Risk in Young Finns Study (YFS). The Bogalusa Heart Study (BHS; n = 1179) was used as a replication cohort (mean age of 37.5). For additional replication, a sub-sample of 5 SNPs was genotyped for 1,291 individuals aged 46–76 years participating in the Health 2000 population survey. We tested the impact of genetic risk score (GRS24SNP/CAD) calculated as a weighted (by allelic odds ratios for CAD) sum of CAD risk alleles from the studied 24 variants on CIMT, CAE, the incidence of carotid atherosclerosis and the progression of CIMT and CAE during a 6-year follow-up. Results CIMT or CAE did not significantly associate with GRS24SNP/CAD before or after adjusting for classical CAD risk factors (p>0.05 for all) in YFS or in the BHS. CIMT and CAE associated with only one SNP each in the YFS. The findings were not replicated in the replication cohorts. In the meta-analysis CIMT or CAE did not associate with any of the SNPs. Conclusion Genetic profiling, by using known CAD risk variants, should not improve risk stratification for subclinical atherosclerosis beyond conventional risk factors among healthy young adults.Public Library of Science open acces

    Genome-Wide Association Studies in Atherosclerosis

    Get PDF
    Cardiovascular disease remains the major cause of worldwide morbidity and mortality. Its pathophysiology is complex and multifactorial. Because the phenotype of cardiovascular disease often shows a marked heritable pattern, it is likely that genetic factors play an important role. In recent years, large genome-wide association studies have been conducted to decipher the molecular mechanisms underlying this heritable and prevalent phenotype. The emphasis of this review is on the recently identified 17 susceptibility loci for coronary artery disease. Implications of their discovery for biology and clinical medicine are discussed. A description of the landscape of human genetics in the near future in the context of next-generation sequence technologies is provided at the conclusion of this review

    Population genomics of cardiometabolic traits: design of the University College London-London School of Hygiene and Tropical Medicine-Edinburgh-Bristol (UCLEB) Consortium.

    Get PDF
    Substantial advances have been made in identifying common genetic variants influencing cardiometabolic traits and disease outcomes through genome wide association studies. Nevertheless, gaps in knowledge remain and new questions have arisen regarding the population relevance, mechanisms, and applications for healthcare. Using a new high-resolution custom single nucleotide polymorphism (SNP) array (Metabochip) incorporating dense coverage of genomic regions linked to cardiometabolic disease, the University College-London School-Edinburgh-Bristol (UCLEB) consortium of highly-phenotyped population-based prospective studies, aims to: (1) fine map functionally relevant SNPs; (2) precisely estimate individual absolute and population attributable risks based on individual SNPs and their combination; (3) investigate mechanisms leading to altered risk factor profiles and CVD events; and (4) use Mendelian randomisation to undertake studies of the causal role in CVD of a range of cardiovascular biomarkers to inform public health policy and help develop new preventative therapies

    Patterns of Ancestry, Signatures of Natural Selection, and Genetic Association with Stature in Western African Pygmies

    Get PDF
    African Pygmy groups show a distinctive pattern of phenotypic variation, including short stature, which is thought to reflect past adaptation to a tropical environment. Here, we analyze Illumina 1M SNP array data in three Western Pygmy populations from Cameroon and three neighboring Bantu-speaking agricultural populations with whom they have admixed. We infer genome-wide ancestry, scan for signals of positive selection, and perform targeted genetic association with measured height variation. We identify multiple regions throughout the genome that may have played a role in adaptive evolution, many of which contain loci with roles in growth hormone, insulin, and insulin-like growth factor signaling pathways, as well as immunity and neuroendocrine signaling involved in reproduction and metabolism. The most striking results are found on chromosome 3, which harbors a cluster of selection and association signals between approximately 45 and 60 Mb. This region also includes the positional candidate genes DOCK3, which is known to be associated with height variation in Europeans, and CISH, a negative regulator of cytokine signaling known to inhibit growth hormone-stimulated STAT5 signaling. Finally, pathway analysis for genes near the strongest signals of association with height indicates enrichment for loci involved in insulin and insulin-like growth factor signaling
    • …
    corecore