79 research outputs found

    Allelic spectrum of the natural variation in CRP

    Get PDF
    With the recent completion of the International HapMap Project, many tools are in hand for genetic association studies seeking to test the common variant/common disease hypothesis. In contrast, very few tools and resources are in place for genotype–phenotype studies hypothesizing that rare variation has a large impact on the phenotype of interest. To create these tools for rare variant/common disease studies, much interest is being generated towards investing in re-sequencing either large sample sizes of random chromosomes or smaller sample sizes of patients with extreme phenotypes. As a case study for rare variant discovery in random chromosomes, we have re-sequenced ~1,000 chromosomes representing diverse populations for the gene C-reactive protein (CRP). CRP is an important gene in the fields of cardiovascular and inflammation genetics, and its size (~2 kb) makes it particularly amenable medical or deep re-sequencing. With these data, we explore several issues related to the present-day candidate gene association study including the benefits of complete SNP discovery, the effects of tagSNP selection across diverse populations, and completeness of dbSNP for CRP. Also, we show that while deep re-sequencing uncovers potentially medically relevant coding SNPs, these SNPs are fleetingly rare when genotyped in a population-based survey of 7,000 Americans (NHANES III). Collectively, these data suggest that several different types re-sequencing and genotyping approaches may be required to fully understand the complete spectrum of alleles that impact human phenotypes. ELECTRONIC SUPPLEMENTARY MATERIAL: Supplementary material is available for this article at http://dx.doi.org/10.1007/s00439-006-0160-y and is accessible for authorized users

    Ancestral Informative Marker Selection and Population Structure Visualization Using Sparse Laplacian Eigenfunctions

    Get PDF
    Identification of a small panel of population structure informative markers can reduce genotyping cost and is useful in various applications, such as ancestry inference in association mapping, forensics and evolutionary theory in population genetics. Traditional methods to ascertain ancestral informative markers usually require the prior knowledge of individual ancestry and have difficulty for admixed populations. Recently Principal Components Analysis (PCA) has been employed with success to select SNPs which are highly correlated with top significant principal components (PCs) without use of individual ancestral information. The approach is also applicable to admixed populations. Here we propose a novel approach based on our recent result on summarizing population structure by graph Laplacian eigenfunctions, which differs from PCA in that it is geometric and robust to outliers. Our approach also takes advantage of the priori sparseness of informative markers in the genome. Through simulation of a ring population and the real global population sample HGDP of 650K SNPs genotyped in 940 unrelated individuals, we validate the proposed algorithm at selecting most informative markers, a small fraction of which can recover the similar underlying population structure efficiently. Employing a standard Support Vector Machine (SVM) to predict individuals' continental memberships on HGDP dataset of seven continents, we demonstrate that the selected SNPs by our method are more informative but less redundant than those selected by PCA. Our algorithm is a promising tool in genome-wide association studies and population genetics, facilitating the selection of structure informative markers, efficient detection of population substructure and ancestral inference

    IL1B-CGTC haplotype is associated with colorectal cancer in admixed individuals with increased African ancestry

    Get PDF
    Single-nucleotide polymorphisms (SNPs) in cytokine genes can affect gene expression and thereby modulate inflammation and carcinogenesis. However, the data on the association between SNPs in the interleukin 1 beta gene (IL1B) and colorectal cancer (CRC) are conflicting. We found an association between a 4-SNP haplotype block of the IL1B (-3737C/-1464G/-511T/-31C) and CRC risk, and this association was exclusively observed in individuals with a higher proportion of African ancestry, such as individuals from the Coastal Colombian region (odds ratio, OR 2.06; 95% CI 1.31–3.25; p < 0.01). Moreover, a significant interaction between this CRC risk haplotype and local African ancestry dosage was identified in locus 2q14 (p = 0.03). We conclude that Colombian individuals with high African ancestry proportions at locus 2q14 harbour more IL1B-CGTC copies and are consequently at an increased risk of CRC. This haplotype has been previously found to increase the IL1B promoter activity and is the most frequent haplotype in African Americans. Despite of limitations in the number of samples and the lack of functional analysis to examine the effect of these haplotypes on CRC cell lines, our results suggest that inflammation and ethnicity play a major role in the modulation of CRC risk

    Admixture mapping: from paradigms of race and ethnicity to population history

    Get PDF
    Admixture mapping is a whole genome association strategy that takes advantage of population history—or genetic ancestry—to map genes for complex diseases. However, because it uses racial/ethnic groupings to examine differential disease risk, admixture mapping raises ethical and social concerns. While there has been much theoretical commentary regarding the ethical and social implications of population-based genetic research, empirical data from stakeholders most closely involved with these studies is limited. One of the first admixture mapping studies carried out was a scan for Multiple Sclerosis (MS) risk factors in an African-American population. Applying qualitative research methods, we used this example to explore developing views, experiences and perceptions of the ethical and social implications of admixture mapping and other population-based research—their value, risks and benefits, and the future prospects of the field. Additionally, we sought to understand how social and ethical risks might be mitigated, and the benefits of this research optimized. We draw on in-depth, one-on-one interviews with leading population geneticists, genome scientists, bioethicists, and African-Americans with MS. Here we present our findings from this unique group of key informants and stakeholders

    Comparison of measures of marker informativeness for ancestry and admixture mapping

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Admixture mapping is a powerful gene mapping approach for an admixed population formed from ancestral populations with different allele frequencies. The power of this method relies on the ability of ancestry informative markers (AIMs) to infer ancestry along the chromosomes of admixed individuals. In this study, more than one million SNPs from HapMap databases and simulated data have been interrogated in admixed populations using various measures of ancestry informativeness: Fisher Information Content (FIC), Shannon Information Content (SIC), F statistics (F<sub>ST</sub>), Informativeness for Assignment Measure (I<sub>n</sub>), and the Absolute Allele Frequency Differences (delta, δ). The objectives are to compare these measures of informativeness to select SNP markers for ancestry inference, and to determine the accuracy of AIM panels selected by each measure in estimating the contributions of the ancestors to the admixed population.</p> <p>Results</p> <p>F<sub>ST </sub>and I<sub>n </sub>had the highest Spearman correlation and the best agreement as measured by Kappa statistics based on deciles. Although the different measures of marker informativeness performed comparably well, analyses based on the top 1 to 10% ranked informative markers of simulated data showed that I<sub>n </sub>was better in estimating ancestry for an admixed population.</p> <p>Conclusions</p> <p>Although millions of SNPs have been identified, only a small subset needs to be genotyped in order to accurately predict ancestry with a minimal error rate in a cost-effective manner. In this article, we compared various methods for selecting ancestry informative SNPs using simulations as well as SNP genotype data from samples of admixed populations and showed that the I<sub>n </sub>measure estimates ancestry proportion (in an admixed population) with lower bias and mean square error.</p

    Haplotypes of DNA repair and cell cycle control genes, X-ray exposure, and risk of childhood acute lymphoblastic leukemia

    Get PDF
    [[abstract]]Background: Acute leukemias of childhood are a heterogeneous group of malignancies characterized by cytogenetic abnormalities, such as translocations and changes in ploidy. These abnormalities may be influenced by altered DNA repair and cell cycle control processes. Methods: We examined the association between childhood acute lymphoblastic leukemia (ALL) and 32 genes in DNA repair and cell cycle pathways using a haplotype-based approach, among 377 childhood ALL cases and 448 controls enrolled during 1995-2002. Results: We found that haplotypes in APEX1, BRCA2, ERCC2, and RAD51 were significantly associated with total ALL, while haplotypes in NBN and XRCC4, and CDKN2A were associated with structural and numerical change subtypes, respectively. In addition, we observed statistically significant interaction between exposure to 3 or more diagnostic X-rays and haplotypes of XRCC4 on risk of structural abnormality-positive childhood ALL. Conclusions: These results support a role of altered DNA repair and cell cycle processes in the risk of childhood ALL, and show that this genetic susceptibility can differ by cytogenetic subtype and may be modified by exposure to ionizing radiation. To our knowledge, our study is the first to broadly examine the DNA repair and cell cycle pathways using a haplotype approach in conjunction with X-ray exposures in childhood ALL risk. If confirmed, future studies are needed to identify specific functional SNPs in the regions of interest identified in this analysis

    Linguistic and maternal genetic diversity are not correlated in Native Mexicans

    Get PDF
    Mesoamerica, defined as the broad linguistic and cultural area from middle southern Mexico to Costa Rica, might have played a pivotal role during the colonization of the American continent. The Mesoamerican isthmus has constituted an important geographic barrier that has severely restricted gene flow between North and South America in pre-historical times. Although the Native American component has been already described in admixed Mexican populations, few studies have been carried out in native Mexican populations. In this study, we present mitochondrial DNA (mtDNA) sequence data for the first hypervariable region (HVR-I) in 477 unrelated individuals belonging to 11 different native populations from Mexico. Almost all of the Native Mexican mtDNAs could be classified into the four pan-Amerindian haplogroups (A2, B2, C1, and D1); only two of them could be allocated to the rare Native American lineage D4h3. Their haplogroup phylogenies are clearly star-like, as expected from relatively young populations that have experienced diverse episodes of genetic drift (e.g., extensive isolation, genetic drift, and founder effects) and posterior population expansions. In agreement with this observation, Native Mexican populations show a high degree of heterogeneity in their patterns of haplogroup frequencies. Haplogroup X2a was absent in our samples, supporting previous observations where this clade was only detected in the American northernmost areas. The search for identical sequences in the American continent shows that, although Native Mexican populations seem to show a closer relationship to North American populations, they cannot be related to a single geographical region within the continent. Finally, we did not find significant population structure in the maternal lineages when considering the four main and distinct linguistic groups represented in our Mexican samples (Oto-Manguean, Uto-Aztecan, Tarascan, and Mayan), suggesting that genetic divergence predates linguistic diversification in Mexico

    An African Ancestry-Specific Allele of CTLA4 Confers Protection against Rheumatoid Arthritis in African Americans

    Get PDF
    Cytotoxic T-lymphocyte associated protein 4 (CTLA4) is a negative regulator of T-cell proliferation. Polymorphisms in CTLA4 have been inconsistently associated with susceptibility to rheumatoid arthritis (RA) in populations of European ancestry but have not been examined in African Americans. The prevalence of RA in most populations of European and Asian ancestry is ∼1.0%; RA is purportedly less common in black Africans, with little known about its prevalence in African Americans. We sought to determine if CTLA4 polymorphisms are associated with RA in African Americans. We performed a 2-stage analysis of 12 haplotype tagging single nucleotide polymorphisms (SNPs) across CTLA4 in a total of 505 African American RA patients and 712 African American controls using Illumina and TaqMan platforms. The minor allele (G) of the rs231778 SNP was 0.054 in RA patients, compared to 0.209 in controls (4.462×10−26, Fisher's exact). The presence of the G allele was associated with a substantially reduced odds ratio (OR) of having RA (AG+GG genotypes vs. AA genotype, OR 0.19, 95% CI: 0.13–0.26, p = 2.4×10−28, Fisher's exact), suggesting a protective effect. This SNP is polymorphic in the African population (minor allele frequency [MAF] 0.09 in the Yoruba population), but is very rare in other groups (MAF = 0.002 in 530 Caucasians genotyped for this study). Markers associated with RA in populations of European ancestry (rs3087243 [+60C/T] and rs231775 [+49A/G]) were not replicated in African Americans. We found no confounding of association for rs231778 after stratifying for the HLA-DRB1 shared epitope, presence of anti-cyclic citrullinated peptide antibody, or degree of admixture from the European population. An African ancestry-specific genetic variant of CTLA4 appears to be associated with protection from RA in African Americans. This finding may explain, in part, the relatively low prevalence of RA in black African populations

    Y Chromosome Lineages in Men of West African Descent

    Get PDF
    The early African experience in the Americas is marked by the transatlantic slave trade from ∼1619 to 1850 and the rise of the plantation system. The origins of enslaved Africans were largely dependent on European preferences as well as the availability of potential laborers within Africa. Rice production was a key industry of many colonial South Carolina low country plantations. Accordingly, rice plantations owners within South Carolina often requested enslaved Africans from the so-called “Grain Coast” of western Africa (Senegal to Sierra Leone). Studies on the African origins of the enslaved within other regions of the Americas have been limited. To address the issue of origins of people of African descent within the Americas and understand more about the genetic heterogeneity present within Africa and the African Diaspora, we typed Y chromosome specific markers in 1,319 men consisting of 508 west and central Africans (from 12 populations), 188 Caribbeans (from 2 islands), 532 African Americans (AAs from Washington, DC and Columbia, SC), and 91 European Americans. Principal component and admixture analyses provide support for significant Grain Coast ancestry among African American men in South Carolina. AA men from DC and the Caribbean showed a closer affinity to populations from the Bight of Biafra. Furthermore, 30–40% of the paternal lineages in African descent populations in the Americas are of European ancestry. Diverse west African ancestries and sex-biased gene flow from EAs has contributed greatly to the genetic heterogeneity of African populations throughout the Americas and has significant implications for gene mapping efforts in these populations

    Dissecting the Within-Africa Ancestry of Populations of African Descent in the Americas

    Get PDF
    The ancestry of African-descended Americans is known to be drawn from three distinct populations: African, European, and Native American. While many studies consider this continental admixture, few account for the genetically distinct sources of ancestry within Africa--the continent with the highest genetic variation. Here, we dissect the within-Africa genetic ancestry of various populations of the Americas self-identified as having primarily African ancestry using uniparentally inherited mitochondrial DNA.We first confirmed that our results obtained using uniparentally-derived group admixture estimates are correlated with the average autosomal-derived individual admixture estimates (hence are relevant to genomic ancestry) by assessing continental admixture using both types of markers (mtDNA and Y-chromosome vs. ancestry informative markers). We then focused on the within-Africa maternal ancestry, mining our comprehensive database of published mtDNA variation (∼5800 individuals from 143 African populations) that helped us thoroughly dissect the African mtDNA pool. Using this well-defined African mtDNA variation, we quantified the relative contributions of maternal genetic ancestry from multiple W/WC/SW/SE (West to South East) African populations to the different pools of today's African-descended Americans of North and South America and the Caribbean.Our analysis revealed that both continental admixture and within-Africa admixture may be critical to achieving an adequate understanding of the ancestry of African-descended Americans. While continental ancestry reflects gender-specific admixture processes influenced by different socio-historical practices in the Americas, the within-Africa maternal ancestry reflects the diverse colonial histories of the slave trade. We have confirmed that there is a genetic thread connecting Africa and the Americas, where each colonial system supplied their colonies in the Americas with slaves from African colonies they controlled or that were available for them at the time. This historical connection is reflected in different relative contributions from populations of W/WC/SW/SE Africa to geographically distinct Africa-derived populations of the Americas, adding to the complexity of genomic ancestry in groups ostensibly united by the same demographic label
    corecore