39 research outputs found

    PCA-Correlated SNPs for Structure Identification in Worldwide Human Populations

    Existing methods to ascertain small sets of markers for the identification of human population structure require prior knowledge of individual ancestry. Based on Principal Components Analysis (PCA), and recent results in theoretical computer science, we present a novel algorithm that, applied to genome-wide data, selects small subsets of SNPs (PCA-correlated SNPs) to reproduce the structure found by PCA on the complete dataset, without use of ancestry information. Evaluating our method on a previously described dataset (10,805 SNPs, 11 populations), we demonstrate that a very small set of PCA-correlated SNPs can be effectively employed to assign individuals to particular continents or populations, using a simple clustering algorithm. We validate our methods on the HapMap populations and achieve perfect intercontinental differentiation with 14 PCA-correlated SNPs. The Chinese and Japanese populations can be easily differentiated using fewer than 100 PCA-correlated SNPs ascertained after evaluating 1.7 million SNPs from HapMap. We show that, in general, structure informative SNPs are not portable across geographic regions. However, we manage to identify a general set of 50 PCA-correlated SNPs that effectively assigns individuals to one of nine different populations. Compared to analysis with the measure of informativeness, our methods, although unsupervised, achieved similar results. We proceed to demonstrate that our algorithm can be effectively used for the analysis of admixed populations without having to trace the origin of individuals. Analyzing a Puerto Rican dataset (192 individuals, 7,257 SNPs), we show that PCA-correlated SNPs can be used to successfully predict structure and ancestry proportions. We subsequently validate these SNPs for structure identification in an independent Puerto Rican dataset. The algorithm that we introduce runs in seconds and can be easily applied to large genome-wide datasets, facilitating the identification of population substructure, stratification assessment in multi-stage whole-genome association studies, and the study of demographic history in human populations.
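The abstract above does not spell out how SNPs are scored, so the following is only a minimal sketch of one plausible reading: rank each SNP by its summed squared loading on the leading principal components and keep the top few. The function name `pca_correlated_snps`, the scoring rule, and the toy two-population data are all assumptions made for illustration, not the authors' published procedure.

```python
import numpy as np

def pca_correlated_snps(genotypes, n_components, n_select):
    """Rank SNPs (columns) by how strongly they load on the top principal components.

    Hypothetical helper: the scoring rule is an assumption, not the paper's method.
    """
    # Center each SNP so the SVD corresponds to PCA on the genotype matrix.
    centered = genotypes - genotypes.mean(axis=0)
    # Rows of vt are the principal axes expressed in SNP space.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    # Score per SNP: squared loadings summed over the leading components.
    scores = (vt[:n_components] ** 2).sum(axis=0)
    # Indices of the highest-scoring SNPs, best first.
    return np.argsort(scores)[::-1][:n_select]

# Toy data (assumed): two populations that differ only at the first 10 SNPs.
rng = np.random.default_rng(0)
n_per_pop, n_snps, n_informative = 50, 200, 10
freq_a = np.full(n_snps, 0.5)
freq_b = freq_a.copy()
freq_a[:n_informative] = 0.1
freq_b[:n_informative] = 0.9
# Genotypes are counts of the reference allele (0, 1 or 2) per individual and SNP.
pop_a = rng.binomial(2, freq_a, size=(n_per_pop, n_snps))
pop_b = rng.binomial(2, freq_b, size=(n_per_pop, n_snps))
genotypes = np.vstack([pop_a, pop_b]).astype(float)

selected = pca_correlated_snps(genotypes, n_components=1, n_select=n_informative)
print(sorted(selected.tolist()))
```

In this toy setting no population labels are passed to the function, which mirrors the unsupervised framing of the abstract: the selected indices can be compared afterwards with the SNPs that were simulated to differ between the two groups.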

    Ancestry-related assortative mating in Latino populations

    Examination of ancestry-informative genetic markers shows that Puerto Rican and Mexican populations have shown strong assortative mating that continues to this day

    History Shaped the Geographic Distribution of Genomic Admixture on the Island of Puerto Rico

    Contemporary genetic variation among Latin American human groups reflects population migrations shaped by complex historical, social and economic factors. Consequently, admixture patterns may vary by geographic regions ranging from countries to neighborhoods. We examined the geographic variation of admixture across the island of Puerto Rico and the degree to which it could be explained by historic and social events. We analyzed a census-based sample of 642 Puerto Rican individuals that were genotyped for 93 ancestry informative markers (AIMs) to estimate African, European and Native American ancestry. Socioeconomic status (SES) data and geographic location were obtained for each individual. There was significant geographic variation of ancestry across the island. In particular, African ancestry demonstrated a decreasing East to West gradient that was partially explained by historical factors linked to the colonial sugar plantation system. SES also demonstrated a parallel decreasing cline from East to West. However, at a local level, SES and African ancestry were negatively correlated. European ancestry was strongly negatively correlated with African ancestry and therefore showed patterns complementary to African ancestry. By contrast, Native American ancestry showed little variation across the island and across individuals and appears to have played little social role historically. The observed geographic distributions of SES and genetic variation relate to historical social events and mating patterns, and have substantial implications for the design of studies in the recently admixed Puerto Rican population. More generally, our results demonstrate the importance of incorporating social and geographic data with genetics when studying contemporary admixed populations.

    CAG repeat instability at SCA2 locus: anchoring CAA interruptions and linked single nucleotide polymorphisms

    Spinocerebellar ataxia 2 (SCA2) is an autosomal dominant neurodegenerative disorder that results from the expansion of a cryptic CAG repeat within exon 1 of the SCA2 gene. The CAG repeat in normal individuals varies in length from 14 to 31 repeats and is frequently interrupted by one or more CAA triplets, whereas the expanded alleles contain a pure uninterrupted stretch of 34 to 59 CAG repeats. We have previously reported the presence of a limited pool of 'ancestral' or 'at risk' haplotypes for the expanded SCA2 alleles in the Indian population. We now report the identification of two novel single nucleotide polymorphisms (SNPs) in exon 1 of the SCA2 gene and their characterization in 215 normal and 64 expanded chromosomes. The two biallelic SNPs distinguished two haplotypes, GT and CC, each of which formed a predominant haplotype associated with normal and expanded SCA2 alleles. All the expanded alleles segregated with the CC haplotype, which otherwise was associated with only 29.3% of the normal chromosomes. CAA interspersion analysis revealed that the majority of the normal alleles with the CC haplotype were either pure or lacked the most proximal 5' CAA interruption. The repeat length variation at the SCA2 locus also appeared to be polar, with changes occurring mostly at the 5' end of the repeat. Our results demonstrate that CAA interruptions play an important role in conferring stability to the SCA2 repeat and that their absence predisposes alleles towards instability and pathological expansion. Our study also provides new haplotypes associated with SCA2 that should prove useful in further understanding the mutational history and mechanism of repeat instability at the SCA2 locus.

    CAG repeat instability at SCA2 locus: anchoring CAA interruptions and linked single nucleotide polymorphisms

    Spinocerebellar ataxia 2 (SCA2) is an autosomal dominant neurodegenerative disorder that results from the expansion of a cryptic CAG repeat within exon 1 of the SCA2 gene. The CAG repeat in normal individuals varies in length from 14 to 31 repeats and is frequently interrupted by one or more CAA triplets, whereas the expanded alleles contain a pure uninterrupted stretch of 34 to 59 CAG repeats. We have previously reported the presence of a limited pool of 'ancestral' or 'at risk' haplotypes for the expanded SCA2 alleles in the Indian population. We now report the identification of two novel single nucleotide polymorphisms (SNPs) in exon 1 of the SCA2 gene and their characterization in 215 normal and 64 expanded chromosomes. The two biallelic SNPs distinguished two haplotypes, GT and CC, each of which formed a predominant haplotype associated with normal and expanded SCA2 alleles. All the expanded alleles segregated with the CC haplotype, which otherwise was associated with only 29.3% of the normal chromosomes. CAA interspersion analysis revealed that the majority of the normal alleles with the CC haplotype were either pure or lacked the most proximal 5' CAA interruption. The repeat length variation at the SCA2 locus also appeared to be polar, with changes occurring mostly at the 5' end of the repeat. Our results demonstrate that CAA interruptions play an important role in conferring stability to the SCA2 repeat and that their absence predisposes alleles towards instability and pathological expansion. Our study also provides new haplotypes associated with SCA2 that should prove useful in further understanding the mutational history and mechanism of repeat instability at the SCA2 locus.

    Hypospadias and Genes Related to Genital Tubercle and Early Urethral Development

    Purpose: We determined whether variants in genes associated with genital tubercle (the anlage for the penis) and early urethral development were associated with hypospadias in humans. Materials and methods: We examined 293 relatively common tag single nucleotide polymorphisms in BMP4, BMP7, FGF8, FGF10, FGFR2, HOXA13, HOXD13, HOXA4, HOXB6, SRY, WT1, WTAP, SHH, GLI1, GLI2 and GLI3. The analysis included 624 cases (81 mild, 319 moderate, 209 severe, 15 undetermined severity) and 844 population based nonmalformed male controls born in California from 1990 to 2003. Results: There were 28 single nucleotide polymorphisms for which any of the comparisons (i.e. overall or for a specific severity) had a p value of less than 0.01. The homozygous variant genotypes for 4 single nucleotide polymorphisms in BMP7 were associated with at least a twofold increased risk of hypospadias regardless of severity. Five single nucleotide polymorphisms for FGF10 were associated with threefold to fourfold increased risks, regardless of severity. For 4 of them the results were restricted to whites. For GLI1, GLI2 and GLI3 there were 12 associated single nucleotide polymorphisms but results were inconsistent by severity and race/ethnicity. For SHH 1 single nucleotide polymorphism was associated with a 2.4-fold increased risk of moderate hypospadias. For WT1 6 single nucleotide polymorphisms were associated with approximately a twofold increased risk, primarily for severe hypospadias. Conclusions: This study provides evidence that single nucleotide polymorphisms in several genes that contribute to genital tubercle and early urethral development are associated with hypospadias risk.

    Canalization of the urethral plate precedes fusion of the urethral folds during male penile urethral development: the double zipper hypothesis.

    Purpose: We describe the "double zipper" mechanism of human male urethral formation, where the distal zipper opens the urethral groove through canalization of the urethral plate, and a second closing zipper follows behind and closes the urethral groove to form the tubular urethra. Materials and methods: Anonymous human fetal genital specimens were acquired and gender was determined by polymerase chain reaction of the Y chromosome. Specimens were processed for optical projection tomography, stained with E-cadherin, Ki67 and caspase 3, and imaged. Results: Eight developing male fetal specimens from 6.5 to 16.5 weeks of gestation were analyzed by optical projection tomography, and an additional 5 specimens by serial sections. Phallus length ranged from 1.3 to 3.7 mm. The urethral plate canalized into a groove with 2 epithelial edges that subsequently fused. Ki67 staining was localized to the dorsal aspect of the urethral plate. In contrast, caspase 3 staining was not observed. The entire process was completed during a 10-week period. Conclusions: The human male urethra appears to form by 2 mechanisms, an initial "opening zipper" that facilitates distal canalization of the solid urethral plate to form the urethral groove, which involves a high rate of epithelial proliferation (apoptosis not observed), and a "closing zipper" facilitating fusion of the 2 epithelial surfaces of the urethral groove, and thus extending the penile urethra distally. Improved knowledge of the molecular mechanisms of these processes is critical to understanding mechanisms of abnormal urethral development, such as hypospadias.