13 research outputs found
Recommended from our members
Genomic, patterns of selection and differentiation in African populations and implications for mapping disease association
The main objective of this thesis is to gain a better understanding of genomic patterns of natural selection and population differentiation in Africa, where there is great genetic diversity, and of the implications for genetic mapping of complex diseases.
I began by studying two neighbouring villages in eastern Sudan that are of different ethnicity, Hausa and Masalit, and that appear to have different susceptibility to malaria and visceral leishmaniasis (VL). Specifically, I investigated patterns of linkage disequilibrium (LD) and haplotypic signals of positive selection in the 5q31 genomic region which contains immune genes that have been implicated in susceptibility to malaria and VL.
In my first analysis, by genotyping 34 single nucleotide polymorphisms (SNPs) in the 5q31 region, I did not find signals of selection or population differentiation between the Hausa and Masalit using available statistical methods. I conceived the idea that patterns of LD might provide a more sensitive test of population differentiation, and I developed an approach for this using permutation analysis. This method revealed differentiation between the Hausa, the Masalit and other African ethnic groups.
To better understand signals of selection, I next studied a region of the genome associated with a known malaria resistance factor, the haemoglobin S (HbS) variant of the HBB gene. By genotyping 26 SNPs in the region of the HBB gene, I observed a haplotype that extended in excess of 1 Mb, despite being at high frequency and spanning several recombinational hotspots. This long haplotype carried the HbS allele but, importantly, it could be readily detected without typing the HbS variant.
Building on this observation, I designed a new method to screen the whole genome for long haplotypes that might be signals of selection, and developed a software programme to implement this method. I validated this method using haplotypic data for the Yoruba generated by the HapMap project and complemented by additional SNP data that I generated on HapMap cell lines, and found that the HbS allele resides on a haplotype that extends to 1.2 Mb, and is at strikingly high frequency compared to other haplotypes of similar length on the same chromosome.
Next I applied this method to a large family-based association study of severe malaria in The Gambia, and identified several novel genomic regions with unusually long haplotypes of high frequency. These included a number of regions that may be associated with resistance to severe malaria, and which merit further investigation
Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations
Background: The sickle (βs) mutation in the beta-globin gene (HBB) occurs on five "classical" βs haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the βs allele - a consequence of protection from severe malarial infection afforded by heterozygotes - has been associated with a high degree of extended haplotype similarity. The relationship between classical βs haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical βs haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). Results: The most common βs sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P < 0.001), and was independent of marker choice and marker density. Among the Yoruba, Benin haplotypes were highly conserved, with very strong linkage disequilibrium (LD) extending a megabase across the βs mutation. Conclusion: Two different classical βs haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess if 400 kb. Although the multicentric geographic distribution of βs haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence outcomes in sickle cell disease, could lie considerable distances away from β-globin
Candidate malaria susceptibility/protective SNPs in hospital and population-based studies: the effect of sub-structuring
Background: Populations of East Africa including Sudan, exhibit some of the highest indices of genetic diversity in the continent and worldwide. The current study aims to address the possible impact of population structure and population stratification on the outcome of case-control association-analysis of malaria candidate-genes in different Sudanese populations, where the pronounced genetic heterogeneity becomes a source of concern for the potential effect on the studies outcome. Methods: A total of 72 SNPs were genotyped using the Sequenom iPLEX Gold assay in 449 DNA samples that included; cases and controls from two village populations, malaria patients and out-patients from the area of Sinnar and additional controls consisting of healthy Nilo-Saharan speaking individuals. The population substructure was estimated using the Structure 2.2 programme. Results & Discussion: The Hardy-Weinberg Equilibrium values were generally within expectation in Hausa and Massalit. However, in the Sinnar area there was a notable excess of homozygosity, which was attributed to the Whalund effect arising from population amalgamation within the sample. The programme STRUCTURE revealed a division of both Hausa and Massalit into two substructures with the partition in Hausa more pronounced than in Massalit; in Sinnar there was no defined substructure. More than 25 of the 72 SNPs assayed were informative in all areas. Some important SNPs were not differentially distributed between malaria cases and controls, including SNPs in CD36 and NOS2. A number of SNPs showed significant p-values for differences in distribution of genotypes between cases and controls including: rs1805015 (in IL4R1) (P=0001), rs17047661 (in CR1) (P=0.02) and rs1800750 (TNF-376) (P=0.01) in the hospital samples; rs1050828 (G6PD+202) (P=0.02) and rs1800896 (IL10-1082) (P=0.04) in Massalit and rs2243250 (IL4-589) (P=0.04) in Hausa. Conclusions: The difference in population structure partly accounts for some of these significant associations, and the strength of association proved to be sensitive to all levels of sub-structuring whether in the hospital or population-based study
Y Chromosome Lineage- and Village-Specific Genes on Chromosomes 1p22 and 6q27 Control Visceral Leishmaniasis in Sudan
Familial clustering and ethnic differences suggest that visceral leishmaniasis caused by Leishmania donovani is under genetic control. A recent genome scan provided evidence for a major susceptibility gene on Chromosome 22q12 in the Aringa ethnic group in Sudan. We now report a genome-wide scan using 69 families with 173 affected relatives from two villages occupied by the related Masalit ethnic group. A primary ten-centimorgan scan followed by refined mapping provided evidence for major loci at 1p22 (LOD score 5.65; nominal p = 1.72 × 10(−7); empirical p < 1 × 10(−5); λ(S) = 5.1) and 6q27 (LOD score 3.74; nominal p = 1.68 × 10(−5); empirical p < 1 × 10(−4); λ(S) = 2.3) that were Y chromosome–lineage and village-specific. Neither village supported a visceral leishmaniasis susceptibility gene on 22q12. The results suggest strong lineage-specific genes due to founder effect and consanguinity in these recently immigrant populations. These chance events in ethnically uniform African populations provide a powerful resource in the search for genes and mechanisms that regulate this complex disease
Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations
Abstract Background The sickle (βs) mutation in the beta-globin gene (HBB) occurs on five "classical" βs haplotype backgrounds in ethnic groups of African ancestry. Strong selection in favour of the βs allele – a consequence of protection from severe malarial infection afforded by heterozygotes – has been associated with a high degree of extended haplotype similarity. The relationship between classical βs haplotypes and long-range haplotype similarity may have both anthropological and clinical implications, but to date has not been explored. Here we evaluate the haplotype similarity of classical βs haplotypes over 400 kb in population samples from Jamaica, The Gambia, and among the Yoruba of Nigeria (Hapmap YRI). Results The most common βs sub-haplotype among Jamaicans and the Yoruba was the Benin haplotype, while in The Gambia the Senegal haplotype was observed most commonly. Both subtypes exhibited a high degree of long-range haplotype similarity extending across approximately 400 kb in all three populations. This long-range similarity was significantly greater than that seen for other haplotypes sampled in these populations (P s mutation. Conclusion Two different classical βs haplotypes, sampled from different populations, exhibit comparable and extensive long-range haplotype similarity and strong LD. This LD extends across the adjacent recombination hotspot, and is discernable at distances in excess of 400 kb. Although the multi-centric geographic distribution of βs haplotypes indicates strong subdivision among early Holocene sub-Saharan populations, we find no evidence that selective pressures imposed by falciparum malaria varied in intensity or timing between these subpopulations. Our observations also suggest that cis-acting loci, which may influence outcomes in sickle cell disease, could lie considerable distances away from β-globin.</p
Detecting and characterizing genomic signatures of positive selection in global populations.
Natural selection is a significant force that shapes the architecture of the human genome and introduces diversity across global populations. The question of whether advantageous mutations have arisen in the human genome as a result of single or multiple mutation events remains unanswered except for the fact that there exist a handful of genes such as those that confer lactase persistence, affect skin pigmentation, or cause sickle cell anemia. We have developed a long-range-haplotype method for identifying genomic signatures of positive selection to complement existing methods, such as the integrated haplotype score (iHS) or cross-population extended haplotype homozygosity (XP-EHH), for locating signals across the entire allele frequency spectrum. Our method also locates the founder haplotypes that carry the advantageous variants and infers their corresponding population frequencies. This presents an opportunity to systematically interrogate the whole human genome whether a selection signal shared across different populations is the consequence of a single mutation process followed subsequently by gene flow between populations or of convergent evolution due to the occurrence of multiple independent mutation events either at the same variant or within the same gene. The application of our method to data from 14 populations across the world revealed that positive-selection events tend to cluster in populations of the same ancestry. Comparing the founder haplotypes for events that are present across different populations revealed that convergent evolution is a rare occurrence and that the majority of shared signals stem from the same evolutionary event