309 research outputs found

    Optimizing Illumina next-generation sequencing library preparation for extremely AT-biased genomes.

    Get PDF
    BAckground: Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences. Results: We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates. Conclusion: We have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of DNA starting material

    Efficient depletion of host DNA contamination in malaria clinical sequencing.

    Get PDF
    The cost of whole-genome sequencing (WGS) is decreasing rapidly as next-generation sequencing technology continues to advance, and the prospect of making WGS available for public health applications is becoming a reality. So far, a number of studies have demonstrated the use of WGS as an epidemiological tool for typing and controlling outbreaks of microbial pathogens. Success of these applications is hugely dependent on efficient generation of clean genetic material that is free from host DNA contamination for rapid preparation of sequencing libraries. The presence of large amounts of host DNA severely affects the efficiency of characterizing pathogens using WGS and is therefore a serious impediment to clinical and epidemiological sequencing for health care and public health applications. We have developed a simple enzymatic treatment method that takes advantage of the methylation of human DNA to selectively deplete host contamination from clinical samples prior to sequencing. Using malaria clinical samples with over 80% human host DNA contamination, we show that the enzymatic treatment enriches Plasmodium falciparum DNA up to ∼9-fold and generates high-quality, nonbiased sequence reads covering >98% of 86,158 catalogued typeable single-nucleotide polymorphism loci

    Ethical Data Release in Genome-Wide Association Studies in Developing Countries

    Get PDF
    Michael Parker and colleagues discuss the ethical issues associated with data release from genome-wide association studies in developing countries

    The ethics of sustainable genomic research in Africa

    Get PDF

    Candidate malaria susceptibility/protective SNPs in hospital and population-based studies: the effect of sub-structuring

    Get PDF
    Background: Populations of East Africa including Sudan, exhibit some of the highest indices of genetic diversity in the continent and worldwide. The current study aims to address the possible impact of population structure and population stratification on the outcome of case-control association-analysis of malaria candidate-genes in different Sudanese populations, where the pronounced genetic heterogeneity becomes a source of concern for the potential effect on the studies outcome. Methods: A total of 72 SNPs were genotyped using the Sequenom iPLEX Gold assay in 449 DNA samples that included; cases and controls from two village populations, malaria patients and out-patients from the area of Sinnar and additional controls consisting of healthy Nilo-Saharan speaking individuals. The population substructure was estimated using the Structure 2.2 programme. Results & Discussion: The Hardy-Weinberg Equilibrium values were generally within expectation in Hausa and Massalit. However, in the Sinnar area there was a notable excess of homozygosity, which was attributed to the Whalund effect arising from population amalgamation within the sample. The programme STRUCTURE revealed a division of both Hausa and Massalit into two substructures with the partition in Hausa more pronounced than in Massalit; in Sinnar there was no defined substructure. More than 25 of the 72 SNPs assayed were informative in all areas. Some important SNPs were not differentially distributed between malaria cases and controls, including SNPs in CD36 and NOS2. A number of SNPs showed significant p-values for differences in distribution of genotypes between cases and controls including: rs1805015 (in IL4R1) (P=0001), rs17047661 (in CR1) (P=0.02) and rs1800750 (TNF-376) (P=0.01) in the hospital samples; rs1050828 (G6PD+202) (P=0.02) and rs1800896 (IL10-1082) (P=0.04) in Massalit and rs2243250 (IL4-589) (P=0.04) in Hausa. Conclusions: The difference in population structure partly accounts for some of these significant associations, and the strength of association proved to be sensitive to all levels of sub-structuring whether in the hospital or population-based study

    Further evidence supporting a role for gs signal transduction in severe malaria pathogenesis.

    Get PDF
    With the functional demonstration of a role in erythrocyte invasion by Plasmodium falciparum parasites, implications in the aetiology of common conditions that prevail in individuals of African origin, and a wealth of pharmacological knowledge, the stimulatory G protein (Gs) signal transduction pathway presents an exciting target for anti-malarial drug intervention. Having previously demonstrated a role for the G-alpha-s gene, GNAS, in severe malaria disease, we sought to identify other important components of the Gs pathway. Using meta-analysis across case-control and family trio (affected child and parental controls) studies of severe malaria from The Gambia and Malawi, we sought evidence of association in six Gs pathway candidate genes: adenosine receptor 2A (ADORA2A) and 2B (ADORA2B), beta-adrenergic receptor kinase 1 (ADRBK1), adenylyl cyclase 9 (ADCY9), G protein beta subunit 3 (GNB3), and regulator of G protein signalling 2 (RGS2). Our study amassed a total of 2278 cases and 2364 controls. Allele-based models of association were investigated in all genes, and genotype and haplotype-based models were investigated where significant allelic associations were identified. Although no significant associations were observed in the other genes, several were identified in ADORA2A. The most significant association was observed at the rs9624472 locus, where the G allele (approximately 20% frequency) appeared to confer enhanced risk to severe malaria [OR = 1.22 (1.09-1.37); P = 0.001]. Further investigation of the ADORA2A gene region is required to validate the associations identified here, and to identify and functionally characterize the responsible causal variant(s). Our results provide further evidence supporting a role of the Gs signal transduction pathway in the regulation of severe malaria, and request further exploration of this pathway in future studies

    Whole genome sequencing of Plasmodium falciparum from dried blood spots using selective whole genome amplification

    Get PDF
    BACKGROUND: Translating genomic technologies into healthcare applications for the malaria parasite Plasmodium falciparum has been limited by the technical and logistical difficulties of obtaining high quality clinical samples from the field. Sampling by dried blood spot (DBS) finger-pricks can be performed safely and efficiently with minimal resource and storage requirements compared with venous blood (VB). Here, the use of selective whole genome amplification (sWGA) to sequence the P. falciparum genome from clinical DBS samples was evaluated, and the results compared with current methods that use leucodepleted VB. METHODS: Parasite DNA with high (>95%) human DNA contamination was selectively amplified by Phi29 polymerase using short oligonucleotide probes of 8-12 mers as primers. These primers were selected on the basis of their differential frequency of binding the desired (P. falciparum DNA) and contaminating (human) genomes. RESULTS: Using sWGA method, clinical samples from 156 malaria patients, including 120 paired samples for head-to-head comparison of DBS and leucodepleted VB were sequenced. Greater than 18-fold enrichment of P. falciparum DNA was achieved from DBS extracts. The parasitaemia threshold to achieve >5× coverage for 50% of the genome was 0.03% (40 parasites per 200 white blood cells). Over 99% SNP concordance between VB and DBS samples was achieved after excluding missing calls. CONCLUSION: The sWGA methods described here provide a reliable and scalable way of generating P. falciparum genome sequence data from DBS samples. The current data indicate that it will be possible to get good quality sequence on most if not all drug resistance loci from the majority of symptomatic malaria patients. This technique overcomes a major limiting factor in P. falciparum genome sequencing from field samples, and paves the way for large-scale epidemiological applications

    Comparison of genomic signatures of selection on Plasmodium falciparum between different regions of a country with high malaria endemicity.

    Get PDF
    BACKGROUND: Genome wide sequence analyses of malaria parasites from widely separated areas of the world have identified contrasting population structures and signatures of selection. To compare relatively closely situated but ecologically contrasting regions within an endemic African country, population samples of Plasmodium falciparum clinical isolates were collected in Ghana from Kintampo in the central forest-savannah area, and Navrongo in a drier savannah area ~350 km to the north with more seasonally-restricted transmission. Parasite DNA was sequenced and paired-end reads mapped to the P. falciparum reference genome. RESULTS: High coverage genome wide sequence data for 85 different clinical isolates enabled analysis of 121,712 single nucleotide polymorphisms (SNPs). The local populations had similar proportions of mixed genotype infections, similar SNP allele frequency distributions, and eleven chromosomal regions had elevated integrated haplotype scores (|iHS|) in both. A between-population Rsb metric comparing extended haplotype homozygosity indicated a stronger signal within Kintampo for one of these regions (on chromosome 14) and in Navrongo for two of these regions (on chromosomes 10 and 13). At least one gene in each of these identified regions is a potential target of locally varying selection. The candidates include genes involved in parasite development in mosquitoes, members of variant-expressed multigene families, and a leading vaccine-candidate target of immunity. CONCLUSIONS: Against a background of very similar population structure and selection signatures in the P. falciparum populations of Ghana, three narrow genomic regions showed evidence indicating local differences in historical timing or intensity of selection. Sampling of closely situated populations across heterogeneous environments has potential to refine the mapping of important loci under temporally or spatially varying selection
    • …
    corecore