116 research outputs found

    Why Are Nigeria-Cameroon Chimpanzees (Pan troglodytes ellioti) Free of SIVcpz Infection?

    Get PDF
    Abstract Simian immunodeficiency virus (SIV) naturally infects two subspecies of chimpanzee: Pan troglodytes troglodytes from Central Africa (SIVcpzPtt) and P. t. schweinfurtii from East Africa (SIVcpzPts), but is absent in P. t. verus from West Africa and appears to be absent in P. t. ellioti inhabiting Nigeria and western Cameroon. One explanation for this pattern is that P. t. troglodytes and P. t schweinfurthii may have acquired SIVcpz after their divergence from P. t. verus and P. t. ellioti. However, all of the subspecies, except P. t. verus, still occasionally exchange migrants making the absence of SIVcpz in P. t. ellioti puzzling. Sampling of P. t. ellioti has been minimal to date, particularly along the banks of the Sanaga River, where its range abuts that of P. t. troglodytes. This study had three objectives. First, we extended the sampling of SIVcpz across the range of chimpanzees north of the Sanaga River to address whether under-sampling might account for the absence of evidence for SIVcpz infection in P. t. ellioti. Second, we investigated how environmental variation is associated with the spread and prevalence of SIVcpz in the two chimpanzee subspecies inhabiting Cameroon since environmental variation has been shown to contribute to their divergence from one another. Finally, we compared the prevalence and distribution of SIVcpz with that of Simian Foamy Virus (SFV) to examine the role of ecology and behavior in shaping the distribution of diseases in wild host populations. The dataset includes previously published results on SIVcpz infection and SFVcpz as well as newly collected data, and represents over 1000 chimpanzee fecal samples from 41 locations across Cameroon. Results revealed that none of the 181 P. t. ellioti fecal samples collected across the range of P. t. ellioti tested positive for SIVcpz. In addition, species distribution models suggest that environmental variation contributes to differences in the distribution and prevalence of SIVcpz and SFVcpz. The ecological niches of these two viruses are largely non-overlapping, although stronger statistical support for this conclusion will require more sampling

    Positional Cloning of “Lisch-like”, a Candidate Modifier of Susceptibility to Type 2 Diabetes in Mice

    Get PDF
    In 404 Lepob/ob F2 progeny of a C57BL/6J (B6) x DBA/2J (DBA) intercross, we mapped a DBA-related quantitative trait locus (QTL) to distal Chr1 at 169.6 Mb, centered about D1Mit110, for diabetes-related phenotypes that included blood glucose, HbA1c, and pancreatic islet histology. The interval was refined to 1.8 Mb in a series of B6.DBA congenic/subcongenic lines also segregating for Lepob. The phenotypes of B6.DBA congenic mice include reduced β-cell replication rates accompanied by reduced β-cell mass, reduced insulin/glucose ratio in blood, reduced glucose tolerance, and persistent mild hypoinsulinemic hyperglycemia. Nucleotide sequence and expression analysis of 14 genes in this interval identified a predicted gene that we have designated “Lisch-like” (Ll) as the most likely candidate. The gene spans 62.7 kb on Chr1qH2.3, encoding a 10-exon, 646–amino acid polypeptide, homologous to Lsr on Chr7qB1 and to Ildr1 on Chr16qB3. The largest isoform of Ll is predicted to be a transmembrane molecule with an immunoglobulin-like extracellular domain and a serine/threonine-rich intracellular domain that contains a 14-3-3 binding domain. Morpholino knockdown of the zebrafish paralog of Ll resulted in a generalized delay in endodermal development in the gut region and dispersion of insulin-positive cells. Mice segregating for an ENU-induced null allele of Ll have phenotypes comparable to the B.D congenic lines. The human ortholog, C1orf32, is in the middle of a 30-Mb region of Chr1q23-25 that has been repeatedly associated with type 2 diabetes

    Initial sequencing and analysis of the human genome

    Full text link
    The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/62798/1/409860a0.pd

    Insights into hominid evolution from the gorilla genome sequence.

    Get PDF
    Gorillas are humans' closest living relatives after chimpanzees, and are of comparable importance for the study of human origins and evolution. Here we present the assembly and analysis of a genome sequence for the western lowland gorilla, and compare the whole genomes of all extant great ape genera. We propose a synthesis of genetic and fossil evidence consistent with placing the human-chimpanzee and human-chimpanzee-gorilla speciation events at approximately 6 and 10 million years ago. In 30% of the genome, gorilla is closer to human or chimpanzee than the latter are to each other; this is rarer around coding genes, indicating pervasive selection throughout great ape evolution, and has functional consequences in gene expression. A comparison of protein coding genes reveals approximately 500 genes showing accelerated evolution on each of the gorilla, human and chimpanzee lineages, and evidence for parallel acceleration, particularly of genes involved in hearing. We also compare the western and eastern gorilla species, estimating an average sequence divergence time 1.75 million years ago, but with evidence for more recent genetic exchange and a population bottleneck in the eastern species. The use of the genome sequence in these and future analyses will promote a deeper understanding of great ape biology and evolution

    A second generation human haplotype map of over 3.1 million SNPs

    Full text link
    We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r(2) of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r(2) of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/62863/1/nature06258.pd

    Comparative performances of machine learning methods for classifying Crohn Disease patients using genome-wide genotyping data

    Get PDF
    Abstract: Crohn Disease (CD) is a complex genetic disorder for which more than 140 genes have been identified using genome wide association studies (GWAS). However, the genetic architecture of the trait remains largely unknown. The recent development of machine learning (ML) approaches incited us to apply them to classify healthy and diseased people according to their genomic information. The Immunochip dataset containing 18,227 CD patients and 34,050 healthy controls enrolled and genotyped by the international Inflammatory Bowel Disease genetic consortium (IIBDGC) has been re-analyzed using a set of ML methods: penalized logistic regression (LR), gradient boosted trees (GBT) and artificial neural networks (NN). The main score used to compare the methods was the Area Under the ROC Curve (AUC) statistics. The impact of quality control (QC), imputing and coding methods on LR results showed that QC methods and imputation of missing genotypes may artificially increase the scores. At the opposite, neither the patient/control ratio nor marker preselection or coding strategies significantly affected the results. LR methods, including Lasso, Ridge and ElasticNet provided similar results with a maximum AUC of 0.80. GBT methods like XGBoost, LightGBM and CatBoost, together with dense NN with one or more hidden layers, provided similar AUC values, suggesting limited epistatic effects in the genetic architecture of the trait. ML methods detected near all the genetic variants previously identified by GWAS among the best predictors plus additional predictors with lower effects. The robustness and complementarity of the different methods are also studied. Compared to LR, non-linear models such as GBT or NN may provide robust complementary approaches to identify and classify genetic markers

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead
    corecore