59 research outputs found

    Mouse SNP Miner: an annotated database of mouse functional single nucleotide polymorphisms

    Get PDF
    BACKGROUND: The mapping of quantitative trait loci in rat and mouse has been extremely successful in identifying chromosomal regions associated with human disease-related phenotypes. However, identifying the specific phenotype-causing DNA sequence variations within a quantitative trait locus has been much more difficult. The recent availability of genomic sequence from several mouse inbred strains (including C57BL/6J, 129X1/SvJ, 129S1/SvImJ, A/J, and DBA/2J) has made it possible to catalog DNA sequence differences within a quantitative trait locus derived from crosses between these strains. However, even for well-defined quantitative trait loci (<10 Mb) the identification of candidate functional DNA sequence changes remains challenging due to the high density of sequence variation between strains. DESCRIPTION: To help identify functional DNA sequence variations within quantitative trait loci we have used the Ensembl annotated genome sequence to compile a database of mouse single nucleotide polymorphisms (SNPs) that are predicted to cause missense, nonsense, frameshift, or splice site mutations (available at ). For missense mutations we have used the PolyPhen and PANTHER algorithms to predict whether amino acid changes are likely to disrupt protein function. CONCLUSION: We have developed a database of mouse SNPs predicted to cause missense, nonsense, frameshift, and splice-site mutations. Our analysis revealed that 20% and 14% of missense SNPs are likely to be deleterious according to PolyPhen and PANTHER, respectively, and 6% are considered deleterious by both algorithms. The database also provides gene expression and functional annotations from the Symatlas, Gene Ontology, and OMIM databases to further assess candidate phenotype-causing mutations. To demonstrate its utility, we show that Mouse SNP Miner successfully finds a previously identified candidate SNP in the taste receptor, Tas1r3, that underlies sucrose preference in the C57BL/6J strain. We also use Mouse SNP Miner to derive a list of candidate phenotype-causing mutations within a previously uncharacterized QTL for response to morphine in the 129/Sv strain

    Asymmetric and non-uniform evolution of recently duplicated human genes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene duplications are a source of new genes and protein functions. The innovative role of duplication events makes families of paralogous genes an interesting target for studies in evolutionary biology. Here we study global trends in the evolution of human genes that resulted from recent duplications.</p> <p>Results</p> <p>The pressure of negative selection is weaker during a short time immediately after a duplication event. Roughly one fifth of genes in paralogous gene families are evolving asymmetrically: one of the proteins encoded by two closest paralogs accumulates amino acid substitutions significantly faster than its partner. This asymmetry cannot be explained by differences in gene expression levels. In asymmetric gene pairs the number of deleterious mutations is increased in one copy, while decreased in the other copy as compared to genes constituting non-asymmetrically evolving pairs. The asymmetry in the rate of synonymous substitutions is much weaker and not significant.</p> <p>Conclusions</p> <p>The increase of negative selection pressure over time after a duplication event seems to be a major trend in the evolution of human paralogous gene families. The observed asymmetry in the evolution of paralogous genes shows that in many cases one of two gene copies remains practically unchanged, while the other accumulates functional mutations. This supports the hypothesis that slowly evolving gene copies preserve their original functions, while fast evolving copies obtain new specificities or functions.</p> <p>Reviewers</p> <p>This article was reviewed by Dr. Igor Rogozin (nominated by Dr. Arcady Mushegian), Dr. Fyodor Kondrashov, and Dr. Sergei Maslov.</p

    A retrocopy of a gene can functionally displace the source gene in evolution

    Get PDF
    The e(y)2 gene of Drosophila melanogaster encodes the ubiquitous evolutionarily conserved co-activator of RNA polymerase II that is involved in transcription regulation of a high number of genes. The Drosophila e(y)2b gene, paralogue of the e(y)2 has been found. The analysis of structure of the e(y)2, e(y)2b and its orthologues from other species reveals that the e(y)2 gene derived as a result of retroposition of the e(y)2b during Drosophila evolution. The mRNA-derived retrogenes lack introns or regulatory regions; most of them become pseudogenes whereas some acquire tissue-specific functions. Here we describe the different situation: the e(y)2 retrogene performs the general function and is ubiquitously expressed, while the source gene is functional only in a small group of male germ cells. This must have resulted from retroposition into a transcriptionally favorable region of the genome

    Sequencing strategies and characterization of 721 vervet monkey genomes for future genetic analyses of medically relevant traits

    Get PDF
    Background We report here the first genome-wide high-resolution polymorphism resource for non-human primate (NHP) association and linkage studies, constructed for the Caribbean-origin vervet monkey, or African green monkey (Chlorocebus aethiops sabaeus), one of the most widely used NHPs in biomedical research. We generated this resource by whole genome sequencing (WGS) of monkeys from the Vervet Research Colony (VRC), an NIH-supported research resource for which extensive phenotypic data are available. Results We identified genome-wide single nucleotide polymorphisms (SNPs) by WGS of 721 members of an extended pedigree from the VRC. From high-depth WGS data we identified more than 4 million polymorphic unequivocal segregating sites; by pruning these SNPs based on heterozygosity, quality control filters, and the degree of linkage disequilibrium (LD) between SNPs, we constructed genome-wide panels suitable for genetic association (about 500,000 SNPs) and linkage analysis (about 150,000 SNPs). To further enhance the utility of these resources for linkage analysis, we used a further pruned subset of the linkage panel to generate multipoint identity by descent matrices. Conclusions The genetic and phenotypic resources now available for the VRC and other Caribbean-origin vervets enable their use for genetic investigation of traits relevant to human diseases

    MYB bi-allelic targeting abrogates primitive clonogenic progenitors while the emergence of primitive blood cells is not affected

    Get PDF
    MYB is a key regulator of definitive hematopoiesis and it is dispensable for the development of primitive hematopoietic cells in vertebrates. To delineate definitive versus primitive hematopoiesis during differentiation of human embryonic stem cells, we have introduced reporters into the MYB locus and inactivated the gene by bi-allelic targeting. To recapitulate the early developmental events more adequately, the mutant and wild type human embryonic stem cell lines were differentiated in defined culture conditions without the addition of hematopoietic cytokines. The differentiation of the reporter cell lines demonstrated that MYB is specifically expressed throughout emerging hematopoietic cell populations. Here we show that the disruption of the MYB gene leads to severe defects in the development and proliferation of primitive hematopoietic progenitors while the emergence of primitive blood cells is not affected. We also provide evidence that MYB is essential for neutrophil and T cell development and the upregulation of innate immunity genes during hematopoietic differentiation. Our results suggest that the endothelial origin of primitive blood cells is direct and does not include the intermediate step of primitive hematopoietic progenitors

    ACE2 and TMPRSS2 variation in savanna monkeys (Chlorocebus spp.): potential risk for zoonotic/anthroponotic transmission of SARS-CoV-2 and a potential model for functional studies

    Get PDF
    The COVID-19 pandemic, caused by the coronavirus SARS-CoV-2, has devastated health infrastructure around the world. Both ACE2 (an entry receptor) and TMPRSS2 (used by the virus for spike protein priming) are key proteins to SARS-CoV-2 cell entry, enabling progression to COVID-19 in humans. Comparative genomic research into critical ACE2 binding sites, associated with the spike receptor binding domain, has suggested that African and Asian primates may also be susceptible to disease from SARS-CoV-2 infection. Savanna monkeys (Chlorocebus spp.) are a widespread non-human primate with well-established potential as a bi-directional zoonotic/anthroponotic agent due to high levels of human interaction throughout their range in sub-Saharan Africa and the Caribbean. To characterize potential functional variation in savanna monkey ACE2 and TMPRSS2, we inspected recently published genomic data from 245 savanna monkeys, including 163 wild monkeys from Africa and the Caribbean and 82 captive monkeys from the Vervet Research Colony (VRC). We found several missense variants. One missense variant in ACE2 (X:14,077,550; Asp30Gly), common in Ch. sabaeus, causes a change in amino acid residue that has been inferred to reduce binding efficiency of SARS-CoV-2, suggesting potentially reduced susceptibility. The remaining populations appear as susceptible as humans, based on these criteria for receptor usage. All missense variants observed in wild Ch. sabaeus populations are also present in the VRC, along with two splice acceptor variants (at X:14,065,076) not observed in the wild sample that are potentially disruptive to ACE2 function. The presence of these variants in the VRC suggests a promising model for SARS-CoV-2 infection and vaccine and therapy development. In keeping with a One Health approach, characterizing actual susceptibility and potential for bi-directional zoonotic/anthroponotic transfer in savanna monkey populations may be an important consideration for controlling COVID-19 epidemics in communities with frequent human/non-human primate interactions that, in many cases, may have limited health infrastructure.P40 OD010965 - NIH HHSPublished versio

    Contribution of common and rare variants to bipolar disorder susceptibility in extended pedigrees from population isolates.

    Get PDF
    Current evidence from case/control studies indicates that genetic risk for psychiatric disorders derives primarily from numerous common variants, each with a small phenotypic impact. The literature describing apparent segregation of bipolar disorder (BP) in numerous multigenerational pedigrees suggests that, in such families, large-effect inherited variants might play a greater role. To identify roles of rare and common variants on BP, we conducted genetic analyses in 26 Colombia and Costa Rica pedigrees ascertained for bipolar disorder 1 (BP1), the most severe and heritable form of BP. In these pedigrees, we performed microarray SNP genotyping of 838 individuals and high-coverage whole-genome sequencing of 449 individuals. We compared polygenic risk scores (PRS), estimated using the latest BP1 genome-wide association study (GWAS) summary statistics, between BP1 individuals and related controls. We also evaluated whether BP1 individuals had a higher burden of rare deleterious single-nucleotide variants (SNVs) and rare copy number variants (CNVs) in a set of genes related to BP1. We found that compared with unaffected relatives, BP1 individuals had higher PRS estimated from BP1 GWAS statistics (P = 0.001 ~ 0.007) and displayed modest increase in burdens of rare deleterious SNVs (P = 0.047) and rare CNVs (P = 0.002 ~ 0.033) in genes related to BP1. We did not observe rare variants segregating in the pedigrees. These results suggest that small-to-moderate effect rare and common variants are more likely to contribute to BP1 risk in these extended pedigrees than a few large-effect rare variants

    Rare Copy Number Variants in \u3cem\u3eNRXN1\u3c/em\u3e and \u3cem\u3eCNTN6\u3c/em\u3e Increase Risk for Tourette Syndrome

    Get PDF
    Tourette syndrome (TS) is a model neuropsychiatric disorder thought to arise from abnormal development and/or maintenance of cortico-striato-thalamo-cortical circuits. TS is highly heritable, but its underlying genetic causes are still elusive, and no genome-wide significant loci have been discovered to date. We analyzed a European ancestry sample of 2,434 TS cases and 4,093 ancestry-matched controls for rare (\u3c 1% frequency) copy-number variants (CNVs) using SNP microarray data. We observed an enrichment of global CNV burden that was prominent for large (\u3e 1 Mb), singleton events (OR = 2.28, 95% CI [1.39–3.79], p = 1.2 × 10−3) and known, pathogenic CNVs (OR = 3.03 [1.85–5.07], p = 1.5 × 10−5). We also identified two individual, genome-wide significant loci, each conferring a substantial increase in TS risk (NRXN1 deletions, OR = 20.3, 95% CI [2.6–156.2]; CNTN6 duplications, OR = 10.1, 95% CI [2.3–45.4]). Approximately 1% of TS cases carry one of these CNVs, indicating that rare structural variation contributes significantly to the genetic architecture of TS
    • …
    corecore