13 research outputs found
CGDSNPdb: a database resource for error-checked and imputed mouse SNPs
The Center for Genome Dynamics Single Nucleotide Polymorphism Database (CGDSNPdb) is an open-source value-added database with more than nine million mouse single nucleotide polymorphisms (SNPs), drawn from multiple sources, with genotypes assigned to multiple inbred strains of laboratory mice. All SNPs are checked for accuracy and annotated for properties specific to the SNP as well as those implied by changes to overlapping protein-coding genes. CGDSNPdb serves as the primary interface to two unique data sets, the âimputed genotype resourceâ in which a Hidden Markov Model was used to assess local haplotypes and the most probable base assignment at several million genomic loci in tens of strains of mice, and the Affymetrix Mouse Diversity Genotyping Array, a high density microarray with over 600 000 SNPs and over 900 000 invariant genomic probes. CGDSNPdb is accessible online through either a web-based query tool or a MySQL public login
A customized and versatile high-density genotyping array for the mouse
We designed a high-density mouse genotyping array containing 623,124 SNPs that capture the known genetic variation present in the laboratory mouse. The array also contains 916,269 invariant genomic probes that are targeted to functional elements and regions known to harbor segmental duplications. The array opens the door to the characterization of genetic diversity, copy number variation, allele specific gene expression and DNA methylation and will extend the successes of human genome-wide association studies to the mouse
Systematic variation in mRNA 3âČ-processing signals during mouse spermatogenesis
Gene expression and processing during mouse male germ cell maturation (spermatogenesis) is highly specialized. Previous reports have suggested that there is a high incidence of alternative 3âČ-processing in male germ cell mRNAs, including reduced usage of the canonical polyadenylation signal, AAUAAA. We used EST libraries generated from mouse testicular cells to identify 3âČ-processing sites used at various stages of spermatogenesis (spermatogonia, spermatocytes and round spermatids) and testicular somatic Sertoli cells. We assessed differences in 3âČ-processing characteristics in the testicular samples, compared to control sets of widely used 3âČ-processing sites. Using a new method for comparison of degenerate regulatory elements between sequence samples, we identified significant changes in the use of putative 3âČ-processing regulatory sequence elements in all spermatogenic cell types. In addition, we observed a trend towards truncated 3âČ-untranslated regions (3âČ-UTRs), with the most significant differences apparent in round spermatids. In contrast, Sertoli cells displayed a much smaller trend towards 3âČ-UTR truncation and no significant difference in 3âČ-processing regulatory sequences. Finally, we identified a number of genes encoding mRNAs that were specifically subject to alternative 3âČ-processing during meiosis and postmeiotic development. Our results highlight developmental differences in polyadenylation site choice and in the elements that likely control them during spermatogenesis
A review of psychological approaches to working with older adults with personality disorders
Motivation: Cis-acting regulatory elements are frequently constrained by both sequence content and positioning relative to a functional site, such as a splice or polyadenylation site. We describe an approach to regulatory motif analysis based on non-negative matrix factorization (NMF). Whereas existing pattern recognition algorithms commonly focus primarily on sequence content, our method simultaneously characterizes both positioning and sequence content of putative motifs. Results: Tests on artificially generated sequences show that NMF can faithfully reproduce both positioning and content of test motifs. We show how the variation of the residual sum of squares can be used to give a robust estimate of the number of motifs or patterns in a sequence set. Our analysis distinguishes multiple motifs with significant overlap in sequence content and/or positioning. Finally, we demonstrate the use of the NMF approach through characterization of biologically interesting datasets. Specifically, an analysis of mRNA 3âČ-processing (cleavage and polyadenylation) sites from a broad range of higher eukaryotes reveals a conserved core pattern of three elements. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.National Science Foudnation (DBI-0331497); National Institutes of Health (2 P20 RR16463, 1R01GM072706
C. elegans sequences that control trans-splicing and operon pre-mRNA processing
Many mRNAs in Caenorhabditis elegans are generated through a trans-splicing reaction that adds one of two classes of spliced leader RNA to an independently transcribed pre-mRNA. SL1 leaders are spliced mostly to pre-mRNAs from genes with outrons, intron-like sequences at the 5âČ-ends of the pre-mRNAs. In contrast, SL2 leaders are nearly exclusively trans-spliced to genes that occur downstream in polycistronic pre-mRNAs produced from operons. Operon pre-mRNA processing requires separation into individual transcripts, which is accomplished by 3âČ-processing of upstream genes and spliced leader trans-splicing to the downstream genes. We used a novel computational analysis, based on nonnegative matrix factorization, to identify and characterize significant differences in the cis-acting sequence elements that differentiate various types of functional site, including internal versus terminal 3âČ-processing sites, and SL1 versus SL2 trans-splicing sites. We describe several key elements, including the U-rich (Ur) element that couples 3âČ-processing with SL2 trans-splicing, and a novel outron (Ou) element that occurs upstream of SL1 trans-splicing sites. Finally, we present models of the distinct classes of trans-splicing reaction, including SL1 trans-splicing at the outron, SL2 trans-splicing in standard operons, competitive SL1-SL2 trans-splicing in operons with large intergenic separation, and SL1 trans-splicing in SL1-type operons, which have no intergenic separation
A customized and versatile high-density genotyping array for the mouse
We designed a high-density mouse genotyping array containing 623,124 SNPs that capture the known genetic variation present in the laboratory mouse. The array also contains 916,269 invariant genomic probes that are targeted to functional elements and regions known to harbor segmental duplications. The array opens the door to the characterization of genetic diversity, copy number variation, allele specific gene expression and DNA methylation and will extend the successes of human genome-wide association studies to the mouse
Influence of diatom diversity on the ocean biological carbon pump
Diatoms sustain the marine food web and contribute to the export of carbon from the surface ocean to depth. They account for about 40% of marine primary productivity and particulate carbon exported to depth as part of the biological pump. Diatoms have long been known to be abundant in turbulent, nutrient-rich waters, but observations and simulations indicate that they are dominant also in meso- and submesoscale structures such as fronts and filaments, and in the deep chlorophyll maximum. Diatoms vary widely in size, morphology and elemental composition, all of which control the quality, quantity and sinking speed of biogenic matter to depth. In particular, their silica shells provide ballast to marine snow and faecal pellets, and can help transport carbon to both the mesopelagic layer and deep ocean. Herein we show that the extent to which diatoms contribute to the export of carbon varies by diatom type, with carbon transfer modulated by the Si/C ratio of diatom cells, the thickness of the shells and their life strategies; for instance, the tendency to form aggregates or resting spores. Model simulations project a decline in the contribution of diatoms to primary production everywhere outside of the Southern Ocean. We argue that we need to understand changes in diatom diversity, life cycle and plankton interactions in a warmer and more acidic ocean in much more detail to fully assess any changes in their contribution to the biological pump