Systems biology discoveries using non-human primate pluripotent stem and germ cells: novel gene and genomic imprinting interactions as well as unique expression patterns
The study of pluripotent stem cells has generated much interest in both biology and medicine. Understanding the fundamentals of biological decisions, including what permits a cell to maintain pluripotency, that is, its ability to self-renew and thereby remain immortal, or to differentiate into multiple types of cells, is of profound importance. For clinical applications, pluripotent cells, including both embryonic stem cells and adult stem cells, have been proposed for cell replacement therapy for a number of human diseases and disorders, including Alzheimer's, Parkinson's, spinal cord injury and diabetes. One challenge in their use for such therapies is understanding the mechanisms that allow the maintenance of pluripotency and controlling the specific differentiation into required functional target cells. Because of regulatory restrictions and biological feasibility, many crucial investigations are impossible to perform using pluripotent stem cells (PSCs) from humans (for example, direct comparisons among panels of inbred embryonic stem cells from prime embryos obtained from pedigreed and fertile donors; genomic analysis of parent versus progeny PSCs and their identical differentiated tissues; intraspecific chimera analyses for pluripotency testing; and so on). However, PSCs from nonhuman primates are being investigated to bridge these knowledge gaps between discoveries in mice and vital information necessary for appropriate clinical evaluations. In this review, we consider the mRNAs and novel genes with unique expression and imprinting patterns that were discovered using systems biology approaches with primate pluripotent stem and germ cells.
Long-read sequencing reveals the complex splicing profile of the psychiatric risk gene CACNA1C in human brain
RNA splicing is a key mechanism linking genetic variation with psychiatric disorders. Splicing profiles are particularly diverse in brain and difficult to accurately identify and quantify. We developed a new approach to address this challenge, combining long-range PCR and nanopore sequencing with a novel bioinformatics pipeline. We identify the full-length coding transcripts of CACNA1C in human brain. CACNA1C is a psychiatric risk gene that encodes the voltage-gated calcium channel CaV1.2. We show that CACNA1C’s transcript profile is substantially more complex than previously appreciated, identifying 38 novel exons and 241 novel transcripts. Importantly, many of the novel variants are abundant, and predicted to encode channels with altered function. The splicing profile varies between brain regions, especially in cerebellum. We demonstrate that human transcript diversity (and thereby protein isoform diversity) remains under-characterised, and provide a feasible and cost-effective methodology to address this. A detailed understanding of isoform diversity will be essential for the translation of psychiatric genomic findings into pathophysiological insights and novel psychopharmacological targets.
A user's guide to the Encyclopedia of DNA elements (ENCODE)
The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome.
Changes to the Fossil Record of Insects through Fifteen Years of Discovery
The first and last occurrences of hexapod families in the fossil record are compiled from publications up to end-2009. The major features of these data are compared with those of previous datasets (1993 and 1994). About a third of families (>400) are new to the fossil record since 1994, over half of the earlier, existing families have experienced changes in their known stratigraphic range and only about ten percent have unchanged ranges. Despite these significant additions to knowledge, the broad pattern of described richness through time remains similar, with described richness increasing steadily through geological history and a shift in dominant taxa, from Palaeoptera and Polyneoptera to Paraneoptera and Holometabola, after the Palaeozoic. However, after detrending, described richness is not well correlated with the earlier datasets, indicating significant changes in shorter-term patterns. There is reduced Palaeozoic richness, peaking at a different time, and a less pronounced Permian decline. A pronounced Triassic peak and decline is shown, and the plateau from the mid Early Cretaceous to the end of the period remains, albeit at substantially higher richness compared to earlier datasets. Origination and extinction rates are broadly similar to before, with a broad decline in both through time but episodic peaks, including end-Permian turnover. Origination more consistently exceeds extinction compared to previous datasets and exceptions are mainly in the Palaeozoic. These changes suggest that some inferences about causal mechanisms in insect macroevolution are likely to differ as well.
Targeted, high-resolution RNA sequencing of non-coding genomic regions associated with neuropsychiatric functions
The human brain is one of the last frontiers of biomedical research. Genome-wide association studies (GWAS) have succeeded in identifying thousands of haplotype blocks associated with a range of neuropsychiatric traits, including disorders such as schizophrenia, Alzheimer's and Parkinson's disease. However, the majority of single nucleotide polymorphisms (SNPs) that mark these haplotype blocks fall within non-coding regions of the genome, hindering their functional validation. While some of these GWAS loci may contain cis-acting regulatory DNA elements such as enhancers, we hypothesized that many are also transcribed into non-coding RNAs that are missing from publicly available transcriptome annotations. Here, we use targeted RNA capture ('RNA CaptureSeq') in combination with nanopore long-read cDNA sequencing to transcriptionally profile 1,023 haplotype blocks across the genome containing non-coding GWAS SNPs associated with neuropsychiatric traits, using post-mortem human brain tissue from three neurologically healthy donors. We find that the majority (62%) of targeted haplotype blocks, including 13% of intergenic blocks, are transcribed into novel, multi-exonic RNAs, most of which are not yet recorded in GENCODE annotations. We validated our findings with short-read RNA-seq, providing orthogonal confirmation of novel splice junctions and enabling a quantitative assessment of the long-read assemblies. Many novel transcripts are supported by independent evidence of transcription including cap analysis of gene expression (CAGE) data and epigenetic marks, and some show signs of potential functional roles. We present these transcriptomes as a preliminary atlas of non-coding transcription in human brain that can be used to connect neurological phenotypes with gene expression.
The acidity of atmospheric particles and clouds
Version of Record. Funders: Excellent Science; PANACEA; National Science Foundation; U.S. Department of Energy; U.S. Environmental Protection Agency; Office of Science; Natural Sciences and Engineering Research Council of Canada; European Commission; European Research Council; European Regional Development Fund.
Nanopore long-read RNAseq reveals widespread transcriptional variation among the surface receptors of individual B cells
Understanding gene regulation and function requires a genome-wide method capable of capturing both gene expression levels and isoform diversity at the single-cell level. Short-read RNAseq is limited in its ability to resolve complex isoforms because it fails to sequence full-length cDNA copies of RNA molecules. Here, we investigate whether RNAseq using the long-read single-molecule Oxford Nanopore MinION sequencer is able to identify and quantify complex isoforms without sacrificing accurate gene expression quantification. After benchmarking our approach, we analyse individual murine B1a cells using a custom multiplexing strategy. We identify thousands of unannotated transcription start and end sites, as well as hundreds of alternative splicing events in these B1a cells. We also identify hundreds of genes expressed across B1a cells that display multiple complex isoforms, including several B cell-specific surface receptors. Our results show that we can identify and quantify complex isoforms at the single cell level.