63 research outputs found
The birth of a human-specific neural gene by incomplete duplication and gene fusion
Background: Gene innovation by duplication is a fundamental evolutionary process but is difficult to study in humans due to the large size, high sequence identity, and mosaic nature of segmental duplication blocks. The human-specific gene hydrocephalus-inducing 2, HYDIN2, was generated by a 364 kbp duplication of 79 internal exons of the large ciliary gene HYDIN from chromosome 16q22.2 to chromosome 1q21.1. Because the HYDIN2 locus lacks the ancestral promoter and seven terminal exons of the progenitor gene, we sought to characterize transcription at this locus by coupling reverse transcription polymerase chain reaction and long-read sequencing. Results: 5' RACE indicates a transcription start site for HYDIN2 outside of the duplication and we observe fusion transcripts spanning both the 5' and 3' breakpoints. We observe extensive splicing diversity leading to the formation of altered open reading frames (ORFs) that appear to be under relaxed selection. We show that HYDIN2 adopted a new promoter that drives an altered pattern of expression, with highest levels in neural tissues. We estimate that the HYDIN duplication occurred ~3.2 million years ago and find that it is nearly fixed (99.9%) for diploid copy number in contemporary humans. Examination of 73 chromosome 1q21 rearrangement patients reveals that HYDIN2 is deleted or duplicated in most cases. Conclusions: Together, these data support a model of rapid gene innovation by fusion of incomplete segmental duplications, altered tissue expression, and potential subfunctionalization or neofunctionalization of HYDIN2 early in the evolution of the Homo lineage
Methods for visual mining of genomic and proteomic data atlases
<p>Abstract</p> <p>Background</p> <p>As the volume, complexity and diversity of the information that scientists work with on a daily basis continues to rise, so too does the requirement for new analytic software. The analytic software must solve the dichotomy that exists between the need to allow for a high level of scientific reasoning, and the requirement to have an intuitive and easy to use tool which does not require specialist, and often arduous, training to use. Information visualization provides a solution to this problem, as it allows for direct manipulation and interaction with diverse and complex data. The challenge addressing bioinformatics researches is how to apply this knowledge to data sets that are continually growing in a field that is rapidly changing.</p> <p>Results</p> <p>This paper discusses an approach to the development of visual mining tools capable of supporting the mining of massive data collections used in systems biology research, and also discusses lessons that have been learned providing tools for both local researchers and the wider community. Example tools were developed which are designed to enable the exploration and analyses of both proteomics and genomics based atlases. These atlases represent large repositories of raw and processed experiment data generated to support the identification of biomarkers through mass spectrometry (the PeptideAtlas) and the genomic characterization of cancer (The Cancer Genome Atlas). Specifically the tools are designed to allow for: the visual mining of thousands of mass spectrometry experiments, to assist in designing informed targeted protein assays; and the interactive analysis of hundreds of genomes, to explore the variations across different cancer genomes and cancer types.</p> <p>Conclusions</p> <p>The mining of massive repositories of biological data requires the development of new tools and techniques. Visual exploration of the large-scale atlas data sets allows researchers to mine data to find new meaning and make sense at scales from single samples to entire populations. Providing linked task specific views that allow a user to start from points of interest (from diseases to single genes) enables targeted exploration of thousands of spectra and genomes. As the composition of the atlases changes, and our understanding of the biology increase, new tasks will continually arise. It is therefore important to provide the means to make the data available in a suitable manner in as short a time as possible. We have done this through the use of common visualization workflows, into which we rapidly deploy visual tools. These visualizations follow common metaphors where possible to assist users in understanding the displayed data. Rapid development of tools and task specific views allows researchers to mine large-scale data almost as quickly as it is produced. Ultimately these visual tools enable new inferences, new analyses and further refinement of the large scale data being provided in atlases such as PeptideAtlas and The Cancer Genome Atlas.</p
Identification of β-Secretase (BACE1) Substrates Using Quantitative Proteomics
β-site APP cleaving enzyme 1 (BACE1) is a transmembrane aspartyl protease with a lumenal active site that sheds the ectodomains of membrane proteins through juxtamembrane proteolysis. BACE1 has been studied principally for its role in Alzheimer's disease as the β-secretase responsible for generating the amyloid-β protein. Emerging evidence from mouse models has identified the importance of BACE1 in myelination and cognitive performance. However, the substrates that BACE1 processes to regulate these functions are unknown, and to date only a few β-secretase substrates have been identified through candidate-based studies. Using an unbiased approach to substrate identification, we performed quantitative proteomic analysis of two human epithelial cell lines stably expressing BACE1 and identified 68 putative β-secretase substrates, a number of which we validated in a cell culture system. The vast majority were of type I transmembrane topology, although one was type II and three were GPI-linked proteins. Intriguingly, a preponderance of these proteins are involved in contact-dependent intercellular communication or serve as receptors and have recognized roles in the nervous system and other organs. No consistent sequence motif predicting BACE1 cleavage was identified in substrates versus non-substrates. These findings expand our understanding of the proteins and cellular processes that BACE1 may regulate, and suggest possible mechanisms of toxicity arising from chronic BACE1 inhibition
Genetics of human hydrocephalus
Human hydrocephalus is a common medical condition that is characterized by abnormalities in the flow or resorption of cerebrospinal fluid (CSF), resulting in ventricular dilatation. Human hydrocephalus can be classified into two clinical forms, congenital and acquired. Hydrocephalus is one of the complex and multifactorial neurological disorders. A growing body of evidence indicates that genetic factors play a major role in the pathogenesis of hydrocephalus. An understanding of the genetic components and mechanism of this complex disorder may offer us significant insights into the molecular etiology of impaired brain development and an accumulation of the cerebrospinal fluid in cerebral compartments during the pathogenesis of hydrocephalus. Genetic studies in animal models have started to open the way for understanding the underlying pathology of hydrocephalus. At least 43 mutants/loci linked to hereditary hydrocephalus have been identified in animal models and humans. Up to date, 9 genes associated with hydrocephalus have been identified in animal models. In contrast, only one such gene has been identified in humans. Most of known hydrocephalus gene products are the important cytokines, growth factors or related molecules in the cellular signal pathways during early brain development. The current molecular genetic evidence from animal models indicate that in the early development stage, impaired and abnormal brain development caused by abnormal cellular signaling and functioning, all these cellular and developmental events would eventually lead to the congenital hydrocephalus. Owing to our very primitive knowledge of the genetics and molecular pathogenesis of human hydrocephalus, it is difficult to evaluate whether data gained from animal models can be extrapolated to humans. Initiation of a large population genetics study in humans will certainly provide invaluable information about the molecular and cellular etiology and the developmental mechanisms of human hydrocephalus. This review summarizes the recent findings on this issue among human and animal models, especially with reference to the molecular genetics, pathological, physiological and cellular studies, and identifies future research directions
Differential Cerebral Cortex Transcriptomes of Baboon Neonates Consuming Moderate and High Docosahexaenoic Acid Formulas
BACKGROUND: Docosahexaenoic acid (DHA, 22:6n-3) and arachidonic acid (ARA, 20:4n-6) are the major long chain polyunsaturated fatty acids (LCPUFA) of the central nervous system (CNS). These nutrients are present in most infant formulas at modest levels, intended to support visual and neural development. There are no investigations in primates of the biological consequences of dietary DHA at levels above those present in formulas but within normal breastmilk levels. METHODS AND FINDINGS: Twelve baboons were divided into three formula groups: Control, with no DHA-ARA; “L”, LCPUFA, with 0.33%DHA-0.67%ARA; “L3”, LCPUFA, with 1.00%DHA-0.67%ARA. All the samples are from the precentral gyrus of cerebral cortex brain regions. At 12 weeks of age, changes in gene expression were detected in 1,108 of 54,000 probe sets (2.05%), with most showing <2-fold change. Gene ontology analysis assigns them to diverse biological functions, notably lipid metabolism and transport, G-protein and signal transduction, development, visual perception, cytoskeleton, peptidases, stress response, transcription regulation, and 400 transcripts having no defined function. PLA2G6, a phospholipase recently associated with infantile neuroaxonal dystrophy, was downregulated in both LCPUFA groups. ELOVL5, a PUFA elongase, was the only LCPUFA biosynthetic enzyme that was differentially expressed. Mitochondrial fatty acid carrier, CPT2, was among several genes associated with mitochondrial fatty acid oxidation to be downregulated by high DHA, while the mitochondrial proton carrier, UCP2, was upregulated. TIMM8A, also known as deafness/dystonia peptide 1, was among several differentially expressed neural development genes. LUM and TIMP3, associated with corneal structure and age-related macular degeneration, respectively, were among visual perception genes influenced by LCPUFA. TIA1, a silencer of COX2 gene translation, is upregulated by high DHA. Ingenuity pathway analysis identified a highly significant nervous system network, with epidermal growth factor receptor (EGFR) as the outstanding interaction partner. CONCLUSIONS: These data indicate that LCPUFA concentrations within the normal range of human breastmilk induce global changes in gene expression across a wide array of processes, in addition to changes in visual and neural function normally associated with formula LCPUFA
Evolution of soil surface roughness and flowpath connectivity in overland flow experiments
International audienc
- …