Search CORE

81 research outputs found

SCN1A: bioinformatically-informed revised boundaries for promoter and enhancer regions

Author: Custodio Helena Martins
Frankish Adam
Mills James D
Mudge Jonathan M
Pagni Susanna
Sisodiya Sanjay M
Publication venue: 'Oxford University Press (OUP)'
Publication date: 28/01/2023
Field of study

Pathogenic variations in the sodium voltage-gated channel alpha subunit 1 (SCN1A) gene are responsible for multiple epilepsy phenotypes, including Dravet syndrome (DS), febrile seizures (FS), and genetic epilepsy with febrile seizures plus (GEFS+). Phenotypic heterogeneity is a hallmark of SCN1A-related epilepsies, the causes of which are yet to be clarified. Genetic variation in the non-coding regulatory regions of SCN1A could be one potential causal factor. However, a comprehensive understanding of the SCN1A regulatory landscape is currently lacking. Here, we summarised the current state of knowledge of SCN1A regulation, providing details of its promoter and enhancer regions. We then integrated currently available data on SCN1A promoters by extracting information related to the SCN1A locus from genome-wide repositories, and clearly defined the promoter and enhancer regions of SCN1A. Further, we explored the cellular specificity of differential SCN1A promoter usage. We also reviewed and integrated the available human brain-derived enhancer databases and mouse-derived data to provide a comprehensive computationally-developed summary of SCN1A brain-active enhancers. By querying genome-wide data repositories, extracting SCN1A-specific data and integrating the different types of independent evidence, we created a comprehensive catalogue that better defines the regulatory landscape of SCN1A, which could be used to explore the role of SCN1A regulatory regions in disease

UCL Discovery

Recommended from our members

Functional signatures of evolutionarily young CTCF binding sites

Author: Azazi Dhoyazan
Flicek Paul
Mudge Jonathan M.
Odom Duncan T.
Publication venue: BMC Biology
Publication date: 23/09/2020
Field of study

Abstract: Background: The introduction of novel CTCF binding sites in gene regulatory regions in the rodent lineage is partly the effect of transposable element expansion, particularly in the murine lineage. The exact mechanism and functional impact of evolutionarily novel CTCF binding sites are not yet fully understood. We investigated the impact of novel subspecies-specific CTCF binding sites in two Mus genus subspecies, Mus musculus domesticus and Mus musculus castaneus, that diverged 0.5 million years ago. Results: CTCF binding site evolution is influenced by the action of the B2-B4 family of transposable elements independently in both lineages, leading to the proliferation of novel CTCF binding sites. A subset of evolutionarily young sites may harbour transcriptional functionality as evidenced by the stability of their binding across multiple tissues in M. musculus domesticus (BL6), while overall the distance of subspecies-specific CTCF binding to the nearest transcription start sites and/or topologically associated domains (TADs) is largely similar to musculus-common CTCF sites. Remarkably, we discovered a recurrent regulatory architecture consisting of a CTCF binding site and an interferon gene that appears to have been tandemly duplicated to create a 15-gene cluster on chromosome 4, thus forming a novel BL6 specific immune locus in which CTCF may play a regulatory role. Conclusions: Our results demonstrate that thousands of CTCF binding sites show multiple functional signatures rapidly after incorporation into the genome

Apollo (Cambridge)

Dynamic instability of the major urinary protein gene family revealed by genomic and phenotypic comparisons between C57 and 129 strain mice

Author: Armstrong Stuart D
Beynon Robert J
Harrow Jennifer L
Hurst Jane L
McLaren Karen
Mudge Jonathan M
Nicholson Christine
Robertson Duncan H
Wilming Laurens G
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Targeted sequencing, manual genome annotation, phylogenetic analysis and mass spectrometry were used to characterise major urinary proteins (MUPs) and the Mup clusters of two strains of inbred mice

Springer - Publisher Connector

PubMed Central

Evidence for a novel overlapping coding sequence in POLG initiated at a CUG start codon

Author: Choudhary Jyoti S.
Firth Andrew E.
Jungreis Irwin
Kellis Manolis
Khan Yousuf A.
Mudge Jonathan M.
Wright James C.
Publication venue: BMC Genetics
Publication date: 06/03/2020
Field of study

Abstract: Background: POLG, located on nuclear chromosome 15, encodes the DNA polymerase γ(Pol γ). Pol γ is responsible for the replication and repair of mitochondrial DNA (mtDNA). Pol γ is the only DNA polymerase found in mitochondria for most animal cells. Mutations in POLG are the most common single-gene cause of diseases of mitochondria and have been mapped over the coding region of the POLG ORF. Results: Using PhyloCSF to survey alternative reading frames, we found a conserved coding signature in an alternative frame in exons 2 and 3 of POLG, herein referred to as ORF-Y that arose de novo in placental mammals. Using the synplot2 program, synonymous site conservation was found among mammals in the region of the POLG ORF that is overlapped by ORF-Y. Ribosome profiling data revealed that ORF-Y is translated and that initiation likely occurs at a CUG codon. Inspection of an alignment of mammalian sequences containing ORF-Y revealed that the CUG codon has a strong initiation context and that a well-conserved predicted RNA stem-loop begins 14 nucleotides downstream. Such features are associated with enhanced initiation at near-cognate non-AUG codons. Reanalysis of the Kim et al. (2014) draft human proteome dataset yielded two unique peptides that map unambiguously to ORF-Y. An additional conserved uORF, herein referred to as ORF-Z, was also found in exon 2 of POLG. Lastly, we surveyed Clinvar variants that are synonymous with respect to the POLG ORF and found that most of these variants cause amino acid changes in ORF-Y or ORF-Z. Conclusions: We provide evidence for a novel coding sequence, ORF-Y, that overlaps the POLG ORF. Ribosome profiling and mass spectrometry data show that ORF-Y is expressed. PhyloCSF and synplot2 analysis show that ORF-Y is subject to strong purifying selection. An abundance of disease-correlated mutations that map to exons 2 and 3 of POLG but also affect ORF-Y provides potential clinical significance to this finding

DSpace@MIT

Apollo (Cambridge)

Institute of Cancer Research Repository

Molecular complexity of the major urinary protein system of the Norway rat, <i>Rattus norvegicus</i>

Author: Armstrong Stuart D
Beynon Robert J
Gómez-Baena Guadalupe
Halstead Josiah O
Hurst Jane L
McLean Lynn
Mudge Jonathan M
Prescott Mark
Roberts Sarah A
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date
Field of study

ABSTRACTMajor urinary proteins (MUP) are the major component of the urinary protein fraction in house mice (Mus spp.) and rats (Rattus spp.). The structure, polymorphism and functions of these lipocalins have been well described in the western European house mouse (Mus musculus domesticus), clarifying their role in semiochemical communication. The complexity of these roles in the mouse raises the question of similar functions in other rodents, including the Norway rat, Rattus norvegicus. Norway rats express MUPs in urine but information about specific MUP isoform sequences and functions is limited. In this study, we present a detailed molecular characterization of the MUP proteoforms expressed in the urine of two laboratory strains, Wistar Han and Brown Norway, and wild caught animals, using a combination of manual gene annotation, intact protein mass spectrometry and bottom-up mass spectrometry-based proteomic approaches. Detailed sequencing of the proteins reveals a less complex pattern of primary sequence polymorphism than the mouse. However, unlike the mouse, rat MUPs exhibit added complexity in the form of post-translational modifications including phosphorylation and exoproteolytic trimming of specific isoforms. The possibility that urinary MUPs may have different roles in rat chemical communication than those they play in the house mouse is also discussed.</jats:p

University of Liverpool Repository

SCN1A overexpression, associated with a genomic region marked by a risk variant for a common epilepsy, raises seizure susceptibility

Author: Alhusaini Saud
Becker Albert J
de Zubicaray Greig I
Esguerra Camila V
Frankish Adam
Gawel Kinga
Kirstein-Smardzewska Karolina J
Martins Custodio Helena
McMahon Katie L
Michalak Zuzanna
Mills James
Mudge Jonathan M
Pagni Susanna
Picardo Richard
Pitsch Julika
Schoch Susanne
Silvennoinen Katri
Sisodiya Sanjay M
Thom Maria
Thompson Paul M
Tiraboschi Ettore
Tsortouktzidis Despina
van der Ent Wietske
van Loo Karen MJ
Whelan Christopher D
Wright Margaret J
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 12/05/2022
Field of study

Mesial temporal lobe epilepsy with hippocampal sclerosis and a history of febrile seizures is associated with common variation at rs7587026, located in the promoter region of SCN1A. We sought to explore possible underlying mechanisms. SCN1A expression was analysed in hippocampal biopsy specimens of individuals with mesial temporal lobe epilepsy with hippocampal sclerosis who underwent surgical treatment, and hippocampal neuronal cell loss was quantitatively assessed using immunohistochemistry. In healthy individuals, hippocampal volume was measured using MRI. Analyses were performed stratified by rs7587026 type. To study the functional consequences of increased SCN1A expression, we generated, using transposon-mediated bacterial artificial chromosome transgenesis, a zebrafish line expressing exogenous scn1a, and performed EEG analysis on larval optic tecta at 4 day post-fertilization. Finally, we used an in vitro promoter analysis to study whether the genetic motif containing rs7587026 influences promoter activity. Hippocampal SCN1A expression differed by rs7587026 genotype (Kruskal-Wallis test P = 0.004). Individuals homozygous for the minor allele showed significantly increased expression compared to those homozygous for the major allele (Dunn's test P = 0.003), and to heterozygotes (Dunn's test P = 0.035). No statistically significant differences in hippocampal neuronal cell loss were observed between the three genotypes. Among 597 healthy participants, individuals homozygous for the minor allele at rs7587026 displayed significantly reduced mean hippocampal volume compared to major allele homozygotes (Cohen's D = - 0.28, P = 0.02), and to heterozygotes (Cohen's D = - 0.36, P = 0.009). Compared to wild type, scn1lab-overexpressing zebrafish larvae exhibited more frequent spontaneous seizures [one-way ANOVA F(4,54) = 6.95 (P < 0.001)]. The number of EEG discharges correlated with the level of scn1lab overexpression [one-way ANOVA F(4,15) = 10.75 (P < 0.001]. Finally, we showed that a 50 bp promoter motif containing rs7587026 exerts a strong regulatory role on SCN1A expression, though we could not directly link this to rs7587026 itself. Our results develop the mechanistic link between rs7587026 and mesial temporal lobe epilepsy with hippocampal sclerosis and a history of febrile seizures. Furthermore, we propose that quantitative precision may be important when increasing SCN1A expression in current strategies aiming to treat seizures in conditions involving SCN1A haploinsufficiency, such as Dravet syndrome

UCL Discovery

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

Author: Aken Bronwen L
Barnes If
Bennett Ruth
Berry Andrew E
Bruford Elspeth A
Bult Carol J
Cox Eric
Davidson Claire
Diekhans Mark
Farrell Catherine M
Frankish Adam
Girón Carlos G
Goldfarb Tamara
Gonzalez Jose M
Hunt Toby
Jackson John
Joardar Vinita
Kay Mike P
Kodali Vamsi K
Loveland Jane E
Martin Fergal J
McAndrews Monica
McGarvey Kelly M
Mudge Jonathan M
Murphy Michael
Murphy Terence
O\u27Leary Nuala A
Pruitt Kim D
Pujar Shashikant
Rajput Bhanu
Rangwala Sanjida H
Riddick Lillian D
Seal Ruth L
Suner Marie-Marthe
Wallin Craig
Webb David
Zhu Sophia
Publication venue: The Mouseion at the JAXlibrary
Publication date: 06/11/2017
Field of study

The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community. Nucleic Acids Res 2018 Jan 4; 46(D1):D221-D228

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

RNAcentral 2021: secondary structure integration, improved sequence search and new member databases

RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences that provides a single access point to 44 RNA resources and >18 million ncRNA sequences from a wide range of organisms and RNA types. RNAcentral now also includes secondary (2D) structure information for >13 million sequences, making RNAcentral the world's largest RNA 2D structure database. The 2D diagrams are displayed using R2DT, a new 2D structure visualization method that uses consistent, reproducible and recognizable layouts for related RNAs. The sequence similarity search has been updated with a faster interface featuring facets for filtering search results by RNA type, organism, source database or any keyword. This sequence search tool is available as a reusable web component, and has been integrated into several RNAcentral member databases, including Rfam, miRBase and snoDB. To allow for a more fine-grained assignment of RNA types and subtypes, all RNAcentral sequences have been annotated with Sequence Ontology terms. The RNAcentral database continues to grow and provide a central data resource for the RNA community

Ghent University Academic Bibliography

Copenhagen University Research Information System

RNAcentral 2021: secondary structure integration, improved sequence search and new member databases.

Ghent University Academic Bibliography

Copenhagen University Research Information System

eScholarship - University of California

Apollo (Cambridge)