82 research outputs found
Expansion of the HSFY gene family in pig lineages : HSFY expansion in suids.
BACKGROUND: Amplified gene families on sex chromosomes can harbour genes with important biological functions, especially relating to fertility. The Y-linked heat shock transcription factor (HSFY) family has become amplified on the Y chromosome of the domestic pig (Sus scrofa), in an apparently independent event to an HSFY expansion on the Y chromosome of cattle (Bos taurus). Although the biological functions of HSFY genes are poorly understood, they appear to be involved in gametogenesis in a number of mammalian species, and, in cattle, HSFY gene copy number may correlate with levels of fertility. RESULTS: We have investigated the HSFY family in domestic pig, and other suid species including warthog, bushpig, babirusa and peccaries. The domestic pig contains at least two amplified variants of HSFY, distinguished predominantly by presence or absence of a SINE within the intron. Both these variants are expressed in testis, and both are present in approximately 50 copies each in a single cluster on the short arm of the Y. The longer form has multiple nonsense mutations rendering it likely non-functional, but many of the shorter forms still have coding potential. Other suid species also have these two variants of HSFY, and estimates of copy number suggest the HSFY family may have amplified independently twice during suid evolution. CONCLUSIONS: The HSFY genes have become amplified in multiple species lineages independently. HSFY is predominantly expressed in testis in domestic pig, a pattern conserved with cattle, in which HSFY may play a role in fertility. Further investigation of the potential associations of HSFY with fertility and testis development may be of agricultural interest.We gratefully acknowledge the Wellcome Trust Sanger Institute core teams
for fingerprinting, mapping, archiving, library construction, sequence
improvement and sequencing and Genus for providing the Duroc boar
samples. This work was funded by BBSRC grant BB/F021372/1. The Flow
Cytometry and Cytogenetics Core Facilities at the Wellcome Trust Sanger
Institute and Sanger investigators are funded by the Wellcome Trust (grant
number WT098051)
A Comprehensive Evaluation of Nasal and Bronchial Cytokines and Chemokines Following Experimental Rhinovirus Infection in Allergic Asthma: Increased Interferons (IFN-γ and IFN-λ) and Type 2 Inflammation (IL-5 and IL-13).
BACKGROUND: Rhinovirus infection is a major cause of asthma exacerbations. OBJECTIVES: We studied nasal and bronchial mucosal inflammatory responses during experimental rhinovirus-induced asthma exacerbations. METHODS: We used nasosorption on days 0, 2-5 and 7 and bronchosorption at baseline and day 4 to sample mucosal lining fluid to investigate airway mucosal responses to rhinovirus infection in patients with allergic asthma (n=28) and healthy non-atopic controls (n=11), by using a synthetic absorptive matrix and measuring levels of 34 cytokines and chemokines using a sensitive multiplex assay. RESULTS: Following rhinovirus infection asthmatics developed more upper and lower respiratory symptoms and lower peak expiratory flows compared to controls (all P<0.05). Asthmatics also developed higher nasal lining fluid levels of an anti-viral pathway (including IFN-γ, IFN-λ/IL-29, CXCL11/ITAC, CXCL10/IP10 and IL-15) and a type 2 inflammatory pathway (IL-4, IL-5, IL-13, CCL17/TARC, CCL11/eotaxin, CCL26/eotaxin-3) (area under curve day 0-7, all P<0.05). Nasal IL-5 and IL-13 were higher in asthmatics at day 0 (P<0.01) and levels increased by days 3 and 4 (P<0.01). A hierarchical correlation matrix of 24 nasal lining fluid cytokine and chemokine levels over 7days demonstrated expression of distinct interferon-related and type 2 pathways in asthmatics. In asthmatics IFN-γ, CXCL10/IP10, CXCL11/ITAC, IL-15 and IL-5 increased in bronchial lining fluid following viral infection (all P<0.05). CONCLUSIONS: Precision sampling of mucosal lining fluid identifies robust interferon and type 2 responses in the upper and lower airways of asthmatics during an asthma exacerbation. Nasosorption and bronchosorption have potential to define asthma endotypes in stable disease and at exacerbation
Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.
The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community. Nucleic Acids Res 2018 Jan 4; 46(D1):D221-D228
The pig X and Y Chromosomes: structure, sequence, and evolution.
We have generated an improved assembly and gene annotation of the pig X Chromosome, and a first draft assembly of the pig Y Chromosome, by sequencing BAC and fosmid clones from Duroc animals and incorporating information from optical mapping and fiber-FISH. The X Chromosome carries 1033 annotated genes, 690 of which are protein coding. Gene order closely matches that found in primates (including humans) and carnivores (including cats and dogs), which is inferred to be ancestral. Nevertheless, several protein-coding genes present on the human X Chromosome were absent from the pig, and 38 pig-specific X-chromosomal genes were annotated, 22 of which were olfactory receptors. The pig Y-specific Chromosome sequence generated here comprises 30 megabases (Mb). A 15-Mb subset of this sequence was assembled, revealing two clusters of male-specific low copy number genes, separated by an ampliconic region including the HSFY gene family, which together make up most of the short arm. Both clusters contain palindromes with high sequence identity, presumably maintained by gene conversion. Many of the ancestral X-related genes previously reported in at least one mammalian Y Chromosome are represented either as active genes or partial sequences. This sequencing project has allowed us to identify genes--both single copy and amplified--on the pig Y Chromosome, to compare the pig X and Y Chromosomes for homologous sequences, and thereby to reveal mechanisms underlying pig X and Y Chromosome evolution.This work was funded by BBSRC grant BB/F021372/1. The Flow Cytometry and Cytogenetics Core Facilities at the Wellcome Trust Sanger Institute and Sanger investigators are funded by the Wellcome Trust (grant number WT098051). K.B., D.C.-S., and J.H. acknowledge support from the Wellcome Trust (WT095908), the BBSRC (BB/I025506/1), and the European Molecular Biology Laboratory. The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007–2013) under grant agreement no. 222664 (“Quantomics”).This is the final version of the article. It first appeared from Cold Spring Harbor Laboratory Press via http://dx.doi.org/10.1101/gr.188839.11
The phase diagram of NiSi under the conditions of small planetary interiors
The phase diagram of NiSi has been determined using in situ synchrotron X-ray powder diffraction multi-anvil experiments to 19 GPa, with further preliminary results in the laser-heated diamond cell reported to 60 GPa. The low-pressure MnP-structured phase transforms to two different high-pressure phases depending on the temperature: the ε-FeSi structure is stable at temperatures above ∼1100 K and a previously reported distorted-CuTi structure (with Pmmn symmetry) is stable at lower temperature. The invariant point is located at 12.8 ± 0.2 GPa and 1100 ± 20 K. At higher pressures, ε -FeSi-structured NiSi transforms to the CsCl structure with CsCl-NiSi as the liquidus phase above 30 GPa. The Clapeyron slope of this transition is -67 MPa/K. The phase boundary between the ε -FeSi and Pmmn structured phases is nearly pressure independent implying there will be a second sub-solidus invariant point between CsCl, ε -FeSi and Pmmn structures at higher pressure than attained in this study. In addition to these stable phases, the MnP structure was observed to spontaneously transform at room temperature to a new orthorhombic structure (also with Pnma symmetry) which had been detailed in previous ab initio simulations. This new phase of NiSi is shown here to be metastable
Effects of antiplatelet therapy on stroke risk by brain imaging features of intracerebral haemorrhage and cerebral small vessel diseases: subgroup analyses of the RESTART randomised, open-label trial
Background
Findings from the RESTART trial suggest that starting antiplatelet therapy might reduce the risk of recurrent symptomatic intracerebral haemorrhage compared with avoiding antiplatelet therapy. Brain imaging features of intracerebral haemorrhage and cerebral small vessel diseases (such as cerebral microbleeds) are associated with greater risks of recurrent intracerebral haemorrhage. We did subgroup analyses of the RESTART trial to explore whether these brain imaging features modify the effects of antiplatelet therapy
Structural and functional annotation of the porcine immunome
Background: The domestic pig is known as an excellent model for human immunology and the two species share many pathogens. Susceptibility to infectious disease is one of the major constraints on swine performance, yet the structure and function of genes comprising the pig immunome are not well-characterized. The completion of the pig genome provides the opportunity to annotate the pig immunome, and compare and contrast pig and human immune systems.[br/] Results: The Immune Response Annotation Group (IRAG) used computational curation and manual annotation of the swine genome assembly 10.2 (Sscrofa10.2) to refine the currently available automated annotation of 1,369 immunity-related genes through sequence-based comparison to genes in other species. Within these genes, we annotated 3,472 transcripts. Annotation provided evidence for gene expansions in several immune response families, and identified artiodactyl-specific expansions in the cathelicidin and type 1 Interferon families. We found gene duplications for 18 genes, including 13 immune response genes and five non-immune response genes discovered in the annotation process. Manual annotation provided evidence for many new alternative splice variants and 8 gene duplications. Over 1,100 transcripts without porcine sequence evidence were detected using cross-species annotation. We used a functional approach to discover and accurately annotate porcine immune response genes. A co-expression clustering analysis of transcriptomic data from selected experimental infections or immune stimulations of blood, macrophages or lymph nodes identified a large cluster of genes that exhibited a correlated positive response upon infection across multiple pathogens or immune stimuli. Interestingly, this gene cluster (cluster 4) is enriched for known general human immune response genes, yet contains many un-annotated porcine genes. A phylogenetic analysis of the encoded proteins of cluster 4 genes showed that 15% exhibited an accelerated evolution as compared to 4.1% across the entire genome.[br/] Conclusions: This extensive annotation dramatically extends the genome-based knowledge of the molecular genetics and structure of a major portion of the porcine immunome. Our complementary functional approach using co-expression during immune response has provided new putative immune response annotation for over 500 porcine genes. Our phylogenetic analysis of this core immunome cluster confirms rapid evolutionary change in this set of genes, and that, as in other species, such genes are important components of the pig’s adaptation to pathogen challenge over evolutionary time. These comprehensive and integrated analyses increase the value of the porcine genome sequence and provide important tools for global analyses and data-mining of the porcine immune response
GENCODE: reference annotation for the human and mouse genomes in 2023.
GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function. Here, we present an update on the annotation of human and mouse genes, including developments in the tools, data, analyses and major collaborations which underpin this progress. For example, we report the creation of a set of non-canonical ORFs identified in GENCODE transcripts, the LRGASP collaboration to assess the use of long transcriptomic data to build transcript models, the progress in collaborations with RefSeq and UniProt to increase convergence in the annotation of human and mouse protein-coding genes, the propagation of GENCODE across the human pan-genome and the development of new tools to support annotation of regulatory features by GENCODE. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org
Systematic assessment of long-read RNA-seq methods for transcript identification and quantification
The Long-read RNA-Seq Genome Annotation Assessment Project (LRGASP) Consortium was formed to evaluate the effectiveness of long-read approaches for transcriptome analysis. The consortium generated over 427 million long-read sequences from cDNA and direct RNA datasets, encompassing human, mouse, and manatee species, using different protocols and sequencing platforms. These data were utilized by developers to address challenges in transcript isoform detection and quantification, as well as de novo transcript isoform identification. The study revealed that libraries with longer, more accurate sequences produce more accurate transcripts than those with increased read depth, whereas greater read depth improved quantification accuracy. In well-annotated genomes, tools based on reference sequences demonstrated the best performance. When aiming to detect rare and novel transcripts or when using reference-free approaches, incorporating additional orthogonal data and replicate samples are advised. This collaborative study offers a benchmark for current practices and provides direction for future method development in transcriptome analysis
ELM: the status of the 2010 eukaryotic linear motif resource
Linear motifs are short segments of multidomain proteins that provide regulatory functions independently of protein tertiary structure. Much of intracellular signalling passes through protein modifications at linear motifs. Many thousands of linear motif instances, most notably phosphorylation sites, have now been reported. Although clearly very abundant, linear motifs are difficult to predict de novo in protein sequences due to the difficulty of obtaining robust statistical assessments. The ELM resource at http://elm.eu.org/ provides an expanding knowledge base, currently covering 146 known motifs, with annotation that includes >1300 experimentally reported instances. ELM is also an exploratory tool for suggesting new candidates of known linear motifs in proteins of interest. Information about protein domains, protein structure and native disorder, cellular and taxonomic contexts is used to reduce or deprecate false positive matches. Results are graphically displayed in a ‘Bar Code’ format, which also displays known instances from homologous proteins through a novel ‘Instance Mapper’ protocol based on PHI-BLAST. ELM server output provides links to the ELM annotation as well as to a number of remote resources. Using the links, researchers can explore the motifs, proteins, complex structures and associated literature to evaluate whether candidate motifs might be worth experimental investigation
- …