81 research outputs found
ProteinArchitect: Protein Evolution above the Sequence Level
While many authors have discussed models and tools for studying protein evolution at the sequence level, molecular function is usually mediated by complex, higher order features such as independently folding domains and linear motifs that are based on or embedded in a particular arrangment of features such as secondary structure elements, transmembrane domains and regions with intrinsic disorder. This 'protein architecture' can, in its most simplistic representation, be visualized as domain organization cartoons that can be used to compare proteins in terms of the order of their mostly globular domains.Here, we describe a visual approach and a webserver for protein comparison that extend the domain organization cartoon concept. By developing an information-rich, compact visualization of different protein features above the sequence level, potentially related proteins can be compared at the level of propensities for secondary structure, transmembrane domains and intrinsic disorder, in addition to PFAM domains. A public Web server is available at www.proteinarchitect.net, while the code is provided at protarchitect.sourceforge.net.Due to recent advances in sequencing technologies we are now flooded with millions of predicted proteins that await comparative analysis. In many cases, mature tools focused on revealing hits with considerable global or local similarity to well-characterized proteins will not be able to lead us to testable hypotheses about a protein's function, or the function of a particular region. The visual comparison of different types of protein features with ProteinArchitect will be useful when assessing the relevance of similarity search hits, to discover subgroups in protein families and superfamilies, and to understand protein regions with conserved features outside globular regions. Therefore, this approach is likely to help researchers to develop testable hypotheses about a protein's function even if is somewhat distant from the more characterized proteins, by facilitating the discovery of features that are conserved above the sequence level for comparison and further experimental investigation
Plasma Metabolomics Implicate Modified Transfer RNAs and Altered Bioenergetics in the Outcome of Pulmonary Arterial Hypertension.
BACKGROUND: -Pulmonary arterial hypertension (PAH) is a heterogeneous disorder with high mortality. METHODS: -We conducted a comprehensive study of plasma metabolites using ultra-performance liquid chromatography mass-spectrometry to (1) identify patients at high risk of early death, (2) identify patients who respond well to treatment and (3) provide novel molecular insights into disease pathogenesis. RESULTS: -53 circulating metabolites distinguished well-phenotyped patients with idiopathic or heritable PAH (n=365) from healthy controls (n=121) following correction for multiple testing (p<7.3e-5) and confounding factors, including drug therapy, renal and hepatic impairment. A subset of 20/53 metabolites also discriminated PAH patients from disease controls (symptomatic patients without pulmonary hypertension, n=139). 62 metabolites were prognostic in PAH, with 36/62 independent of established prognostic markers. Increased levels of tRNA-specific modified nucleosides (N2,N2-dimethylguanosine, N1-methylinosine), TCA cycle intermediates (malate, fumarate), glutamate, fatty acid acylcarnitines, tryptophan and polyamine metabolites and decreased levels of steroids, sphingomyelins and phosphatidylcholines distinguished patients from controls. The largest differences correlated with increased risk of death and correction of several metabolites over time was associated with a better outcome. Patients who responded to calcium channel blocker therapy had metabolic profiles similar to healthy controls. CONCLUSIONS: -Metabolic profiles in PAH are strongly related to survival and should be considered part of the deep phenotypic characterisation of this disease. Our results support the investigation of targeted therapeutic strategies that seek to address the alterations in translational regulation and energy metabolism that characterize these patients
Identification of germline monoallelic mutations in IKZF2 in patients with immune dysregulation
Helios, encoded by IKZF2, is a member of the Ikaros family of transcription factors with pivotal roles in T-follicular helper, NK- and T-regulatory cell physiology. Somatic IKZF2 mutations are frequently found in lymphoid malignancies. Although germline mutations in IKZF1 and IKZF3 encoding Ikaros and Aiolos have recently been identified in patients with phenotypically similar immunodeficiency syndromes, the effect of germline mutations in IKZF2 on human hematopoiesis and immunity remains enigmatic. We identified germline IKZF2 mutations (one nonsense (p.R291X)- and 4 distinct missense variants) in six patients with systemic lupus erythematosus, immune thrombocytopenia or EBV-associated hemophagocytic lymphohistiocytosis. Patients exhibited hypogammaglobulinemia, decreased number of T-follicular helper and NK cells. Single-cell RNA sequencing of PBMCs from the patient carrying the R291X variant revealed upregulation of proinflammatory genes associated with T-cell receptor activation and T-cell exhaustion. Functional assays revealed the inability of HeliosR291X to homodimerize and bind target DNA as dimers. Moreover, proteomic analysis by proximity-dependent Biotin Identification revealed aberrant interaction of 3/5 Helios mutants with core components of the NuRD complex conveying HELIOS-mediated epigenetic and transcriptional dysregulation.Peer reviewe
Ensembl Genomes: Extending Ensembl across the taxonomic space
Ensembl Genomes (http://www.ensemblgenomes.org) is a new portal offering integrated access to genome-scale data from non-vertebrate species of scientific interest, developed using the Ensembl genome annotation and visualisation platform. Ensembl Genomes consists of five sub-portals (for bacteria, protists, fungi, plants and invertebrate metazoa) designed to complement the availability of vertebrate genomes in Ensembl. Many of the databases supporting the portal have been built in close collaboration with the scientific community, which we consider as essential for maintaining the accuracy and usefulness of the resource. A common set of user interfaces (which include a graphical genome browser, FTP, BLAST search, a query optimised data warehouse, programmatic access, and a Perl API) is provided for all domains. Data types incorporated include annotation of (protein and non-protein coding) genes, cross references to external resources, and high throughput experimental data (e.g. data from large scale studies of gene expression and polymorphism visualised in their genomic context). Additionally, extensive comparative analysis has been performed, both within defined clades and across the wider taxonomy, and sequence alignments and gene trees resulting from this can be accessed through the site
Next-generation sequencing: A challenge to meet the increasing demand for training workshops in Australia
The widespread adoption of high-throughput next-generation sequencing (NGS) technology among the Australian life science research community is highlighting an urgent need to up-skill biologists in tools required for handling and analysing their NGS data. There is currently a shortage of cutting-edge bioinformatics training courses in Australia as a consequence of a scarcity of skilled trainers with time and funding to develop and deliver training courses. To address this, a consortium of Australian research organizations, including Bioplatforms Australia, the Commonwealth Scientific and Industrial Research Organisation and the Australian Bioinformatics Network, have been collaborating with EMBL-EBI training team. A group of Australian bioinformaticians attended the train-the-trainer workshop to improve training skills in developing and delivering bioinformatics workshop curriculum. A 2-day NGS workshop was jointly developed to provide hands-on knowledge and understanding of typical NGS data analysis workflows. The road show–style workshop was successfully delivered at five geographically distant venues in Australia using the newly established Australian NeCTAR Research Cloud. We highlight the challenges we had to overcome at different stages from design to delivery, including the establishment of an Australian bioinformatics training network and the computing infrastructure and resource development. A virtual machine image, workshop materials and scripts for configuring a machine with workshop contents have all been made available under a Creative Commons Attribution 3.0 Unported License. This means participants continue to have convenient access to an environment they had become familiar and bioinformatics trainers are able to access and reuse these resources.Nathan S.Watson-Haigh, Catherine A. Shang, Matthias Haimel, Myrto Kostadima, Remco Loos, Nandan Deshpande, Konsta Duesing, Xi Li, Annette McGrath, Sean McWilliam, Simon Michnowicz, Paula Moolhuijzen, Steve Quenette, Jerico Nico De Leon Revote, SonikaTyagi and Maria V. Schneide
Recommended from our members
Biallelic variants of ATP13A3 cause dose-dependent childhood-onset pulmonary arterial hypertension characterised by extreme morbidity and mortality.
BACKGROUND: The molecular genetic basis of pulmonary arterial hypertension (PAH) is heterogeneous, with at least 26 genes displaying putative evidence for disease causality. Heterozygous variants in the ATP13A3 gene were recently identified as a new cause of adult-onset PAH. However, the contribution of ATP13A3 risk alleles to child-onset PAH remains largely unexplored. METHODS AND RESULTS: We report three families with a novel, autosomal recessive form of childhood-onset PAH due to biallelic ATP13A3 variants. Disease onset ranged from birth to 2.5 years and was characterised by high mortality. Using genome sequencing of parent-offspring trios, we identified a homozygous missense variant in one case, which was subsequently confirmed to cosegregate with disease in an affected sibling. Independently, compound heterozygous variants in ATP13A3 were identified in two affected siblings and in an unrelated third family. The variants included three loss of function variants (two frameshift, one nonsense) and two highly conserved missense substitutions located in the catalytic phosphorylation domain. The children were largely refractory to treatment and four died in early childhood. All parents were heterozygous for the variants and asymptomatic. CONCLUSION: Our findings support biallelic predicted deleterious ATP13A3 variants in autosomal recessive, childhood-onset PAH, indicating likely semidominant dose-dependent inheritance for this gene
Identification of rare sequence variation underlying heritable pulmonary arterial hypertension.
Pulmonary arterial hypertension (PAH) is a rare disorder with a poor prognosis. Deleterious variation within components of the transforming growth factor-β pathway, particularly the bone morphogenetic protein type 2 receptor (BMPR2), underlies most heritable forms of PAH. To identify the missing heritability we perform whole-genome sequencing in 1038 PAH index cases and 6385 PAH-negative control subjects. Case-control analyses reveal significant overrepresentation of rare variants in ATP13A3, AQP1 and SOX17, and provide independent validation of a critical role for GDF2 in PAH. We demonstrate familial segregation of mutations in SOX17 and AQP1 with PAH. Mutations in GDF2, encoding a BMPR2 ligand, lead to reduced secretion from transfected cells. In addition, we identify pathogenic mutations in the majority of previously reported PAH genes, and provide evidence for further putative genes. Taken together these findings contribute new insights into the molecular basis of PAH and indicate unexplored pathways for therapeutic intervention
De Novo Truncating Mutations in WASF1 Cause Intellectual Disability with Seizures.
Next-generation sequencing has been invaluable in the elucidation of the genetic etiology of many subtypes of intellectual disability in recent years. Here, using exome sequencing and whole-genome sequencing, we identified three de novo truncating mutations in WAS protein family member 1 (WASF1) in five unrelated individuals with moderate to profound intellectual disability with autistic features and seizures. WASF1, also known as WAVE1, is part of the WAVE complex and acts as a mediator between Rac-GTPase and actin to induce actin polymerization. The three mutations connected by Matchmaker Exchange were c.1516C>T (p.Arg506Ter), which occurs in three unrelated individuals, c.1558C>T (p.Gln520Ter), and c.1482delinsGCCAGG (p.Ile494MetfsTer23). All three variants are predicted to partially or fully disrupt the C-terminal actin-binding WCA domain. Functional studies using fibroblast cells from two affected individuals with the c.1516C>T mutation showed a truncated WASF1 and a defect in actin remodeling. This study provides evidence that de novo heterozygous mutations in WASF1 cause a rare form of intellectual disability
Traffic exposures, air pollution and outcomes in pulmonary arterial hypertension: A United Kingdom cohort study analysis
While traffic and air pollution exposure is associated with increased mortality in numerous diseases, its association with disease severity and outcomes in pulmonary arterial hypertension (PAH) remains unknown.Exposure to particulate matter ≤2.5 μm3 (PM2.5), nitrogen dioxide (NO2) and indirect measures of traffic-related air pollution (distance to main road and length of roads within buffer zones surrounding residential addresses) were estimated for 301 patients with idiopathic/heritable PAH recruited in the UK PAH national Cohort study. Associations with transplant-free survival and pulmonary hemodynamic severity at baseline were assessed, adjusting for confounding variables defined a priori.Higher estimated exposure to PM2.5 was associated with higher risk of death or lung transplant (Unadjusted hazard ratio (HR) 2.68; 95% CI 1.11-6.47 per 3 μg·m-3, p=0.028). This association remained similar when adjusted for potential confounding variables (HR 4.38; 95% CI 1.44-13.36 per 3 μg·m-3, p=0.009). No associations were found between NO2 exposure or other traffic pollution indicators and transplant-free survival Conversely, indirect measures of exposure to traffic-related air pollution within the 500-1000 m buffer zones correlated with the ERS/ESC risk categories as well as pulmonary hemodynamics at baseline. This association was strongest for pulmonary vascular resistance.In idiopathic/heritable PAH, indirect measures of exposure to traffic-related air pollution were associated with disease severity at baseline, whereas higher PM2.5 exposure may independently predict shorter transplant-free survival
- …