78 research outputs found
Implementing a genomic data management system using iRODS in the Wellcome Trust Sanger Institute
<p>Abstract</p> <p>Background</p> <p>Increasingly large amounts of DNA sequencing data are being generated within the Wellcome Trust Sanger Institute (WTSI). The traditional file system struggles to handle these increasing amounts of sequence data. A good data management system therefore needs to be implemented and integrated into the current WTSI infrastructure. Such a system enables good management of the IT infrastructure of the sequencing pipeline and allows biologists to track their data.</p> <p>Results</p> <p>We have chosen a data grid system, iRODS (Rule-Oriented Data management systems), to act as the data management system for the WTSI. iRODS provides a rule-based system management approach which makes data replication much easier and provides extra data protection. Unlike the metadata provided by traditional file systems, the metadata system of iRODS is comprehensive and allows users to customize their own application level metadata. Users and IT experts in the WTSI can then query the metadata to find and track data.</p> <p>The aim of this paper is to describe how we designed and used (from both system and user viewpoints) iRODS as a data management system. Details are given about the problems faced and the solutions found when iRODS was implemented. A simple use case describing how users within the WTSI use iRODS is also introduced.</p> <p>Conclusions</p> <p>iRODS has been implemented and works as the production system for the sequencing pipeline of the WTSI. Both biologists and IT experts can now track and manage data, which could not previously be achieved. This novel approach allows biologists to define their own metadata and query the genomic data using those metadata.</p
Recommended from our members
Individual and Joint Effects of Early-Life Ambient PM2.5 Exposure and Maternal Prepregnancy Obesity on Childhood Overweight or Obesity
Background: Although previous studies suggest that exposure to traffic-related pollution during childhood increases the risk of childhood overweight or obesity (COWO), the role of early life exposure to fine particulate matter (aerodynamic diameter <2.5μm; PM2.5) and its joint effect with the mother's prepregnancy body mass index (MPBMI) on COWO remain unclear. Objectives: The present study was conducted to examine the individual and joint effects of ambient PM2.5 exposures and MPBMI on the risk of COWO. Methods: We estimated exposures to ambient PM2.5 in utero and during the first 2 y of life (F2YL), using data from the U.S. Environmental Protection Agency’s (EPA's) Air Quality System matched to residential address, in 1,446 mother–infant pairs who were recruited at birth from 1998 and followed up prospectively through 2012 at the Boston Medical Center in Massachusetts. We quantified the individual and joint effects of PM2.5 exposure with MPBMI on COWO, defined as the child's age- and sex-specific BMI z-score ≥85th percentile at the last well-child care visit between 2 and 9 y of age. Additivity was assessed by estimating the reduced excess risk due to interaction. Results: Comparing the highest and lowest quartiles of PM2.5, the adjusted relative risks (RRs) [95% confidence intervals (CIs)] of COWO were 1.3 (95% CI: 1.1, 1.5), 1.2 (95% CI: 1.0, 1.4), 1.2 (95% CI: 1.0, 1.4), 1.3 (95% CI: 1.1, 1.6), 1.3 (95% CI: 1.1, 1.5) and 1.3 (1.1, 1.5) during preconception; the first, second, and third trimesters; the entire period of pregnancy; and F2YL, respectively. Spline regression showed a dose–response relationship between PM2.5 levels and COWO after a threshold near the median exposure (10.46μg/m3–10.89μg/m3). Compared with their counterparts, children of obese mothers exposed to high levels of PM2.5 had the highest risk of COWO [RR≥2.0, relative excess risk due to interaction (RERI) not significant]. Conclusions: In the present study, we observed that early life exposure to PM2.5 may play an important role in the early life origins of COWO and may increase the risk of COWO in children of mothers who were overweight or obese before pregnancy beyond the risk that can be attributed to MPBMI alone. Our findings emphasize the clinical and public health policy relevance of early life PM2.5 exposure. https://doi.org/10.1289/EHP26
QKI is a critical pre-mRNA alternative splicing regulator of cardiac myofibrillogenesis and contractile function
The RNA-binding protein QKI belongs to the hnRNP K-homology domain protein family, a well-known regulator of pre-mRNA alternative splicing and is associated with several neurodevelopmental disorders. Qki is found highly expressed in developing and adult hearts. By employing the human embryonic stem cell (hESC) to cardiomyocyte differentiation system and generating QKI-deficient hESCs (hESCs-QKIdel) using CRISPR/Cas9 gene editing technology, we analyze the physiological role of QKI in cardiomyocyte differentiation, maturation, and contractile function. hESCs-QKIdel largely maintain normal pluripotency and normal differentiation potential for the generation of early cardiogenic progenitors, but they fail to transition into functional cardiomyocytes. In this work, by using a series of transcriptomic, cell and biochemical analyses, and the Qki-deficient mouse model, we demonstrate that QKI is indispensable to cardiac sarcomerogenesis and cardiac function through its regulation of alternative splicing in genes involved in Z-disc formation and contractile physiology, suggesting that QKI is associated with the pathogenesis of certain forms of cardiomyopathies
The oyster genome reveals stress adaptation and complexity of shell formation
The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa. © 2012 Macmillan Publishers Limited. All rights reserved
A comprehensive and non-redundant database of protein domain movements
EThOS - Electronic Theses Online ServiceGBUnited Kingdo
- …