34 research outputs found
Intercellular network structure and regulatory motifs in the human hematopoietic system.
The hematopoietic system is a distributed tissue that consists of functionally distinct cell types continuously produced through hematopoietic stem cell (HSC) differentiation. Combining genomic and phenotypic data with high-content experiments, we have built a directional cell-cell communication network between 12 cell types isolated from human umbilical cord blood. Network structure analysis revealed that ligand production is cell type dependent, whereas ligand binding is promiscuous. Consequently, additional control strategies such as cell frequency modulation and compartmentalization were needed to achieve specificity in HSC fate regulation. Incorporating the in vitro effects (quiescence, self-renewal, proliferation, or differentiation) of 27 HSC binding ligands into the topology of the cell-cell communication network allowed coding of cell type-dependent feedback regulation of HSC fate. Pathway enrichment analysis identified intracellular regulatory motifs enriched in these cell type- and ligand-coupled responses. This study uncovers cellular mechanisms of hematopoietic cell feedback in HSC fate regulation, provides insight into the design principles of the human hematopoietic system, and serves as a foundation for the analysis of intercellular regulation in multicellular systems
Prevalence and Clinical Features of Inflammatory Bowel Diseases Associated With Monogenic Variants, Identified by Whole-Exome Sequencing in 1000 Children at a Single Center
BACKGROUND & AIMS: A proportion of infants and young children with inflammatory bowel diseases (IBDs) have subtypes associated with a single gene variant (monogenic IBD). We aimed to determine the prevalence of monogenic disease in a cohort of pediatric patients with IBD.
METHODS: We performed whole-exome sequencing analyses of blood samples from an unselected cohort of 1005 children with IBD, aged 0-18 years (median age at diagnosis, 11.96 years) at a single center in Canada and their family members (2305 samples total). Variants believed to cause IBD were validated using Sanger sequencing. Biopsies from patients were analyzed by immunofluorescence and histochemical analyses.
RESULTS: We identified 40 rare variants associated with 21 monogenic genes among 31 of the 1005 children with IBD (including 5 variants in XIAP, 3 in DOCK8, and 2 each in FOXP3, GUCY2C, and LRBA). These variants occurred in 7.8% of children younger than 6 years and 2.3% of children aged 6-18 years. Of the 17 patients with monogenic Crohn\u27s disease, 35% had abdominal pain, 24% had nonbloody loose stool, 18% had vomiting, 18% had weight loss, and 5% had intermittent bloody loose stool. The 14 patients with monogenic ulcerative colitis or IBD-unclassified received their diagnosis at a younger age, and their most predominant feature was bloody loose stool (78%). Features associated with monogenic IBD, compared to cases of IBD not associated with a single variant, were age of onset younger than 2 years (odds ratio [OR], 6.30; P = .020), family history of autoimmune disease (OR, 5.12; P = .002), extra-intestinal manifestations (OR, 15.36; P \u3c .0001), and surgery (OR, 3.42; P = .042). Seventeen patients had variants in genes that could be corrected with allogeneic hematopoietic stem cell transplantation.
CONCLUSIONS: In whole-exome sequencing analyses of more than 1000 children with IBD at a single center, we found that 3% had rare variants in genes previously associated with pediatric IBD. These were associated with different IBD phenotypes, and 1% of the patients had variants that could be potentially corrected with allogeneic hematopoietic stem cell transplantation. Monogenic IBD is rare, but should be considered in analysis of all patients with pediatric onset of IBD
Expanding the Landscape of Chromatin Modification (CM)-Related Functional Domains and Genes in Human
Chromatin modification (CM) plays a key role in regulating transcription, DNA replication, repair and recombination. However, our knowledge of these processes in humans remains very limited. Here we use computational approaches to study proteins and functional domains involved in CM in humans. We analyze the abundance and the pair-wise domain-domain co-occurrences of 25 well-documented CM domains in 5 model organisms: yeast, worm, fly, mouse and human. Results show that domains involved in histone methylation, DNA methylation, and histone variants are remarkably expanded in metazoan, reflecting the increased demand for cell type-specific gene regulation. We find that CM domains tend to co-occur with a limited number of partner domains and are hence not promiscuous. This property is exploited to identify 47 potentially novel CM domains, including 24 DNA-binding domains, whose role in CM has received little attention so far. Lastly, we use a consensus Machine Learning approach to predict 379 novel CM genes (coding for 329 proteins) in humans based on domain compositions. Several of these predictions are supported by very recent experimental studies and others are slated for experimental verification. Identification of novel CM genes and domains in humans will aid our understanding of fundamental epigenetic processes that are important for stem cell differentiation and cancer biology. Information on all the candidate CM domains and genes reported here is publicly available
Spatiotemporal integration of molecular and anatomical data in virtual reality using semantic mapping
We have developed a computational framework for spatiotemporal integration of molecular and anatomical datasets in a virtual reality environment. Using two case studies involving gene expression data and pharmacokinetic data, respectively, we demonstrate how existing knowledge bases for molecular data can be semantically mapped onto a standardized anatomical context of human body. Our data mapping methodology uses ontological representations of heterogeneous biomedical datasets and an ontology reasoner to create complex semantic descriptions of biomedical processes. This framework provides a means to systematically combine an increasing amount of biomedical imaging and numerical data into spatiotemporally coherent graphical representations. Our work enables medical researchers with different expertise to simulate complex phenomena visually and to develop insights through the use of shared data, thus paving the way for pathological inference, developmental pattern discovery and biomedical hypothesis testing
DNA methylation signature is prognostic of choroid plexus tumor aggressiveness
Abstract: Background: Histological grading of choroid plexus tumors (CPTs) remains the best prognostic tool to distinguish between aggressive choroid plexus carcinoma (CPC) and the more benign choroid plexus papilloma (CPP) or atypical choroid plexus papilloma (aCPP); however, these distinctions can be challenging. Standard treatment of CPC is very aggressive and often leads to severe damage to the young child’s brain. Therefore, it is crucial to distinguish between CPC and less aggressive entities (CPP or aCPP) to avoid unnecessary exposure of the young patient to neurotoxic therapy. To better stratify CPTs, we utilized DNA methylation (DNAm) to identify prognostic epigenetic biomarkers for CPCs. Methods: We obtained DNA methylation profiles of 34 CPTs using the HumanMethylation450 BeadChip from Illumina, and the data was analyzed using the Illumina Genome Studio analysis software. Validation of differentially methylated CpG sites chosen as biomarkers was performed using pyrosequencing analysis on additional 22 CPTs. Sensitivity testing of the CPC DNAm signature was performed on a replication cohort of 61 CPT tumors obtained from Neuropathology, University Hospital Münster, Germany. Results: Generated genome-wide DNAm profiles of CPTs showed significant differences in DNAm between CPCs and the CPPs or aCPPs. The prediction of clinical outcome could be improved by combining the DNAm profile with the mutational status of TP53. CPCs with homozygous TP53 mutations clustered as a group separate from those carrying a heterozygous TP53 mutation or CPCs with wild type TP53 (TP53-wt) and showed the worst survival outcome. Specific DNAm signatures for CPCs revealed AK1, PER2, and PLSCR4 as potential biomarkers for CPC that can be used to improve molecular stratification for diagnosis and treatment. Conclusions: We demonstrate that combining specific DNAm signature for CPCs with histological approaches better differentiate aggressive tumors from those that are not life threatening. These findings have important implications for future prognostic risk prediction in clinical disease management
Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences
Classifying proteins into subgroups with similar molecular function on the basis of sequence is an important step in deriving reliable functional annotations computationally. So far, however, available classification procedures have been evaluated against protein subgroups that are defined by experts using mainly qualitative descriptions of molecular function. Recently, in vitro DNA-binding preferences to all possible 8-nt DNA sequences have been measured for 178 mouse homeodomains using protein-binding microarrays, offering the unprecedented opportunity of evaluating the classification methods against quantitative measures of molecular function. To this end, we automatically derive homeodomain subtypes from the DNA-binding data and independently group the same domains using sequence information alone. We test five sequence-based methods, which use different sequence-similarity measures and algorithms to group sequences. Results show that methods that optimize the classification robustness reflect well the detailed functional specificity revealed by the experimental data. In some of these classifications, 73–83% of the subfamilies exactly correspond to, or are completely contained in, the function-based subtypes. Our findings demonstrate that certain sequence-based classifications are capable of yielding very specific molecular function annotations. The availability of quantitative descriptions of molecular function, such as DNA-binding data, will be a key factor in exploiting this potential in the future.Canadian Institutes of Health Research (MOP#82940)Sickkids FoundationOntario Research FundNational Science Foundation (U.S.)National Human Genome Research Institute (U.S.) (R01 HG003985
Balancing cost and accuracy in distributed data mining.
Balancing cost and accuracy in distributed data mining