Annotations for Rule-Based Models
The chapter reviews the syntax to store machine-readable annotations and
describes the mapping between rule-based modelling entities (e.g., agents and
rules) and these annotations. In particular, we review an annotation framework
and the associated guidelines for annotating rule-based models of molecular
interactions, encoded in the commonly used Kappa and BioNetGen languages, and
present prototypes that can be used to extract and query the annotations. An
ontology is used to annotate models and facilitate their description
Exact score distribution computation for ontological similarity searches
Background: Semantic similarity searches in ontologies are an important component of many bioinformatic algorithms, e.g., finding functionally related proteins with the Gene Ontology or phenotypically similar diseases with the Human Phenotype Ontology (HPO). We have recently shown that the performance of semantic similarity searches can be improved by ranking results according to the probability of obtaining a given score at random rather than by the scores themselves. However, to date, there are no algorithms for computing the exact distribution of semantic similarity scores, which is necessary for computing the exact P-value of a given score.
Results: In this paper we consider the exact computation of score distributions for similarity searches in ontologies, and introduce a simple null hypothesis which can be used to compute a P-value for the statistical significance of similarity scores. We concentrate on measures based on Resnik's definition of ontological similarity. A new algorithm is proposed that collapses subgraphs of the ontology graph and thereby allows fast score distribution computation. The new algorithm is several orders of magnitude faster than the naive approach, as we demonstrate by computing score distributions for similarity searches in the HPO. It is shown that exact P-value calculation improves clinical diagnosis using the HPO compared to approaches based on sampling.
Conclusions: The new algorithm enables, for the first time, exact P-value calculation via exact score distribution computation for ontology similarity searches. The approach is applicable to any ontology for which the annotation-propagation rule holds and can improve any bioinformatic method that uses only the raw similarity scores.
The algorithm was implemented in Java, supports any ontology in OBO format, and is available for non-commercial and academic usage at: https://compbio.charite.de/svn/hpo/trunk/src/tools/significance/
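To make the idea of an exact score distribution concrete, the following is a minimal Python sketch of the naive approach that the abstract contrasts against, not the subgraph-collapsing algorithm itself. The toy ontology, the item annotations, the averaging of best-match Resnik scores, and the null model (every query of the same size is equally likely) are all assumptions chosen for illustration; none of the names come from the authors' Java implementation.

```python
import math
from collections import Counter
from itertools import combinations

# Hypothetical toy ontology: each term maps to its direct parents.
PARENTS = {
    "root": [],
    "A": ["root"],
    "B": ["root"],
    "A1": ["A"],
    "A2": ["A"],
    "B1": ["B"],
}

# Hypothetical items (e.g. diseases) with their direct annotations.
ANNOTATIONS = {
    "d1": {"A1"},
    "d2": {"A2", "B1"},
    "d3": {"B"},
    "d4": {"A1", "B1"},
}


def ancestors(term):
    """Return the term itself plus all of its ancestors."""
    seen = {term}
    stack = [term]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def information_content():
    """IC(t) = -log(fraction of items annotated to t after propagation)."""
    counts = Counter()
    for terms in ANNOTATIONS.values():
        # Annotation propagation: an item annotated to a term is also
        # annotated to every ancestor of that term.
        counts.update(set().union(*(ancestors(t) for t in terms)))
    n = len(ANNOTATIONS)
    return {t: -math.log(counts[t] / n) for t in PARENTS if counts[t] > 0}


IC = information_content()


def resnik(t1, t2):
    """Resnik similarity: IC of the most informative common ancestor."""
    return max(IC[a] for a in ancestors(t1) & ancestors(t2))


def score(query, item_terms):
    """Average, over query terms, of the best Resnik match in the item."""
    return sum(max(resnik(q, t) for t in item_terms) for q in query) / len(query)


def exact_p_value(query, item_terms):
    """Naive exact P-value: enumerate every query of the same size."""
    observed = score(query, item_terms)
    all_queries = list(combinations(sorted(PARENTS), len(query)))
    hits = sum(1 for q in all_queries if score(q, item_terms) >= observed - 1e-12)
    return hits / len(all_queries)
```

For example, `exact_p_value(("A1", "B1"), ANNOTATIONS["d4"])` enumerates all 15 two-term queries and finds that only the query itself reaches the observed score. Because the number of queries grows combinatorially with ontology size, this brute-force enumeration is only feasible for toy cases, which is the motivation for a faster algorithm.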
De novo and recessive forms of congenital heart disease have distinct genetic and phenotypic landscapes.
The genetic architecture of sporadic congenital heart disease (CHD) is characterized by enrichment in damaging de novo variants in chromatin-modifying genes. To test the hypothesis that gene pathways contributing to de novo forms of CHD are distinct from those for recessive forms, we analyze 2391 whole-exome trios from the Pediatric Cardiac Genomics Consortium. We deploy a permutation-based gene-burden analysis to identify damaging recessive and compound heterozygous genotypes and disease genes, controlling for confounding effects, such as background mutation rate and ancestry. Cilia-related genes are significantly enriched for damaging rare recessive genotypes, but comparatively depleted for de novo variants. The opposite trend is observed for chromatin-modifying genes. Other cardiac developmental gene classes have less stratification by mode of inheritance than cilia and chromatin-modifying gene classes. Our analyses reveal dominant and recessive CHD are associated with distinct gene functions, with cilia-related genes providing a reservoir of rare segregating variation leading to CHD
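The abstract does not spell out the permutation procedure, so the following Python sketch shows only a generic label-permutation burden test as a simplified stand-in. It is not the authors' pipeline and does not model background mutation rate or ancestry; the function name, its parameters, and the example data are hypothetical.

```python
import random


def permutation_p_value(carrier, is_case, n_perm=10000, seed=0):
    """One-sided permutation P-value for excess carriers among cases.

    carrier: list of 0/1 flags, 1 if the sample carries a damaging
             genotype in the gene of interest (hypothetical input).
    is_case: list of booleans marking cases, in the same sample order.
    """
    rng = random.Random(seed)
    # Observed statistic: number of carriers among the cases.
    observed = sum(c for c, k in zip(carrier, is_case) if k)
    labels = list(is_case)
    hits = 0
    for _ in range(n_perm):
        # Shuffling labels breaks any link between genotype and status.
        rng.shuffle(labels)
        stat = sum(c for c, k in zip(carrier, labels) if k)
        if stat >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)
```

A call such as `permutation_p_value([1] * 8 + [0] * 12, [True] * 10 + [False] * 10)` tests a made-up gene in which all eight carriers are cases. A real analysis would additionally need to preserve confounders within each permutation, for instance by shuffling only within ancestry strata.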