Search CORE

Digital Commons@Becker

Linking genes to diseases with a SNPedia-Gene Wiki mashup

Author: Clarke Erik L
Good Benjamin M
Loguercio Salvatore
Su Andrew I
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background A variety of topic-focused wikis are used in the biomedical sciences to enable the mass-collaborative synthesis and distribution of diverse bodies of knowledge. To address complex problems such as defining the relationships between genes and disease, it is important to bring the knowledge from many different domains together. Here we show how advances in wiki technology and natural language processing can be used to automatically assemble ‘meta-wikis’ that present integrated views over the data collaboratively created in multiple source wikis. Results We produced a semantic meta-wiki called the Gene Wiki+ that automatically mirrors and integrates data from the Gene Wiki and SNPedia. The Gene Wiki+, available at (<url>http://genewikiplus.org/</url>), captures 8,047 distinct gene-disease relationships. SNPedia accounts for 4,149 of the gene-disease pairs, the Gene Wiki provides 4,377 and only 479 appear independently in both sources. All of this content is available to query and browse and is provided as linked open data. Conclusions Wikis contain increasing amounts of diverse, biological information useful for elucidating the connections between genes and disease. The Gene Wiki+ shows how wiki technology can be used in concert with natural language processing to provide integrated views over diverse underlying data sources.</p

Springer - Publisher Connector

eScholarship - University of California

Recommended from our members

Quantitating the epigenetic transformation contributing to cholesterol homeostasis using Gaussian process.

Author: Balch William E
Farhat Nicole Y
Hutt Darren M
Loguercio Salvatore
Porter Forbes D
Scott Samantha M
Subramanian Kanagaraj
Wang Chao
Zhao Pei
Publication venue: eScholarship, University of California
Publication date: 01/11/2019
Field of study

To understand the impact of epigenetics on human misfolding disease, we apply Gaussian-process regression (GPR) based machine learning (ML) (GPR-ML) through variation spatial profiling (VSP). VSP generates population-based matrices describing the spatial covariance (SCV) relationships that link genetic diversity to fitness of the individual in response to histone deacetylases inhibitors (HDACi). Niemann-Pick C1 (NPC1) is a Mendelian disorder caused by >300 variants in the NPC1 gene that disrupt cholesterol homeostasis leading to the rapid onset and progression of neurodegenerative disease. We determine the sequence-to-function-to-structure relationships of the NPC1 polypeptide fold required for membrane trafficking and generation of a tunnel that mediates cholesterol flux in late endosomal/lysosomal (LE/Ly) compartments. HDACi treatment reveals unanticipated epigenomic plasticity in SCV relationships that restore NPC1 functionality. GPR-ML based matrices capture the epigenetic processes impacting information flow through central dogma, providing a framework for quantifying the effect of the environment on the healthspan of the individual

Epigenetic Enhancer Marks and Transcription Factor Binding Influence Vκ Gene Rearrangement in Pre-B Cells and Pro-B Cells

Author: Ann J. Feeney
Eden Kleiman
Salvatore Loguercio
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

To date there has not been a study directly comparing relative Igκ rearrangement frequencies obtained from genomic DNA (gDNA) and cDNA and since each approach has potential biases, this is an important issue to clarify. Here we used deep sequencing to compare the unbiased gDNA and RNA Igκ repertoire from the same pre-B cell pool. We find that ~20% of Vκ genes have rearrangement frequencies ≥2-fold up or down in RNA vs. DNA libraries, including many members of the Vκ3, Vκ4, and Vκ6 families. Regression analysis indicates Ikaros and E2A binding are associated with strong promoters. Within the pre-B cell repertoire, we observed that individual Vκ genes rearranged at very different frequencies, and also displayed very different Jκ usage. Regression analysis revealed that the greatly unequal Vκ gene rearrangement frequencies are best predicted by epigenetic marks of enhancers. In particular, the levels of newly arising H3K4me1 peaks associated with many Vκ genes in pre-B cells are most predictive of rearrangement levels. Since H3K4me1 is associated with long range chromatin interactions which are created during locus contraction, our data provides mechanistic insight into unequal rearrangement levels. Comparison of Igκ rearrangements occurring in pro-B cells and pre-B cells from the same mice reveal a pro-B cell bias toward usage of Jκ-distal Vκ genes, particularly Vκ10-96 and Vκ1-135. Regression analysis indicates that PU.1 binding is the highest predictor of Vκ gene rearrangement frequency in pro-B cells. Lastly, the repertoires of iEκ−/− pre-B cells reveal that iEκ actively influences Vκ gene usage, particularly Vκ3 family genes, overlapping with a zone of iEκ-regulated germline transcription. These represent new roles for iEκ in addition to its critical function in promoting overall Igκ rearrangement. Together, this study provides insight into many aspects of Igκ repertoire formation

Frontiers - Publisher Connector

Probiotics reduce the inflammatory response induced by a high-fat diet in the liver of young rats

Author: Anna Iacono
Antonio Calignano
Arkan
Bibiloni
Boden
Cortez-Pinto
Day
Deleve
Ding
Dunn
Duvnjak
Emanuela Esposito
Esposito
Farrell
Ferrante
Gionchetti
Giuseppe Bianco
Giuseppina Autore
Giuseppina Mattace Raso
Goetzl
Guglielmi
Ilagan
Kashireddy
Kersten
Kojima
Le May
Lee
Li
Lieber
Lieber
Lirussi
Loguercio
Loguercio
Loguercio
Louet
Louet
Mach
McCullough
McGeehan
Mélançon
Mishra
Nanji
O'Hara
Okada
Pietro Vajro
Reddy
Roberto Berni Canani
Roberts
Rosaria Meli
Salvatore Cuzzocrea
Shapiro
Shibolet
Solga
Stienstra
Sumida
Ulisse
Venturi
Xia
Yu
Publication venue
Publication date: 01/01/2009
Field of study

Archivio della ricerca - Università degli studi di Napoli Federico II

Public Library of Science (PLOS)

Integrative Analysis of Low- and High-Resolution eQTL

The study of expression quantitative trait loci (eQTL) is a powerful way of detecting transcriptional regulators at a genomic scale and for elucidating how natural genetic variation impacts gene expression. Power and genetic resolution are heavily affected by the study population: whereas recombinant inbred (RI) strains yield greater statistical power with low genetic resolution, using diverse inbred or outbred strains improves genetic resolution at the cost of lower power. In order to overcome the limitations of both individual approaches, we combine data from RI strains with genetically more diverse strains and analyze hippocampus eQTL data obtained from mouse RI strains (BXD) and from a panel of diverse inbred strains (Mouse Diversity Panel, MDP). We perform a systematic analysis of the consistency of eQTL independently obtained from these two populations and demonstrate that a significant fraction of eQTL can be replicated. Based on existing knowledge from pathway databases we assess different approaches for using the high-resolution MDP data for fine mapping BXD eQTL. Finally, we apply this framework to an eQTL hotspot on chromosome 1 (Qrr1), which has been implicated in a range of neurological traits. Here we present the first systematic examination of the consistency between eQTL obtained independently from the BXD and MDP populations. Our analysis of fine-mapping approaches is based on ‘real life’ data as opposed to simulated data and it allows us to propose a strategy for using MDP data to fine map BXD eQTL. Application of this framework to Qrr1 reveals that this eQTL hotspot is not caused by just one (or few) ‘master regulators’, but actually by a set of polymorphic genes specific to the central nervous system

Carolina Digital Repository

Reductionist and Integrative approaches to explore the H.pylori genome

Author: Loguercio Salvatore
Publication venue: Università degli studi di Padova
Publication date: 01/01/2008
Field of study

The reductionist approach of decomposing biological systems into their constituent parts has dominated molecular biology for half a century. Since organisms are composed solely of atoms and molecules without the participation of extraneous forces, it has been assumed that it should be possible to explain biological systems on the basis of the physico-chemical properties of their individual components, down to the atomic level. However, despite the remarkable success of methodological reductionism in analyzing individual cellular components, it is now generally accepted that the behavior of complex biological systems cannot be understood by studying their individual parts in isolation. To tackle the complexity inherent in understanding large networks of interacting biomolecules, the integrative viewpoint emphasizes cybernetic and systems theoretical methods, using a combination of mathematics, computation and empirical observation. Such an approach is beginning to become feasible in prokaryotes, combining an almost complete view of the genome and transcriptome with a reasonably extensive picture of the proteome. Pathogenic bacteria are undoubtedly the most investigated subjects among prokaryotes. A paradigmatic example is the the human pathogen H.pylori, a causative agent of severe gastroduodenal disorders that infects almost half of the world population. In this thesis, we investigated various aspects of Helicobacter pylori molecular physiology using both reductionist and integrative approaches. In Section I, we have employed a reductionist, bottom-up perspective in studying the Cysteine oxidised/reduced state and the disulphide bridge pattern of an unusual GroES homolog expressed by H.pylori, Heat Shock protein A (HspA). This protein possesses a high Cys content, is involved in nickel binding and exhibits an extended subcellular localization, ranging from cytoplasm to cell surface. We have produced and characterized a recombinant HspA and mutants Cys94Ala and C94A/C111A. The disulphide bridge pattern has been assigned by integrating biochemical methodologies with mass spectrometry. All Cys are engaged in disulphide bonds that force the C-term domain to assume a peculiar closed loop structure, prone to host nickel ions. This novel Ni binding structural arrangement can be related to the Ni uptake/delivery to the extracellular urease, essential for the bacterium survival. In Section II, we combined different computational methods with two main goals: 1) Analyze the H.pylori biomolecular interaction network in an attempt to select new molecular targets against H.pylori infection (Chapters 4 & 5); 2) Model and simulate the signaling perturbations induced by invading H.pylori proteins in the host ephitelial cells (Chapter 6). Chapter 4 explores the 'robust yet fragile' feature of the H.pylori cell, viewed as a complex system in which robustness in response to certain perturbation is inevitably associated with fragility in response to other perturbations. With this in mind, we developed a general strategy aimed at identify control points in bacterial metabolic networks, which could be targets for novel drugs. The methodology is implemented on Helicobacter pylori 26695. The entire metabolic network of the pathogen is analyzed to find biochemically critical points, e.g. enzymes which uniquely consume and/or produce a certain metabolite. Once identified, the list of critical enzymes is filtered in order to find candidate targets wich are non-homologous with the human enzymes. Finally, the essentiality of the identified targets is cross-validated by in silico deletion studies using flux-balance analysis (FBA) on a recent genome-scale metabolic model of H. pylori. Following this approach, we identified some enzymes which could be interesting targets for inhibition studies of H.pylori infection. The study reported in Chapter 5 extends the previously described approach in light of recent theoretical studies on biological networks. These studies suggested that multiple weak attacks on selected targets are inevitably more efficient than the knockout of a single target, thus providing a conceptual framework for the recent success of multi-target drugs. We used this concept to exploit H.pylori metabolic robustness through multiple weak attacks on selected enzymes, therefore directing us toward target-sets discovery for combinatorial therapies. We used the known metabolic and protein interaction data to build an integrated biomolecular network of the pathogen. The network was subsequently screened to find central elements of network communication, e.g. hubs, bridges with high betweenness centrality and overlaps of network communities. The selected enzymes were then classified on the basis of available data about cellular function and essentiality in an attempt to predict successful target-combinations. In order to evaluate the network effect triggered by the partial inactivation of candidate targets, robustness analysis was performed on small groups of selected enzymes using flux balance analysis (FBA) on a recent genome-scale metabolic model of H.pylori. In particular, the FBA simulation framework allowed to predict the growth phenotype associated to every partial inactivation set. The preliminary results obtained so far may help to restrict the initial target-pool in search of target-sets for novel combinatorial drugs against H.pylori persistence. However, our long-term goal is to better understand the indirect network effects that lie at the heart of multi-target drug action and, ultimately, how multiple weak hits can perturb complex biological systems. H.pylori produces various a cytotoxic protein, CagA, that interfere with a very important host signaling pathway, i.e. the epidermal growth factor receptor (EGFR) signaling network. EGFR signaling is one of the most extensively studied areas of signal transduction, since it regulates growth, survival, proliferation and differentiation in mammalian cells. In Chapter 6, we attempted to build an executable model of the EGFR-signaling core process using a process algebra approach. In the EGFR network, the core process is the heart of its underlying hour-glass architecture, as it plays a central role in downstream signaling cascades to gene expression through activation of multiple transcription factors. It consists in a dense array of molecules and interactions wich are tightly coupled to each other. In order to build the executable model, a small set of EGFR core molecules and their interactions is tentatively translated in a BetaWB model. BetaWB is a framework for modelling and simulating biological processes based on Beta-binders language and its stochastic extension. Once obtained, the computational model of the EGFR core process can be used to test and compare hypotheses regarding the principles of operation of the signaling network, i.e. how the EGFR network generates different responses for each set of combinatorial stimuli. In particular, probabilistic model checking can be used to explore the states and possible state changes of the computational model, whereas stochastic simulation (corresponding to the execution of the BetaWB model) may give quantitative insights into the dynamic behaviour of the system in response to different stimuli. Information from the above tecniques allows model validation through comparison within the experimental data available in the literature. The inherent compositionality of the process algebra modeling approach enables further expansion of the EGFR core model, as well as the study of its behavior under specific perturbations, such as invading H.pylori proteins. This latter aspect might be of great value for H.pylori pathogenesis research, as signaling through the EGF receptors is intricately involved in gastric cancer and in many other gastroduodenal diseases

Archivio istituzionale della ricerca - Università di Padova

Dizeez: an online game for human gene-disease annotation.

Author: Andrew I Su
Benjamin M Good
Salvatore Loguercio
Publication venue: Public Library of Science (PLoS)
Publication date: 07/08/2013
Field of study

Structured gene annotations are a foundation upon which many bioinformatics and statistical analyses are built. However the structured annotations available in public databases are a sparse representation of biological knowledge as a whole. The rate of biomedical data generation is such that centralized biocuration efforts struggle to keep up. New models for gene annotation need to be explored that expand the pace at which we are able to structure biomedical knowledge. Recently, online games have emerged as an effective way to recruit, engage and organize large numbers of volunteers to help address difficult biological challenges. For example, games have been successfully developed for protein folding (Foldit), multiple sequence alignment (Phylo) and RNA structure design (EteRNA). Here we present Dizeez, a simple online game built with the purpose of structuring knowledge of gene-disease associations. Preliminary results from game play online and at scientific conferences suggest that Dizeez is producing valid gene-disease annotations not yet present in any public database. These early results provide a basic proof of principle that online games can be successfully applied to the challenge of gene annotation. Dizeez is available at http://genegames.org

Correction: Dizeez: An Online Game for Human Gene-Disease Annotation.

Author: Andrew I. Su
Benjamin M. Good
Salvatore Loguercio
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study