11,568 research outputs found
NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases
Enrichment analysis is a widely applied procedure for shedding light on the molecular mechanisms and functions at the basis of phenotypes, for enlarging the dataset of possibly related genes/proteins and for helping interpretation and prioritization of newly determined variations. Several standard and Network-based enrichment methods are available. Both approaches rely on the annotations that characterize the genes/proteins included in the input set; network based ones also include in different ways physical and functional relationships among different genes or proteins that can be extracted from the available biological networks of interactions
Annotation Enrichment Analysis: An Alternative Method for Evaluating the Functional Properties of Gene Sets
Gene annotation databases (compendiums maintained by the scientific community
that describe the biological functions performed by individual genes) are
commonly used to evaluate the functional properties of experimentally derived
gene sets. Overlap statistics, such as Fisher's Exact Test (FET), are often
employed to assess these associations, but don't account for non-uniformity in
the number of genes annotated to individual functions or the number of
functions associated with individual genes. We find FET is strongly biased
toward over-estimating overlap significance if a gene set has an unusually high
number of annotations. To correct for these biases, we develop Annotation
Enrichment Analysis (AEA), which properly accounts for the non-uniformity of
annotations. We show that AEA is able to identify biologically meaningful
functional enrichments that are obscured by numerous false-positive enrichment
scores in FET, and we therefore suggest it be used to more accurately assess
the biological properties of gene sets
A Systemic Receptor Network Triggered by Human cytomegalovirus Entry
Virus entry is a multistep process that triggers a variety of cellular
pathways interconnecting into a complex network, yet the molecular complexity
of this network remains largely unsolved. Here, by employing systems biology
approach, we reveal a systemic virus-entry network initiated by human
cytomegalovirus (HCMV), a widespread opportunistic pathogen. This network
contains all known interactions and functional modules (i.e. groups of
proteins) coordinately responding to HCMV entry. The number of both genes and
functional modules activated in this network dramatically declines shortly,
within 25 min post-infection. While modules annotated as receptor system, ion
transport, and immune response are continuously activated during the entire
process of HCMV entry, those for cell adhesion and skeletal movement are
specifically activated during viral early attachment, and those for immune
response during virus entry. HCMV entry requires a complex receptor network
involving different cellular components, comprising not only cell surface
receptors, but also pathway components in signal transduction, skeletal
development, immune response, endocytosis, ion transport, macromolecule
metabolism and chromatin remodeling. Interestingly, genes that function in
chromatin remodeling are the most abundant in this receptor system, suggesting
that global modulation of transcriptions is one of the most important events in
HCMV entry. Results of in silico knock out further reveal that this entire
receptor network is primarily controlled by multiple elements, such as EGFR
(Epidermal Growth Factor) and SLC10A1 (sodium/bile acid cotransporter family,
member 1). Thus, our results demonstrate that a complex systemic network, in
which components coordinating efficiently in time and space contributes to
virus entry.Comment: 26 page
Methods for interpreting lists of affected genes obstained in a DNA microarray experiment
Background - The aim of this paper was to describe and compare the methods used and the results obtained by the participants in a joint EADGENE (European Animal Disease Genomic Network of Excellence) and SABRE (Cutting Edge Genomics for Sustainable Animal Breeding) workshop focusing on post analysis of microarray data. The participating groups were provided with identical lists of microarray probes, including test statistics for three different contrasts, and the normalised log-ratios for each array, to be used as the starting point for interpreting the affected probes. The data originated from a microarray experiment conducted to study the host reactions in broilers occurring shortly after a secondary challenge with either a homologous or heterologous species of Eimeria. Results - Several conceptually different analytical approaches, using both commercial and public available software, were applied by the participating groups. The following tools were used: Ingenuity Pathway Analysis, MAPPFinder, LIMMA, GOstats, GOEAST, GOTM, Globaltest, TopGO, ArrayUnlock, Pathway Studio, GIST and AnnotationDbi. The main focus of the approaches was to utilise the relation between probes/genes and their gene ontology and pathways to interpret the affected probes/genes. The lack of a well-annotated chicken genome did though limit the possibilities to fully explore the tools. The main results from these analyses showed that the biological interpretation is highly dependent on the statistical method used but that some common biological conclusions could be reached. Conclusion - It is highly recommended to test different analytical methods on the same data set and compare the results to obtain a reliable biological interpretation of the affected genes in a DNA microarray experimen
Recommended from our members
Dissecting the genetic basis of comorbid epilepsy phenotypes in neurodevelopmental disorders.
BACKGROUND:Neurodevelopmental disorders (NDDs) such as autism spectrum disorder, intellectual disability, developmental disability, and epilepsy are characterized by abnormal brain development that may affect cognition, learning, behavior, and motor skills. High co-occurrence (comorbidity) of NDDs indicates a shared, underlying biological mechanism. The genetic heterogeneity and overlap observed in NDDs make it difficult to identify the genetic causes of specific clinical symptoms, such as seizures. METHODS:We present a computational method, MAGI-S, to discover modules or groups of highly connected genes that together potentially perform a similar biological function. MAGI-S integrates protein-protein interaction and co-expression networks to form modules centered around the selection of a single "seed" gene, yielding modules consisting of genes that are highly co-expressed with the seed gene. We aim to dissect the epilepsy phenotype from a general NDD phenotype by providing MAGI-S with high confidence NDD seed genes with varying degrees of association with epilepsy, and we assess the enrichment of de novo mutation, NDD-associated genes, and relevant biological function of constructed modules. RESULTS:The newly identified modules account for the increased rate of de novo non-synonymous mutations in autism, intellectual disability, developmental disability, and epilepsy, and enrichment of copy number variations (CNVs) in developmental disability. We also observed that modules seeded with genes strongly associated with epilepsy tend to have a higher association with epilepsy phenotypes than modules seeded at other neurodevelopmental disorder genes. Modules seeded with genes strongly associated with epilepsy (e.g., SCN1A, GABRA1, and KCNB1) are significantly associated with synaptic transmission, long-term potentiation, and calcium signaling pathways. On the other hand, modules found with seed genes that are not associated or weakly associated with epilepsy are mostly involved with RNA regulation and chromatin remodeling. CONCLUSIONS:In summary, our method identifies modules enriched with de novo non-synonymous mutations and can capture specific networks that underlie the epilepsy phenotype and display distinct enrichment in relevant biological processes. MAGI-S is available at https://github.com/jchow32/magi-s
WormBase: a comprehensive resource for nematode research
WormBase (http://www.wormbase.org) is a central data repository for nematode biology. Initially created as a service to the Caenorhabditis elegans research field, WormBase has evolved into a powerful research tool in its own right. In the past 2 years, we expanded WormBase to include the complete genomic sequence, gene predictions and orthology assignments from a range of related nematodes. This comparative data enrich the C. elegans data with improved gene predictions and a better understanding of gene function. In turn, they bring the wealth of experimental knowledge of C. elegans to other systems of medical and agricultural importance. Here, we describe new species and data types now available at WormBase. In addition, we detail enhancements to our curatorial pipeline and website infrastructure to accommodate new genomes and an extensive user base
eXamine: a Cytoscape app for exploring annotated modules in networks
Background. Biological networks have growing importance for the
interpretation of high-throughput "omics" data. Statistical and combinatorial
methods allow to obtain mechanistic insights through the extraction of smaller
subnetwork modules. Further enrichment analyses provide set-based annotations
of these modules.
Results. We present eXamine, a set-oriented visual analysis approach for
annotated modules that displays set membership as contours on top of a
node-link layout. Our approach extends upon Self Organizing Maps to
simultaneously lay out nodes, links, and set contours.
Conclusions. We implemented eXamine as a freely available Cytoscape app.
Using eXamine we study a module that is activated by the virally-encoded
G-protein coupled receptor US28 and formulate a novel hypothesis about its
functioning
- …