485 research outputs found

    Detection of regulator genes and eQTLs in gene networks

    Full text link
    Genetic differences between individuals associated to quantitative phenotypic traits, including disease states, are usually found in non-coding genomic regions. These genetic variants are often also associated to differences in expression levels of nearby genes (they are "expression quantitative trait loci" or eQTLs for short) and presumably play a gene regulatory role, affecting the status of molecular networks of interacting genes, proteins and metabolites. Computational systems biology approaches to reconstruct causal gene networks from large-scale omics data have therefore become essential to understand the structure of networks controlled by eQTLs together with other regulatory genes, and to generate detailed hypotheses about the molecular mechanisms that lead from genotype to phenotype. Here we review the main analytical methods and softwares to identify eQTLs and their associated genes, to reconstruct co-expression networks and modules, to reconstruct causal Bayesian gene and module networks, and to validate predicted networks in silico.Comment: minor revision with typos corrected; review article; 24 pages, 2 figure

    Systematic gene function prediction from gene expression data by using a fuzzy nearest-cluster method

    Get PDF
    BACKGROUND: Quantitative simultaneous monitoring of the expression levels of thousands of genes under various experimental conditions is now possible using microarray experiments. However, there are still gaps toward whole-genome functional annotation of genes using the gene expression data. RESULTS: In this paper, we propose a novel technique called Fuzzy Nearest Clusters for genome-wide functional annotation of unclassified genes. The technique consists of two steps: an initial hierarchical clustering step to detect homogeneous co-expressed gene subgroups or clusters in each possibly heterogeneous functional class; followed by a classification step to predict the functional roles of the unclassified genes based on their corresponding similarities to the detected functional clusters. CONCLUSION: Our experimental results with yeast gene expression data showed that the proposed method can accurately predict the genes' functions, even those with multiple functional roles, and the prediction performance is most independent of the underlying heterogeneity of the complex functional classes, as compared to the other conventional gene function prediction approaches

    Global Considerations in Hierarchical Clustering Reveal Meaningful Patterns in Data

    Get PDF
    BACKGROUND: A hierarchy, characterized by tree-like relationships, is a natural method of organizing data in various domains. When considering an unsupervised machine learning routine, such as clustering, a bottom-up hierarchical (BU, agglomerative) algorithm is used as a default and is often the only method applied. METHODOLOGY/PRINCIPAL FINDINGS: We show that hierarchical clustering that involve global considerations, such as top-down (TD, divisive), or glocal (global-local) algorithms are better suited to reveal meaningful patterns in the data. This is demonstrated, by testing the correspondence between the results of several algorithms (TD, glocal and BU) and the correct annotations provided by experts. The correspondence was tested in multiple domains including gene expression experiments, stock trade records and functional protein families. The performance of each of the algorithms is evaluated by statistical criteria that are assigned to clusters (nodes of the hierarchy tree) based on expert-labeled data. Whereas TD algorithms perform better on global patterns, BU algorithms perform well and are advantageous when finer granularity of the data is sought. In addition, a novel TD algorithm that is based on genuine density of the data points is presented and is shown to outperform other divisive and agglomerative methods. Application of the algorithm to more than 500 protein sequences belonging to ion-channels illustrates the potential of the method for inferring overlooked functional annotations. ClustTree, a graphical Matlab toolbox for applying various hierarchical clustering algorithms and testing their quality is made available. CONCLUSIONS: Although currently rarely used, global approaches, in particular, TD or glocal algorithms, should be considered in the exploratory process of clustering. In general, applying unsupervised clustering methods can leverage the quality of manually-created mapping of proteins families. As demonstrated, it can also provide insights in erroneous and missed annotations

    Search for new physics with same-sign isolated dilepton events with jets and missing transverse energy

    Get PDF
    A search for new physics is performed in events with two same-sign isolated leptons, hadronic jets, and missing transverse energy in the final state. The analysis is based on a data sample corresponding to an integrated luminosity of 4.98 inverse femtobarns produced in pp collisions at a center-of-mass energy of 7 TeV collected by the CMS experiment at the LHC. This constitutes a factor of 140 increase in integrated luminosity over previously published results. The observed yields agree with the standard model predictions and thus no evidence for new physics is found. The observations are used to set upper limits on possible new physics contributions and to constrain supersymmetric models. To facilitate the interpretation of the data in a broader range of new physics scenarios, information on the event selection, detector response, and efficiencies is provided.Comment: Published in Physical Review Letter

    Compressed representation of a partially defined integer function over multiple arguments

    Get PDF
    In OLAP (OnLine Analitical Processing) data are analysed in an n-dimensional cube. The cube may be represented as a partially defined function over n arguments. Considering that often the function is not defined everywhere, we ask: is there a known way of representing the function or the points in which it is defined, in a more compact manner than the trivial one

    Measurement of jet fragmentation into charged particles in pp and PbPb collisions at sqrt(s[NN]) = 2.76 TeV

    Get PDF
    Jet fragmentation in pp and PbPb collisions at a centre-of-mass energy of 2.76 TeV per nucleon pair was studied using data collected with the CMS detector at the LHC. Fragmentation functions are constructed using charged-particle tracks with transverse momenta pt > 4 GeV for dijet events with a leading jet of pt > 100 GeV. The fragmentation functions in PbPb events are compared to those in pp data as a function of collision centrality, as well as dijet-pt imbalance. Special emphasis is placed on the most central PbPb events including dijets with unbalanced momentum, indicative of energy loss of the hard scattered parent partons. The fragmentation patterns for both the leading and subleading jets in PbPb collisions agree with those seen in pp data at 2.76 TeV. The results provide evidence that, despite the large parton energy loss observed in PbPb collisions, the partition of the remaining momentum within the jet cone into high-pt particles is not strongly modified in comparison to that observed for jets in vacuum.Comment: Submitted to the Journal of High Energy Physic

    Haplotypes of DNA repair and cell cycle control genes, X-ray exposure, and risk of childhood acute lymphoblastic leukemia

    Get PDF
    [[abstract]]Background: Acute leukemias of childhood are a heterogeneous group of malignancies characterized by cytogenetic abnormalities, such as translocations and changes in ploidy. These abnormalities may be influenced by altered DNA repair and cell cycle control processes. Methods: We examined the association between childhood acute lymphoblastic leukemia (ALL) and 32 genes in DNA repair and cell cycle pathways using a haplotype-based approach, among 377 childhood ALL cases and 448 controls enrolled during 1995-2002. Results: We found that haplotypes in APEX1, BRCA2, ERCC2, and RAD51 were significantly associated with total ALL, while haplotypes in NBN and XRCC4, and CDKN2A were associated with structural and numerical change subtypes, respectively. In addition, we observed statistically significant interaction between exposure to 3 or more diagnostic X-rays and haplotypes of XRCC4 on risk of structural abnormality-positive childhood ALL. Conclusions: These results support a role of altered DNA repair and cell cycle processes in the risk of childhood ALL, and show that this genetic susceptibility can differ by cytogenetic subtype and may be modified by exposure to ionizing radiation. To our knowledge, our study is the first to broadly examine the DNA repair and cell cycle pathways using a haplotype approach in conjunction with X-ray exposures in childhood ALL risk. If confirmed, future studies are needed to identify specific functional SNPs in the regions of interest identified in this analysis

    Prediction of Phenotype-Associated Genes via a Cellular Network Approach: A Candida albicans Infection Case Study

    Get PDF
    Candida albicans is the most prevalent opportunistic fungal pathogen in humans causing superficial and serious systemic infections. The infection process can be divided into three stages: adhesion, invasion, and host cell damage. To enhance our understanding of these C. albicans infection stages, this study aimed to predict phenotype-associated genes involved during these three infection stages and their roles in C. albicans–host interactions. In light of the principles that proteins that lie closer to one another in a protein interaction network are more likely to have similar functions, and that genes regulated by the same transcription factors tend to have similar functions, a cellular network approach was proposed to predict the phenotype-associated genes in this study. A total of 4, 12, and 3 genes were predicted as adhesion-, invasion-, and damage-associated genes during C. albicans infection, respectively. These predicted genes highlight the facts that cell surface components are critical for cell adhesion, and that morphogenesis is crucial for cell invasion. In addition, they provide targets for further investigations into the mechanisms of the three C. albicans infection stages. These results give insights into the responses elicited in C. albicans during interaction with the host, possibly instrumental in identifying novel therapies to treat C. albicans infection
    corecore