11 research outputs found

    Statistical inference framework for source detection of contagion processes on arbitrary network structures

    Get PDF
    In this paper we introduce a statistical inference framework for estimating the contagion source from a partially observed contagion spreading process on an arbitrary network structure. The framework is based on a maximum likelihood estimation of a partial epidemic realization and involves large scale simulation of contagion spreading processes from the set of potential source locations. We present a number of different likelihood estimators that are used to determine the conditional probabilities associated to observing partial epidemic realization with particular source location candidates. This statistical inference framework is also applicable for arbitrary compartment contagion spreading processes on networks. We compare estimation accuracy of these approaches in a number of computational experiments performed with the SIR (susceptible-infected-recovered), SI (susceptible-infected) and ISS (ignorant-spreading-stifler) contagion spreading models on synthetic and real-world complex networks

    Crowdsourced mapping of unexplored target space of kinase inhibitors

    Get PDF
    Despite decades of intensive search for compounds that modulate the activity of particular protein targets, a large proportion of the human kinome remains as yet undrugged. Effective approaches are therefore required to map the massive space of unexplored compound-kinase interactions for novel and potent activities. Here, we carry out a crowdsourced benchmarking of predictive algorithms for kinase inhibitor potencies across multiple kinase families tested on unpublished bioactivity data. We find the top-performing predictions are based on various models, including kernel learning, gradient boosting and deep learning, and their ensemble leads to a predictive accuracy exceeding that of single-dose kinase activity assays. We design experiments based on the model predictions and identify unexpected activities even for under-studied kinases, thereby accelerating experimental mapping efforts. The open-source prediction algorithms together with the bioactivities between 95 compounds and 295 kinases provide a resource for benchmarking prediction algorithms and for extending the druggable kinome. The IDG-DREAM Challenge carried out crowdsourced benchmarking of predictive algorithms for kinase inhibitor activities on unpublished data. This study provides a resource to compare emerging algorithms and prioritize new kinase activities to accelerate drug discovery and repurposing efforts

    Disentangling Sources of Influence in Online Social Networks

    Full text link
    Information propagation in online social networks is facilitated by two types of influence - endogenous (peer) influence that acts between users of the social network and exogenous (external) that corresponds to various external mediators such as online news media. However, inference of these influences from data remains a challenge, especially when data on the activation of users is scarce. In this paper we propose a methodology that yields estimates of both endogenous and exogenous influence using only a social network structure and a single activation cascade. Our method exploits the statistical differences between the two types of influence - endogenous is dependent on the social network structure and current state of each user while exogenous is independent of these. We evaluate our methodology on simulated activation cascades as well as on cascades obtained from several large Facebook political survey applications. We show that our methodology is able to provide estimates of endogenous and exogenous influence in online social networks, characterize activation of each individual user as being endogenously or exogenously driven, and identify most influential groups of users

    Good Governance Problems and Recent Financial Crises in Some EU Countries [Dataset]

    No full text
    This study examines 147 banking crises in the period of 1976-2011 documented by the International Monetary Fund. The countries affected by crises are analysed in respect of publicly available World Bank indicators in the periods of three years before the crises. Machine learning methodology for subgroup discovery is used for the analysis. It enabled identification of five subsets of crises. Two of them are identified as especially useful for the characterization of EU countries affected by the banking crises in the year 2008. Fast growing credit activity is a characteristic for the first subgroup while socioeconomic problems recognized by non-increasing quality of public health are decisive for the second subgroup. Comparative analysis of the EU countries included into the second subgroup and the EU countries affected by the banking crises but not included into this subgroup demonstrated statistically significant differences in respect of World Bank good governance indicator values for the period before the crisis. Control of corruption, rule of law, and government effectiveness are the indicators that are statistically different for these sets of countries. The result is fully in accordance with the Francisâ's model connecting governance indicators and financial fragility. The significance of the result is in the segmentation of the corpus of countries with banking crises and recognition of connections between banking crises, socioeconomic problems, and governance effectiveness in some EU countries. The conclusions of the study might be useful for the policy makers in stressing that future banking crises prevention should also focus on governance effectiveness, more strict law implementation and especially on measures against corruption

    Good Governance Problems and Recent Financial Crises in Some EU Countriestitle of article [Dataset]

    No full text
    The starting point for the research has been the list of 147 banking crises within the period 1976-2011 prepared by the International Monetary Fund. The countries with crises have been analysed with respect to publicly available World Bank indicators in the periods of three years before the crises. The machine learning methodology for subgroup discovery has been used for the analysis. It enabled identification of five subsets of crises. Two of them have been identified as especially useful for the characterization of EU countries with banking crises in the year 2008. Fast growing credit activity is characteristic for the first subgroup while socioeconomic problems recognized by non-increasing quality of public health are decisive for the second subgroup. Comparative analysis of EU countries included into these subgroups demonstrated statistically significant differences with respect to World Bank good governance indicator values for the period before the crisis. Control of corruption, rule of law, and government effectiveness are the indicators which are statistically different for these sets of countries. The significance of the result is in the segmentation of the corpus of countries with banking crises and the recognition of connections between banking crises, socioeconomic problems, and governance effectiveness in some EU countrie
    corecore