Large-scale social-media analytics on Stratosphere
The importance of social-media platforms and online communities, in business as well as public contexts, is increasingly acknowledged by industry and researchers alike. Consequently, a wide range of analytics has been proposed to understand, steer, and exploit the mechanics and laws that drive their functionality and create the resulting benefits. However, analysts usually face significant problems in scaling existing and novel approaches to match the data volume and size of modern online communities. In this work, we propose and demonstrate the use of the massively parallel data processing system Stratosphere, which is based on second-order functions as an extended notion of the MapReduce paradigm, to provide a new level of scalability for such social-media analytics. Using the popular example of role analysis, we illustrate how this massively parallel approach can be leveraged to scale out complex data-mining tasks while providing a programming approach that eases the formulation of complete analytical workflows.
CGIAR Excellence in Breeding Platform - Plan of Work and Budget 2020
At the end of 2019, all CGIAR centers had submitted improvement plans based on an EiB template and in close collaboration with EiB staff. In a parallel process with breeding programs, funders and private sector representatives, a vision for breeding program modernization was developed and presented to CGIAR breeding leadership at the EiB Annual Meeting. This vision represents an evolution of EiB in the context of the Crops to End Hunger Initiative (CtEH) beyond the initial scope of providing tools, services and expert advice, and serves as a guide for Center leadership to drive changes with EiB support. In addition, EiB has taken on the role of managing and disbursing funding, made available by Funders via CtEH, to modernize breeding and enable CGIAR breeding programs to implement the vision provided by EiB.
ExplainIt! -- A declarative root-cause analysis engine for time series data (extended version)
We present ExplainIt!, a declarative, unsupervised root-cause analysis engine
that uses time series monitoring data from large complex systems such as data
centres. ExplainIt! empowers operators to succinctly specify a large number of
causal hypotheses to search for causes of interesting events. ExplainIt! then
ranks these hypotheses, reducing the number of causal dependencies from
hundreds of thousands to a handful for human understanding. We show how a
declarative language, such as SQL, can be effective in declaratively
enumerating hypotheses that probe the structure of an unknown probabilistic
graphical causal model of the underlying system. Our thesis is that databases
are in a unique position to enable users to rapidly explore the possible causal
mechanisms in data collected from diverse sources. We empirically demonstrate
how ExplainIt! has helped us resolve over 30 performance issues in a commercial
product since late 2014, of which we discuss a few cases in detail.
Comment: SIGMOD Industry Track 201