140 research outputs found

    On the CALM principle for BSP computation

    Get PDF
    In recent times, considerable emphasis has been given to two apparently disjoint research topics: data-parallel and eventually consistent, distributed systems. In this paper we propose a study on an eventually consistent, dataparallel computational model, the keystone of which is provided by the recent finding that a class of programs exists that can be computed in an eventually consistent, coordination-free way: monotonic programs. This principle is called CALM and has been proven by Ameloot et al. for distributed, asynchronous settings. We advocate that CALM should be employed as a basic theoretical tool also for data-parallel systems, wherein computation usually proceeds synchronously in rounds and where communication is assumed to be reliable. We deem this problem relevant and interesting, especially for what concerns parallel workflow optimization, and make the case that CALM does not hold in general for dataparallel systems if the techniques developed by Ameloot et al. are directly used. In this paper we sketch how, using novel techniques, the satisfiability of the if direction of the CALM principle can still be obtained, although just for a subclass of monotonic queries

    Pushing ML Predictions into DBMSs

    Get PDF
    In the past decade, many approaches have been suggested to execute ML workloads on a DBMS. However, most of them have looked at in-DBMS ML from a training perspective, whereas ML inference has been largely overlooked. We think that this is an important gap to fill for two main reasons: (1) in the near future, every application will be infused with some sort of ML capability; (2) behind every web page, application, and enterprise there is a DBMS, whereby in-DBMS inference is an appealing solution both for efficiency (e.g., less data movement), performance (e.g., cross-optimizations between relational operators and ML) and governance. In this paper, we study whether DBMSs are a good fit for prediction serving. We introduce a technique for translating trained ML pipelines containing both featurizers (e.g., one-hot encoding) and models (e.g., linear and tree-based models) into SQL queries, and we compare in-DBMS performance against popular ML frameworks such as Sklearn and ml.net. Our experiments show that, when pushed inside a DBMS, trained ML pipelines can have performance comparable to ML frameworks in several scenarios, while they perform quite poorly on text featurization and over (even simple) neural networks

    Variabilità geo-socio-prosodica: dati linguistici e statistici

    Get PDF

    Effects of Total Resources, Resource Ratios, and Species Richness on Algal Productivity and Evenness at Both Metacommunity and Local Scales

    Get PDF
    The study of the interrelationship between productivity and biodiversity is a major research field in ecology. Theory predicts that if essential resources are heterogeneously distributed across a metacommunity, single species may dominate productivity in individual metacommunity patches, but a mixture of species will maximize productivity across the whole metacommunity. It also predicts that a balanced supply of resources within local patches should favor species coexistence, whereas resource imbalance would favor the dominance of one species. We performed an experiment with five freshwater algal species to study the effects of total supply of resources, their ratios, and species richness on biovolume production and evenness at the scale of both local patches and metacommunities. Generally, algal biovolume increased, whereas algal resource use efficiency (RUE) and evenness decreased with increasing total supply of resources in mixed communities containing all five species. In contrast to predictions for biovolume production, the species mixtures did not outperform all monocultures at the scale of metacommunities. In other words, we observed no general transgressive overyielding. However, RUE was always higher in mixtures than predicted from monocultures, and analyses indicate that resource partitioning or facilitation in mixtures resulted in higher-than-expected productivity at high resource supply. Contrasting our predictions for the local scale, balanced supply of resources did not generally favor higher local evenness, however lowest evenness was confined to patches with the most imbalanced supply. Thus, our study provides mixed support for recent theoretical advancements to understand biodiversity-productivity relationships

    Influence of phosphorus on copper sensitivity of fluvial periphyton: the role of chemical, physiological and community-related factors

    Get PDF
    The influence of eutrophication of fluvial ecosystems (caused by increased phosphorus concentrations) on periphyton Cu sensitivity is explored from a multi-scale perspective, going from the field to the laboratory. The study design included three tiers: a field study including the characterization of land use and the ecological state of the corresponding river sections in the Fluvià River watershed, an experimental investigation performed with natural periphyton from the previously studied stream sites in indoor channels, and finally a culture study in the laboratory. Results showed that differences in copper sensitivity of natural periphyton communities followed the gradient of nutrient concentration found in the field. Results from the culture experiments demonstrated that both, P-conditions during growth and P-content in the media are important factors modulating the toxicological response of algae to Cu. The observations from this study indicate that the ecological effects of metal pollution in rivers might be obscured by eutrophication

    Data provenance to audit compliance with privacy policy in the Internet of Things

    Get PDF
    Managing privacy in the IoT presents a significant challenge. We make the case that information obtained by auditing the flows of data can assist in demonstrating that the systems handling personal data satisfy regulatory and user requirements. Thus, components handling personal data should be audited to demonstrate that their actions comply with all such policies and requirements. A valuable side-effect of this approach is that such an auditing process will highlight areas where technical enforcement has been incompletely or incorrectly specified. There is a clear role for technical assistance in aligning privacy policy enforcement mechanisms with data protection regulations. The first step necessary in producing technology to accomplish this alignment is to gather evidence of data flows. We describe our work producing, representing and querying audit data and discuss outstanding challenges.Engineering and Applied Science
    corecore