StreamLearner: Distributed Incremental Machine Learning on Event Streams: Grand Challenge

Author: Abdo Majd
Mayer Christian
Mayer Ruben
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 26/06/2017

Today, massive amounts of streaming data from smart devices need to be analyzed automatically to realize the Internet of Things. The Complex Event Processing (CEP) paradigm promises low-latency pattern detection on event streams. However, CEP systems need to be extended with Machine Learning (ML) capabilities such as online training and inference in order to be able to detect fuzzy patterns (e.g., outliers) and to improve pattern recognition accuracy during runtime using incremental model training. In this paper, we propose a distributed CEP system denoted as StreamLearner for ML-enabled complex event detection. The proposed programming model and data-parallel system architecture enable a wide range of real-world applications and allow for dynamically scaling up and out system resources for low-latency, high-throughput event processing. We show that the DEBS Grand Challenge 2017 case study (i.e., anomaly detection in smart factories) integrates seamlessly into the StreamLearner API. Our experiments verify scalability and high event throughput of StreamLearner.

Comment: Christian Mayer, Ruben Mayer, and Majd Abdo. 2017. StreamLearner: Distributed Incremental Machine Learning on Event Streams: Grand Challenge. In Proceedings of the 11th ACM International Conference on Distributed and Event-based Systems (DEBS '17), 298-30

Machine Learning at Microsoft with ML .NET

Author: Ahmed Zeeshan
Amizadeh Saeed
Bilenko Mikhail
Carr Rogan
Chin Wei-Sheng
Dekel Yael
Dupre Xavier
Eksarevskiy Vadim
Erhardt Eric
Eseanu Costin
Filipi Senja
Finley Tom
Goswami Abhishek
Hoover Monte
Inglis Scott
Interlandi Matteo
Katzenberger Shon
Kazmi Najeeb
Krivosheev Gleb
Luferenko Pete
Matantsev Ivan
Matusevych Sergiy
Moradi Shahab
Nazirov Gani
Ormont Justin
Oshri Gal
Pagnoni Artidoro
Parmar Jignesh
Roy Prabhat
Shah Sarthak
Siddiqui Mohammad Zeeshan
Weimer Markus
Zahirazami Shauheen
Zhu Yiwen
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 15/05/2019

Machine Learning is transitioning from an art and science into a technology available to every developer. In the near future, every application on every platform will incorporate trained models to encode data-based decisions that would be impossible for developers to author. This presents a significant engineering challenge, since currently data science and modeling are largely decoupled from standard software development processes. This separation makes incorporating machine learning capabilities inside applications unnecessarily costly and difficult, and furthermore discourages developers from embracing ML in the first place. In this paper we present ML .NET, a framework developed at Microsoft over the last decade in response to the challenge of making it easy to ship machine learning models in large software applications. We present its architecture and illuminate the application demands that shaped it. Specifically, we introduce DataView, the core data abstraction of ML .NET, which allows it to capture full predictive pipelines efficiently and consistently across training and inference lifecycles. We close the paper with a surprisingly favorable performance study of ML .NET compared to more recent entrants, and a discussion of some lessons learned.
