14 research outputs found
FAIR AI Models in High Energy Physics
The findable, accessible, interoperable, and reusable (FAIR) data principles serve as a framework for examining, evaluating, and improving data sharing to advance scientific endeavors. There is an emerging trend to adapt these principles for machine learning models—algorithms that learn from data without specific coding—and, more generally, AI models, due to AI’s swiftly growing impact on scientific and engineering sectors. In this paper, we propose a practical definition of the FAIR principles for AI models and provide a template program for their adoption. We exemplify this strategy with an implementation from high-energy physics, where a graph neural network is employed to detect Higgs bosons decaying into two bottom quarks
A Cloud-Based Framework for Machine Learning Workloads and Applications
[EN] In this paper we propose a distributed architecture to provide machine learning practitioners with a set of tools and cloud services that cover the whole machine learning development cycle: ranging from the models creation, training, validation and testing to the models serving as a service, sharing and publication. In such respect, the DEEP-Hybrid-DataCloud framework allows transparent access to existing e-Infrastructures, effectively exploiting distributed resources for the most compute-intensive tasks coming from the machine learning development cycle. Moreover, it provides scientists with a set of Cloud-oriented services to make their models publicly available, by adopting a serverless architecture and a DevOps approach, allowing an easy share, publish and deploy of the developed models.This work was supported by the project DEEP-Hybrid-DataCloud ``Designing and Enabling E-infrastructures for intensive Processing in a Hybrid DataCloud'' that has received funding from the European Union's Horizon 2020 Research and Innovation Programme under Grant 777435Lopez Garcia, A.; Marco De Lucas, J.; Antonacci, M.; Zu Castell, W.; David, M.; Hardt, M.; Lloret Iglesias, L.... (2020). A Cloud-Based Framework for Machine Learning Workloads and Applications. IEEE Access. 8:18681-18692. https://doi.org/10.1109/ACCESS.2020.2964386S1868118692
The Pipeline for the Continuous Development of Artificial Intelligence Models -- Current State of Research and Practice
Companies struggle to continuously develop and deploy AI models to complex
production systems due to AI characteristics while assuring quality. To ease
the development process, continuous pipelines for AI have become an active
research area where consolidated and in-depth analysis regarding the
terminology, triggers, tasks, and challenges is required. This paper includes a
Multivocal Literature Review where we consolidated 151 relevant formal and
informal sources. In addition, nine-semi structured interviews with
participants from academia and industry verified and extended the obtained
information. Based on these sources, this paper provides and compares
terminologies for DevOps and CI/CD for AI, MLOps, (end-to-end) lifecycle
management, and CD4ML. Furthermore, the paper provides an aggregated list of
potential triggers for reiterating the pipeline, such as alert systems or
schedules. In addition, this work uses a taxonomy creation strategy to present
a consolidated pipeline comprising tasks regarding the continuous development
of AI. This pipeline consists of four stages: Data Handling, Model Learning,
Software Development and System Operations. Moreover, we map challenges
regarding pipeline implementation, adaption, and usage for the continuous
development of AI to these four stages.Comment: accepted in the Journal Systems and Softwar
An exchange-based AIoT platform for fast AI application development
AIoT is the combination of Internet of Things (IoT) and Artificial Intelligence (AI) technologies. While IoT emphasizes more on scalable and efficient communications, AI focuses more on reproducing human capabilities such as recognition and forecasting. An efficient AIoT platform may not be obtained directly from integrating existing IoT and AI serving platforms by considering the AIoT service reproduction and evolution. In this work, we propose an AIoT platform that empowers developers to build sophisticated and scalable applications. Our platform is derived based on exchange-based RabbitMQ broker and Advanced Message Queuing Protocol (AMQP) to facilitate the communications among heterogeneous data sources and AI models. By incorporating an AMQP broker, it supports diverse data exchanges, AI models chaining, and flexible message routing and processing. AI models can be deployed efficiently through containerization with flexible and shared data paths to facilitate computations. Hence, developers can focus on service and application requirements. We also present a case study in smart healthcare to validate our design