Search CORE

14 research outputs found

FAIR AI Models in High Energy Physics

Author: Diaz Daniel
Duarte Javier
Harris Philip
Huerta E. A.
Kansal Raghav
Katz Daniel S.
Kavoori Ishaan H.
Kindratenko Volodymyr V.
Li Haoyang
Mokhtar Farouk
Neubauer Mark S.
Park Sang Eon
Quinnan Melissa
Roy Avik
Rusack Roger
Zhao Zhizhen
Zhu Ruike
Publication venue: EDP Sciences
Publication date: 01/01/2024
Field of study

The findable, accessible, interoperable, and reusable (FAIR) data principles serve as a framework for examining, evaluating, and improving data sharing to advance scientific endeavors. There is an emerging trend to adapt these principles for machine learning models—algorithms that learn from data without specific coding—and, more generally, AI models, due to AI’s swiftly growing impact on scientific and engineering sectors. In this paper, we propose a practical definition of the FAIR principles for AI models and provide a template program for their adoption. We exemplify this strategy with an implementation from high-energy physics, where a graph neural network is employed to detect Higgs bosons decaying into two bottom quarks

Directory of Open Access Journals

A Cloud-Based Framework for Machine Learning Workloads and Applications

Author: Alic A.S.
Antonacci M.
Caballer M.
Costantini A.
David M.
De Lucas J.M.
Dlugolinsky S.
Donvito G.
Duma D.C.
Gomes J.
Hardt M.
Heredia Cacha I.
Ito K.
Kozlov V.Y.
Lloret Iglesias L.
Lopez Garcia A.
Molto G.
Nguyen G.
Orviz Fernandez P.
Plasencia I.C.
Plociennik M.
Sustr Z.
Tran V.
Wolniewicz P.
zu Castell W.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

[EN] In this paper we propose a distributed architecture to provide machine learning practitioners with a set of tools and cloud services that cover the whole machine learning development cycle: ranging from the models creation, training, validation and testing to the models serving as a service, sharing and publication. In such respect, the DEEP-Hybrid-DataCloud framework allows transparent access to existing e-Infrastructures, effectively exploiting distributed resources for the most compute-intensive tasks coming from the machine learning development cycle. Moreover, it provides scientists with a set of Cloud-oriented services to make their models publicly available, by adopting a serverless architecture and a DevOps approach, allowing an easy share, publish and deploy of the developed models.This work was supported by the project DEEP-Hybrid-DataCloud ``Designing and Enabling E-infrastructures for intensive Processing in a Hybrid DataCloud'' that has received funding from the European Union's Horizon 2020 Research and Innovation Programme under Grant 777435Lopez Garcia, A.; Marco De Lucas, J.; Antonacci, M.; Zu Castell, W.; David, M.; Hardt, M.; Lloret Iglesias, L.... (2020). A Cloud-Based Framework for Machine Learning Workloads and Applications. IEEE Access. 8:18681-18692. https://doi.org/10.1109/ACCESS.2020.2964386S1868118692

The Pipeline for the Continuous Development of Artificial Intelligence Models -- Current State of Research and Practice

Author: Felderer Michael
Ramler Rudolf
Steidl Monika
Publication venue: 'Elsevier BV'
Publication date: 01/01/2023
Field of study

Companies struggle to continuously develop and deploy AI models to complex production systems due to AI characteristics while assuring quality. To ease the development process, continuous pipelines for AI have become an active research area where consolidated and in-depth analysis regarding the terminology, triggers, tasks, and challenges is required. This paper includes a Multivocal Literature Review where we consolidated 151 relevant formal and informal sources. In addition, nine-semi structured interviews with participants from academia and industry verified and extended the obtained information. Based on these sources, this paper provides and compares terminologies for DevOps and CI/CD for AI, MLOps, (end-to-end) lifecycle management, and CD4ML. Furthermore, the paper provides an aggregated list of potential triggers for reiterating the pipeline, such as alert systems or schedules. In addition, this work uses a taxonomy creation strategy to present a consolidated pipeline comprising tasks regarding the continuous development of AI. This pipeline consists of four stages: Data Handling, Model Learning, Software Development and System Operations. Moreover, we map challenges regarding pipeline implementation, adaption, and usage for the continuous development of AI to these four stages.Comment: accepted in the Journal Systems and Softwar

arXiv.org e-Print Archive

Blekinge Institute of Technology

Digitala Vetenskapliga Arkivet - Academic Archive On-line

An exchange-based AIoT platform for fast AI application development

Author: Liang Yu-Cheng
Ren Yi
Tong Kit-Lun
Tseng Yu-Chee
Wu Kun-Ru
Publication venue
Publication date: 30/10/2023
Field of study

AIoT is the combination of Internet of Things (IoT) and Artificial Intelligence (AI) technologies. While IoT emphasizes more on scalable and efficient communications, AI focuses more on reproducing human capabilities such as recognition and forecasting. An efficient AIoT platform may not be obtained directly from integrating existing IoT and AI serving platforms by considering the AIoT service reproduction and evolution. In this work, we propose an AIoT platform that empowers developers to build sophisticated and scalable applications. Our platform is derived based on exchange-based RabbitMQ broker and Advanced Message Queuing Protocol (AMQP) to facilitate the communications among heterogeneous data sources and AI models. By incorporating an AMQP broker, it supports diverse data exchanges, AI models chaining, and flexible message routing and processing. AI models can be deployed efficiently through containerization with flexible and shared data paths to facilitate computations. Hence, developers can focus on service and application requirements. We also present a case study in smart healthcare to validate our design

University of East Anglia digital repository

Toward a definition for serverless computing

Author: Abad Cristina
Al-Kiswany Samer
Balis Bartosz
Bauer Andre
Bondi Andre
Chard Kyle
Chard Ryan
Chatley Robert
Chien Andrew
Davis Jesse
Donkervliet Jesse
Eismann Simon
Elmroth Erik
Ferrier Nicola
Foster Ian
Hassan Ahmed Ali-Eldin
Herbst Nikolas
Iosup Alexandru
Jacobsen Hans-Arno
Jamshidi Pooyan
Kounev Samuel
Kousiouris Georgios
Leitner Philipp
Lopes Pedro
Maggio Martina
Malawski Maciej
Metzler Bernard
Muthusamy Vinod
Papadopoulos Alessandro
Patros Panos
Pierre Guillaume
Rana Omer
Ricci Robert
Scheuner Joel
Sedaghat Mina
Shahrad Mohammad
Shenoy Prashant
Spillner Josef
Taibi Davide
Thain Douglas
Trivedi Animesh
Uta Alexandru
van Beek Vincent
van Eyk Erwin
van Hoorn Andre
Vasani Soam
Wamser Florian
Wirtz Guido
Yussupov Vladimir
Publication venue: Schloss Dagstuhl -- Leibniz-Zentrum fur Informatik
Publication date
Field of study

Online Research @ Cardiff