Search CORE

596 research outputs found

MultiTASC: A Multi-Tenancy-Aware Scheduler for Cascaded DNN Inference at the Consumer Edge

Author: Nikolaidis Sokratis
Venieris Iakovos S.
Venieris Stylianos I.
Publication venue
Publication date: 22/06/2023
Field of study

Cascade systems comprise a two-model sequence, with a lightweight model processing all samples and a heavier, higher-accuracy model conditionally refining harder samples to improve accuracy. By placing the light model on the device side and the heavy model on a server, model cascades constitute a widely used distributed inference approach. With the rapid expansion of intelligent indoor environments, such as smart homes, the new setting of Multi-Device Cascade is emerging where multiple and diverse devices are to simultaneously use a shared heavy model on the same server, typically located within or close to the consumer environment. This work presents MultiTASC, a multi-tenancy-aware scheduler that adaptively controls the forwarding decision functions of the devices in order to maximize the system throughput, while sustaining high accuracy and low latency. By explicitly considering device heterogeneity, our scheduler improves the latency service-level objective (SLO) satisfaction rate by 20-25 percentage points (pp) over state-of-the-art cascade methods in highly heterogeneous setups, while serving over 40 devices, showcasing its scalability.Comment: Accepted at 28th IEEE Symposium on Computers and Communications (ISCC), 202

arXiv.org e-Print Archive

Recommended from our members

Movie analysis with emphasis to dialogue and action scene detection

Author: Benetos E.
Kotropoulos C.
Nikolaidis N.
Pitas I.
Siatras S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

City Research Online

Crossref

Exploring the Performance and Efficiency of Transformer Models for NLP on Mobile Devices

Author: Nikolaidis Sokratis
Panopoulos Ioannis
Venieris Iakovos S.
Venieris Stylianos I.
Publication venue
Publication date: 20/06/2023
Field of study

Deep learning (DL) is characterised by its dynamic nature, with new deep neural network (DNN) architectures and approaches emerging every few years, driving the field's advancement. At the same time, the ever-increasing use of mobile devices (MDs) has resulted in a surge of DNN-based mobile applications. Although traditional architectures, like CNNs and RNNs, have been successfully integrated into MDs, this is not the case for Transformers, a relatively new model family that has achieved new levels of accuracy across AI tasks, but poses significant computational challenges. In this work, we aim to make steps towards bridging this gap by examining the current state of Transformers' on-device execution. To this end, we construct a benchmark of representative models and thoroughly evaluate their performance across MDs with different computational capabilities. Our experimental results show that Transformers are not accelerator-friendly and indicate the need for software and hardware optimisations to achieve efficient deployment.Comment: Accepted at the 3rd IEEE International Workshop on Distributed Intelligent Systems (DistInSys), 202

arXiv.org e-Print Archive

Recommended from our members

MUSCLE movie-database: a multimodal corpus with rich annotation for dialogue and saliency detection

Author: Antonopoulos P.
Benetos E.
Kotropoulos C.
Kotti M.
Maragos P.
Moschou V.
Nikolaidis N.
Pitas I.
Spachos D.
Tzimouli K.
Zlantintsi A.
Publication venue
Publication date: 01/01/2008
Field of study

City Research Online

Spiral - Imperial College Digital Repository

Region-based image watermarking

Author: A. Nikolaidis
I. Pitas
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

A New Era in Engineering Plastics: Compatibility and Perspectives of Sustainable Alipharomatic Poly(ethylene terephthalate)/Poly(ethylene 2,5-furandicarboxylate) Blends

Author: Achilias DS
Bikiaris DN
Exarhopoulos S
Ioannidis RO
Kasmi N
Nikolaidis GN
Papageorgiou DG
Papageorgiou GZ
Tsetsou I
Publication venue: 'MDPI AG'
Publication date: 29/03/2021
Field of study

The industrialisation of poly(ethylene 2,5-furandicarboxylate) for total replacement of poly(ethylene terephthalate) in the polyester market is under question. Preparation of high-performing polymer blends is a well-established strategy for tuning the properties of certain homopolymers and create tailor-made materials to meet the demands for a number of applications. In this work, the structure, thermal properties and the miscibility of a series of poly(ethylene terephthalate)/poly(ethylene 2,5-furandicarboxylate) (PET/PEF) blends have been studied. A number of thermal treatments were followed in order to examine the thermal transitions, their dynamic state and the miscibility characteristics for each blend composition. Based on their glass transition temperatures and melting behaviour the PET/PEF blends are miscible at high and low poly(ethylene terephthalate) (PET) contents, while partial miscibility was observed at intermediate compositions. The multiple melting was studied and their melting point depression was analysed with the Flory-Huggins theory. In an attempt to further improve miscibility, reactive blending was also investigated

Multidisciplinary Digital Publishing Institute

Queen Mary Research Online

The use of the MPEG-7 AVDP profile in 3DTV audiovisual content description

Author: Benaroya L.
Gerke S.
Linnemann A.
Liu M.
Liuni M.
Nikolaidis N.
Papachristou K.
Peeters G.
Pitas I.
Roebel A.
Publication venue
Publication date: 03/06/2014
Field of study

Explore Bristol Research

Off-design performance comparison between single and two-shaft engines: part 1 — fixed geometry

Author: Aslanidou I.
Kalfas A.
Nikolaidis Theoklis
Pellegrini Alvise
Pilidis Pericles
Saravanamuttoo H. I. H.
Publication venue: 'ASME International'
Publication date: 11/01/2021
Field of study

This paper describes an investigation into the off-design performance comparison of single and two-shaft gas turbine engines. A question that has been asked for a long time which gas turbine delivers a better thermal efficiency at part load. The authors, notwithstanding their intensive searches, were unable to find a comprehensive answer to this question. A detailed investigation was carried out using a state of the art performance evaluation method and the answer was found to be: It depends! In this work, the performance of two engine configurations is assessed. In the first one, the single-shaft gas turbine operates at constant shaft rotational speed. Thus, the shape of the compressor map rotational speed line will have an important influence on the performance of the engine. To explore the implications of the shape of the speed line, two single-shaft cases are examined. The first case is when the speed line is curved and as the compressor pressure ratio falls, the non-dimensional mass flow increases. The second case is when the speed line is vertical and as the compressor pressure ratio falls, the non-dimensional mass flow remains constant. In the second configuration, the two-shaft engine, the two-shafts can be controlled to operate at different rotational speeds and also varying relationships between the rotational speeds. The part-load operation is characterized by a reduction in the gas generator rotational speed. The tool, which was used in this study, is a 0-D whole engine simulation tool, named Turbomatch. It was developed at Cranfield and it is based on mass and energy balance, carried out through an iterative method, which is based on component maps. These generic, experimentally derived maps are scaled to match the design point of a particular engine before an off-design calculation is performed. The code has been validated against experimental data elsewhere, it has been used extensively for academic purposes and the research activities that have taken place at Cranfield University. For an ideal cycle, the single-shaft engine was found to be a clear winner in terms of part-load thermal efficiency. However, this picture changed when realistic component maps were utilized. The basic cycle and the shape of component maps had a profound influence on the outcome. The authors explored the influence of speed line shapes, levels of component efficiencies and the variation of these component efficiencies within the operating range. This paper describes how each one of these factors, individually, influences the outcome

Crossref

CERES Research Repository (Cranfield Univ.)

Análisis de la distribución de la práctica de actividad física en alumnos de educación infantil

Author: Beltrán J.
Calahorro F.
López-Fernández I.
Nikolaidis P.T.
Torre-Luque G.
Publication venue: Sociedad Iberoamericana de Psicología del Deporte
Publication date: 23/03/2016
Field of study

La práctica de actividad física (AF) es un elemento muy importante en la promoción de la salud durante la infancia. Sin embargo, hay pocos estudios que profundicen en las características de la AF que desarrollan los niños a lo largo del día. El objetivo del presente estudio fue describir la distribución de la actividad física de una intensidad determinada en bouts o secuencias de diferente duración y durante tres períodos: lunes a viernes, horario escolar y horario extraescolar. Se seleccionaron 107 alumnos de segundo ciclo de Educación Infantil (3,81±0,87 años; 19,64±4,32 kg, 1,19±0,29 m). Los sujetos llevaron un acelerómetro (Actigraph GT3X) durante 5 días consecutivos. Se calcularon los niveles de actividad física, de acuerdo con los puntos de corte propuestos por Sirard et al. (2005) y se analizó la agrupación de la AF de intensidad moderada y vigorosa (AFMV) en bouts de 1, 2, 4, 5, 6, 8, 10 y 12 min de duración. Los resultados mostraron el predominio de una AF de carácter esporádico, donde alrededor del 90% de la AFMV se realizó en bouts de 1 y 2 min. Se observó un mayor volumen de bouts de 1, 2, 4, 5 y 6 min de duración de lunes a viernes y en horario extraescolar que en el horario escolar (p<0,001). Se pone de manifiesto que la AF en niños de estas edades tiene un carácter intermitente, realizándose en secuencias o intervalos cortos que oscilan entre 1 y 2 min de duración

Revistas Científicas de la Universidad de Murcia