596 research outputs found
MultiTASC: A Multi-Tenancy-Aware Scheduler for Cascaded DNN Inference at the Consumer Edge
Cascade systems comprise a two-model sequence, with a lightweight model
processing all samples and a heavier, higher-accuracy model conditionally
refining harder samples to improve accuracy. By placing the light model on the
device side and the heavy model on a server, model cascades constitute a widely
used distributed inference approach. With the rapid expansion of intelligent
indoor environments, such as smart homes, the new setting of Multi-Device
Cascade is emerging where multiple and diverse devices are to simultaneously
use a shared heavy model on the same server, typically located within or close
to the consumer environment. This work presents MultiTASC, a
multi-tenancy-aware scheduler that adaptively controls the forwarding decision
functions of the devices in order to maximize the system throughput, while
sustaining high accuracy and low latency. By explicitly considering device
heterogeneity, our scheduler improves the latency service-level objective (SLO)
satisfaction rate by 20-25 percentage points (pp) over state-of-the-art cascade
methods in highly heterogeneous setups, while serving over 40 devices,
showcasing its scalability.Comment: Accepted at 28th IEEE Symposium on Computers and Communications
(ISCC), 202
Exploring the Performance and Efficiency of Transformer Models for NLP on Mobile Devices
Deep learning (DL) is characterised by its dynamic nature, with new deep
neural network (DNN) architectures and approaches emerging every few years,
driving the field's advancement. At the same time, the ever-increasing use of
mobile devices (MDs) has resulted in a surge of DNN-based mobile applications.
Although traditional architectures, like CNNs and RNNs, have been successfully
integrated into MDs, this is not the case for Transformers, a relatively new
model family that has achieved new levels of accuracy across AI tasks, but
poses significant computational challenges. In this work, we aim to make steps
towards bridging this gap by examining the current state of Transformers'
on-device execution. To this end, we construct a benchmark of representative
models and thoroughly evaluate their performance across MDs with different
computational capabilities. Our experimental results show that Transformers are
not accelerator-friendly and indicate the need for software and hardware
optimisations to achieve efficient deployment.Comment: Accepted at the 3rd IEEE International Workshop on Distributed
Intelligent Systems (DistInSys), 202
Recommended from our members
MUSCLE movie-database: a multimodal corpus with rich annotation for dialogue and saliency detection
A New Era in Engineering Plastics: Compatibility and Perspectives of Sustainable Alipharomatic Poly(ethylene terephthalate)/Poly(ethylene 2,5-furandicarboxylate) Blends
The industrialisation of poly(ethylene 2,5-furandicarboxylate) for total replacement of poly(ethylene terephthalate) in the polyester market is under question. Preparation of high-performing polymer blends is a well-established strategy for tuning the properties of certain homopolymers and create tailor-made materials to meet the demands for a number of applications. In this work, the structure, thermal properties and the miscibility of a series of poly(ethylene terephthalate)/poly(ethylene 2,5-furandicarboxylate) (PET/PEF) blends have been studied. A number of thermal treatments were followed in order to examine the thermal transitions, their dynamic state and the miscibility characteristics for each blend composition. Based on their glass transition temperatures and melting behaviour the PET/PEF blends are miscible at high and low poly(ethylene terephthalate) (PET) contents, while partial miscibility was observed at intermediate compositions. The multiple melting was studied and their melting point depression was analysed with the Flory-Huggins theory. In an attempt to further improve miscibility, reactive blending was also investigated
Off-design performance comparison between single and two-shaft engines: part 1 — fixed geometry
This paper describes an investigation into the off-design performance comparison of single and two-shaft gas turbine engines. A question that has been asked for a long time which gas turbine delivers a better thermal efficiency at part load. The authors, notwithstanding their intensive searches, were unable to find a comprehensive answer to this question. A detailed investigation was carried out using a state of the art performance evaluation method and the answer was found to be: It depends!
In this work, the performance of two engine configurations is assessed. In the first one, the single-shaft gas turbine operates at constant shaft rotational speed. Thus, the shape of the compressor map rotational speed line will have an important influence on the performance of the engine. To explore the implications of the shape of the speed line, two single-shaft cases are examined. The first case is when the speed line is curved and as the compressor pressure ratio falls, the non-dimensional mass flow increases. The second case is when the speed line is vertical and as the compressor pressure ratio falls, the non-dimensional mass flow remains constant.
In the second configuration, the two-shaft engine, the two-shafts can be controlled to operate at different rotational speeds and also varying relationships between the rotational speeds. The part-load operation is characterized by a reduction in the gas generator rotational speed.
The tool, which was used in this study, is a 0-D whole engine simulation tool, named Turbomatch. It was developed at Cranfield and it is based on mass and energy balance, carried out through an iterative method, which is based on component maps. These generic, experimentally derived maps are scaled to match the design point of a particular engine before an off-design calculation is performed. The code has been validated against experimental data elsewhere, it has been used extensively for academic purposes and the research activities that have taken place at Cranfield University.
For an ideal cycle, the single-shaft engine was found to be a clear winner in terms of part-load thermal efficiency. However, this picture changed when realistic component maps were utilized. The basic cycle and the shape of component maps had a profound influence on the outcome.
The authors explored the influence of speed line shapes, levels of component efficiencies and the variation of these component efficiencies within the operating range. This paper describes how each one of these factors, individually, influences the outcome
Análisis de la distribución de la práctica de actividad física en alumnos de educación infantil
La práctica de actividad física (AF) es un elemento muy importante en la promoción de la salud durante la infancia. Sin embargo, hay pocos estudios que profundicen en las características de la AF que desarrollan los niños a lo largo del día. El objetivo del presente estudio fue describir la distribución de la actividad física de una intensidad determinada en bouts o secuencias de diferente duración y durante tres períodos: lunes a viernes, horario escolar y horario extraescolar. Se seleccionaron 107 alumnos de segundo ciclo de Educación Infantil (3,81±0,87 años; 19,64±4,32 kg, 1,19±0,29 m). Los sujetos llevaron un acelerómetro (Actigraph GT3X) durante 5 días consecutivos. Se calcularon los niveles de actividad física, de acuerdo con los puntos de corte propuestos por Sirard et al. (2005) y se analizó la agrupación de la AF de intensidad moderada y vigorosa (AFMV) en bouts de 1, 2, 4, 5, 6, 8, 10 y 12 min de duración. Los resultados mostraron el predominio de una AF de carácter esporádico, donde alrededor del 90% de la AFMV se realizó en bouts de 1 y 2 min. Se observó un mayor volumen de bouts de 1, 2, 4, 5 y 6 min de duración de lunes a viernes y en horario extraescolar que en el horario escolar (p<0,001). Se pone de manifiesto que la AF en niños de estas edades tiene un carácter intermitente, realizándose en secuencias o intervalos cortos que oscilan entre 1 y 2 min de duración
- …