Search CORE

4 research outputs found

Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient BERT

Author: Kundu Souvik
Nittur Sharath Sridhar
Sundaresan Sairam
Szankin Maciej
Publication venue
Publication date: 14/07/2023
Field of study

Large pre-trained language models have recently gained significant traction due to their improved performance on various down-stream tasks like text classification and question answering, requiring only few epochs of fine-tuning. However, their large model sizes often prohibit their applications on resource-constrained edge devices. Existing solutions of yielding parameter-efficient BERT models largely rely on compute-exhaustive training and fine-tuning. Moreover, they often rely on additional compute heavy models to mitigate the performance gap. In this paper, we present Sensi-BERT, a sensitivity driven efficient fine-tuning of BERT models that can take an off-the-shelf pre-trained BERT model and yield highly parameter-efficient models for downstream tasks. In particular, we perform sensitivity analysis to rank each individual parameter tensor, that then is used to trim them accordingly during fine-tuning for a given parameter or FLOPs budget. Our experiments show the efficacy of Sensi-BERT across different downstream tasks including MNLI, QQP, QNLI, and SST-2, demonstrating better performance at similar or smaller parameter budget compared to various existing alternatives.Comment: 6 pages, 4 figures, 2 table

arXiv.org e-Print Archive

Thermal Image Processing for Respiratory Estimation from Cubical Data with Expandable Depth

Author: Alicja Kwasniewska
Jacek Ruminski
Maciej Szankin
Publication venue: MDPI AG
Publication date: 01/09/2023
Field of study

As healthcare costs continue to rise, finding affordable and non-invasive ways to monitor vital signs is increasingly important. One of the key metrics for assessing overall health and identifying potential issues early on is respiratory rate (RR). Most of the existing methods require multiple steps that consist of image and signal processing. This might be difficult to deploy on edge devices that often do not have specialized digital signal processors (DSP). Therefore, the goal of this study is to develop a single neural network realizing the entire process of RR estimation in a single forward pass. The proposed solution builds on recent advances in video recognition, capturing both spatial and temporal information in a multi-path network. Both paths process the data at different sampling rates to capture rapid and slow changes that are associated with differences in the temperature of the nostril area during the breathing episodes. The preliminary results show that the introduced end-to-end solution achieves better performance compared to state-of-the-art methods, without requiring additional pre/post-processing steps and signal-processing techniques. In addition, the presented results demonstrate its robustness on low-resolution thermal video sequences that are often used at the embedded edge due to the size and power constraints of such systems. Taking that into account, the proposed approach has the potential for efficient and convenient respiratory rate estimation across various markets in solutions deployed locally, close to end users

Directory of Open Access Journals

A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities

Author: Cummings Daniel
Munoz Juan Pablo
Sarah Anthony
Sridhar Sharath Nittur
Sundaresan Sairam
Szankin Maciej
Publication venue
Publication date: 19/05/2022
Field of study

Recent advances in Neural Architecture Search (NAS) such as one-shot NAS offer the ability to extract specialized hardware-aware sub-network configurations from a task-specific super-network. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still under-explored. Popular methods decouple the super-network training from the sub-network search and use performance predictors to reduce the computational burden of searching on different hardware platforms. We propose a flexible search framework that automatically and efficiently finds optimal sub-networks that are optimized for different performance metrics and hardware configurations. Specifically, we show how evolutionary algorithms can be paired with lightly trained objective predictors in an iterative cycle to accelerate architecture search in a multi-objective setting for various modalities including machine translation and image classification

arXiv.org e-Print Archive

Super-resolved thermal imagery for high-accuracy facial areas detection and analysis

Author: Abdel-Nasser
Alicja Kwasniewska
Bergstra
Chang
Chen
Cui
Dong
Freedman
Gade
Gao
Glasner
Goodfellow
Hanmandlu
Hore
Jacek Ruminski
Jia
Keys
Kim
Kwaśniewska
Kwaśniewska
Ledig
Lewandowska
Li
Liu
Liu
Maciej Szankin
Mariusz Kaczmarek
Park
Qi
Romano
Ruminski
Sajjadi
Torrey
Villanueva
Zhang
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref