Search CORE

83 research outputs found

Context-aware Fine-tuning of Self-supervised Speech Models

Author: Kim Kwangyoun
Livescu Karen
Shon Suwon
Sridhar Prashant
Watanabe Shinji
Wu Felix
Publication venue
Publication date: 28/03/2023
Field of study

Self-supervised pre-trained transformers have improved the state of the art on a variety of speech tasks. Due to the quadratic time and space complexity of self-attention, they usually operate at the level of relatively short (e.g., utterance) segments. In this paper, we study the use of context, i.e., surrounding segments, during fine-tuning and propose a new approach called context-aware fine-tuning. We attach a context module on top of the last layer of a pre-trained model to encode the whole segment into a context embedding vector which is then used as an additional feature for the final prediction. During the fine-tuning stage, we introduce an auxiliary loss that encourages this context embedding vector to be similar to context vectors of surrounding segments. This allows the model to make predictions without access to these surrounding segments at inference time and requires only a tiny overhead compared to standard fine-tuned models. We evaluate the proposed approach using the SLUE and Libri-light benchmarks for several downstream tasks: Automatic speech recognition (ASR), named entity recognition (NER), and sentiment analysis (SA). The results show that context-aware fine-tuning not only outperforms a standard fine-tuning baseline but also rivals a strong context injection baseline that uses neighboring speech segments during inference

arXiv.org e-Print Archive

Coexistence of WiFi and WiMAX Systems Based on PS-Request Protocols†

Author: Golmie
Ho Young Hwang
Jongwoo Kim
Kim
Kim
Seung Hyong Rhee
Suwon Park
Yong-Hoon Choi
Young-uk Chung
Publication venue: Molecular Diversity Preservation International (MDPI)
Publication date: 01/10/2011
Field of study

We introduce both the coexistence zone within the WiMAX frame structure and a PS-Request protocol for the coexistence of WiFi and WiMAX systems sharing a frequency band. Because we know that the PS-Request protocol has drawbacks, we propose a revised PS-Request protocol to improve the performance. Two PS-Request protocols are based on the time division operation (TDO) of WiFi system and WiMAX system to avoid the mutual interference, and use the vestigial power management (PwrMgt) bit within the Frame Control field of the frames transmitted by a WiFi AP. The performance of the revised PS-Request protocol is evaluated by computer simulation, and compared to those of the cases without a coexistence protocol and to the original PS-Request protocol

Crossref

Directory of Open Access Journals

PubMed Central

DNN Transfer Learning based Non-linear Feature Extraction for Acoustic Event Classification

Author: Han David K.
Kim Wooil
Ko Hanseok
Mun Seongkyu
Shin Minkyu
Shon Suwon
Publication venue: 'Institute of Electronics, Information and Communications Engineers (IEICE)'
Publication date: 01/01/2017
Field of study

Recent acoustic event classification research has focused on training suitable filters to represent acoustic events. However, due to limited availability of target event databases and linearity of conventional filters, there is still room for improving performance. By exploiting the non-linear modeling of deep neural networks (DNNs) and their ability to learn beyond pre-trained environments, this letter proposes a DNN-based feature extraction scheme for the classification of acoustic events. The effectiveness and robustness to noise of the proposed method are demonstrated using a database of indoor surveillance environments

arXiv.org e-Print Archive

Crossref

Restructuring field layouts for embedded memory systems

Author: Daejeon
Hwansoo Han
Jungeun Kim
Keoncheol Shin
Korea
Korea Daejeon
Korea Daejeon
Korea Suwon
Seonggun Kim
Publication venue
Publication date: 01/01/2006
Field of study

Abstrac

CiteSeerX