Search CORE

941 research outputs found

Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms

Author: Kim Taejun
Lee Jongpil
Nam Juhan
Publication venue
Publication date: 13/02/2018
Field of study

Recent work has shown that the end-to-end approach using convolutional neural network (CNN) is effective in various types of machine learning tasks. For audio signals, the approach takes raw waveforms as input using an 1-D convolution layer. In this paper, we improve the 1-D CNN architecture for music auto-tagging by adopting building blocks from state-of-the-art image classification models, ResNets and SENets, and adding multi-level feature aggregation to it. We compare different combinations of the modules in building CNN architectures. The results show that they achieve significant improvements over previous state-of-the-art models on the MagnaTagATune dataset and comparable results on Million Song Dataset. Furthermore, we analyze and visualize our model to show how the 1-D CNN operates.Comment: Accepted for publication at ICASSP 201

arXiv.org e-Print Archive

Crossref

Temporal Feedback Convolutional Recurrent Neural Networks for Keyword Spotting

Author: Kim Taejun
Nam Juhan
Publication venue
Publication date: 30/10/2019
Field of study

While end-to-end learning has become a trend in deep learning, the model architecture is often designed to incorporate domain knowledge. We propose a novel convolutional recurrent neural network (CRNN) architecture with temporal feedback connections, inspired by the feedback pathways from the brain to ears in the human auditory system. The proposed architecture uses a hidden state of the RNN module at the previous time to control the sensitivity of channel-wise feature activations in the CNN blocks at the current time, which is analogous to the mechanism of the outer hair-cell. We apply the proposed model to keyword spotting where the speech commands have sequential nature. We show the proposed model consistently outperforms the compared model without temporal feedback for different input/output settings in the CRNN framework. We also investigate the details of the performance improvement by conducting a failure analysis of the keyword spotting task and a visualization of the channel-wise feature scaling in each CNN block.Comment: This paper is submitted to ICASSP 202

arXiv.org e-Print Archive

Dating for start and end period of historical crash and its application to real time bubble and crash detection

Author: Kim Taejun
Publication venue
Publication date: 14/12/2023
Field of study

Nottingham eTheses

Dating for start and end period of historical crash and its application to real time bubble and crash detection

Author: Kim Taejun
Publication venue
Publication date
Field of study

Nottingham ePrints

Multiphoton tissue imaging by using moxifloxacin

Author: Chung Euiheon
Kim Ki Hean
Kim Myung Joon
Wang Taejun
Publication venue: ECI Digital Archives
Publication date: 23/07/2017
Field of study

Multiphoton microscopy has been widely used for in-vivo tissue imaging of various biological studies. However, its application to clinical studies has been limited due to either lack of clinically compatible exogenous contrast agents or weak autofluorescence of tissues. We investigated moxifloxacin as a contrast agent of cells for multiphoton tissue imaging. Moxifloxacin is an FDA approved antibiotic with relatively good pharmacokinetic properties for tissue penetration and intrinsic fluorescence. Two-photon microscopy (TPM) of moxifloxacin treated mouse corneas showed good tissue penetration and high concentration inside the corneal cells [1]. Cell labeling of moxifloxacin was tested in both cultured cells and isolated immune cells. Moxifloxacin tissue applications were tested in various mouse organs such as the skin, small intestine, and brain. Most of tissues were labeled well via topical administration, and only the skin required additional gentle removal of the outermost stratum corneum by tape stripping. TPM of these tissues showed non-specific cell labeling of moxifloxacin and fluorescence enhancement [2]. Although most of experimental results were from mouse tissues, its clinical application would be possible. Clinical application is promising since imaging based on moxifloxacin labeling could be 10 times faster than imaging based on endogenous fluorescence. Moxifloxacin labeling of cultured cells was demonstrated by comparing TPM images with and without moxifloxacin treatment. Bright fluorescence inside cells were observed only with moxifloxacin at the same imaging condition. TPM of the skin dermis visualized many dermal cells with increased fluorescence, and TPM of the villus in the small intestine showed the covering epithelial cells and cells inside the villus clearly. Please click Additional Files below to see the full abstract

Engineering Conferences International

An improved Bayesian inference model for auto healing of concrete specimen due to a cyclic freeze-thaw experiment

Author: Changhwan Jang
Jeongbae Lee
Seongsoo Kim
Taejun Cho
Publication venue: Vibroengineering
Publication date: 30/06/2013
Field of study

This paper presents an innovative solution for the auto healing porous structures damaged by cyclic freeze-thaw, followed by predicting the results of recovered damage due to freezing based on Bayesian inference. The additional hydration of high strength material, cured in high temperature, is applied as auto curing for the damaged micro-pore structures. Modeling of micro pore structure is prior to damage analysis. The amount of ice volume with temperature dependent surface tensions, freezing pressure and resulting deformations, and cycle and temperature dependent pore volume has been predicted and compared with available test results. By heating the selected area of specimen in frozen chamber, approximately 100 % of strength recovery has been observed after 10 days of freeze-thaw tests in the proposed nonlinear stochastic prediction models and the experimental results

Journal of Vibroengineering

Reliability assessment based on an adaptive response surface method considering correlation among random variables

Author: Changhwan Jang
Jeongbae Lee
Seongsoo Kim
Taejun Cho
Publication venue: Vibroengineering
Publication date: 31/12/2013
Field of study

Although the Monte-Carlo Simulation (MCS) technique can evaluate a reliability of most structural systems, its processing time equals, approximately, the reciprocal of the probability of failure. While the Stochastic Finite Element (SFE) method could help to solve such a drawback, it is limited to specific computer programs, in which the mean and the coefficient of random variables are estimated by a perturbation, or by a weighted integral method. Therefore, SFE may not be easily applicable when using commercial software or systems that are not prepared with the prerequisite programming. To overcome these limitations, the RSM can be applied, because its accuracy depends on both the distance of axial points, and the linearity of the Limit State Functions (LSFs). The correlation among random variables and the response of a system is evaluated by composing a Bayesian belief nets (BBN). Consequently, the proposed Linear Adaptive Weighted Response Surface Method (LAW-RSM) with BBN modeling produces improved converged reliability indices than conventional RSMs and detail observation for the uncertainties in structural components

Journal of Vibroengineering

X-SNS: Cross-Lingual Transfer Prediction through Sub-Network Similarity

Author: Kang Deokyeong
Kim Jihoon
Kim Jinhyeon
Kim Taeuk
Lim Seong Hoon
Yun Taejun
Publication venue
Publication date: 26/10/2023
Field of study

Cross-lingual transfer (XLT) is an emergent ability of multilingual language models that preserves their performance on a task to a significant extent when evaluated in languages that were not included in the fine-tuning process. While English, due to its widespread usage, is typically regarded as the primary language for model adaption in various tasks, recent studies have revealed that the efficacy of XLT can be amplified by selecting the most appropriate source languages based on specific conditions. In this work, we propose the utilization of sub-network similarity between two languages as a proxy for predicting the compatibility of the languages in the context of XLT. Our approach is model-oriented, better reflecting the inner workings of foundation models. In addition, it requires only a moderate amount of raw text from candidate languages, distinguishing it from the majority of previous methods that rely on external resources. In experiments, we demonstrate that our method is more effective than baselines across diverse tasks. Specifically, it shows proficiency in ranking candidates for zero-shot XLT, achieving an improvement of 4.6% on average in terms of NDCG@3. We also provide extensive analyses that confirm the utility of sub-networks for XLT prediction.Comment: Accepted to EMNLP 2023 (Findings

arXiv.org e-Print Archive

Improvement of P300-Based Brain-Computer Interfaces for Home Appliances Control by Data Balancing Techniques

Author: Kim Minju
Kim Sung-Phil
Lee Taejun
Publication venue: 'MDPI AG'
Publication date: 01/10/2020
Field of study

The oddball paradigm used in P300-based brain-computer interfaces (BCIs) intrinsically poses the issue of data imbalance between target stimuli and nontarget stimuli. Data imbalance can cause overfitting problems and, consequently, poor classification performance. The purpose of this study is to improve BCI performance by solving this data imbalance problem with sampling techniques. The sampling techniques were applied to BCI data in 15 subjects controlling a door lock, 15 subjects an electric light, and 14 subjects a Bluetooth speaker. We explored two categories of sampling techniques: oversampling and undersampling. Oversampling techniques, including random oversampling, synthetic minority oversampling technique (SMOTE), borderline-SMOTE, support vector machine (SVM) SMOTE, and adaptive synthetic sampling, were used to increase the number of samples for the class of target stimuli. Undersampling techniques, including random undersampling, neighborhood cleaning rule, Tomek's links, and weighted undersampling bagging, were used to reduce the class size of nontarget stimuli. The over- or undersampled data were classified by an SVM classifier. Overall, some oversampling techniques improved BCI performance while undersampling techniques often degraded performance. Particularly, using borderline-SMOTE yielded the highest accuracy (87.27%) and information transfer rate (8.82 bpm) across all three appliances. Moreover, borderline-SMOTE led to performance improvement, especially for poor performers. A further analysis showed that borderline-SMOTE improved SVM by generating more support vectors within the target class and enlarging margins. However, there was no difference in the accuracy between borderline-SMOTE and the method of applying the weighted regularization parameter of the SVM. Our results suggest that although oversampling improves performance of P300-based BCIs, it is not just the effect of the oversampling techniques, but rather the effect of solving the data imbalance problem

ScholarWorks@UNIST