4,036 research outputs found

    Outlier-aware Inlier Modeling and Multi-scale Scoring for Anomalous Sound Detection via Multitask Learning

    Full text link
    This paper proposes an approach for anomalous sound detection that incorporates outlier exposure and inlier modeling within a unified framework by multitask learning. While outlier exposure-based methods can extract features efficiently, it is not robust. Inlier modeling is good at generating robust features, but the features are not very effective. Recently, serial approaches are proposed to combine these two methods, but it still requires a separate training step for normal data modeling. To overcome these limitations, we use multitask learning to train a conformer-based encoder for outlier-aware inlier modeling. Moreover, our approach provides multi-scale scores for detecting anomalies. Experimental results on the MIMII and DCASE 2020 task 2 datasets show that our approach outperforms state-of-the-art single-model systems and achieves comparable results with top-ranked multi-system ensembles.Comment: accepted at INTERSPEECH 202

    Two-timescale Beamforming Optimization for Intelligent Reflecting Surface Aided Multiuser Communication with QoS Constraints

    Full text link
    Intelligent reflecting surface (IRS) is an emerging technology that is able to reconfigure the wireless channel via tunable passive signal reflection and thereby enhance the spectral and energy efficiency of wireless networks cost-effectively. In this paper, we study an IRS-aided multiuser multiple-input single-output (MISO) wireless system and adopt the two-timescale (TTS) transmission to reduce the signal processing complexity and channel training overhead as compared to the existing schemes based on the instantaneous channel state information (I-CSI), and at the same time, exploit the multiuser channel diversity in transmission scheduling. Specifically, the long-term passive beamforming is designed based on the statistical CSI (S-CSI) of all links, while the short-term active beamforming is designed to cater to the I-CSI of all users' reconfigured channels with optimized IRS phase shifts. We aim to minimize the average transmit power at the access point (AP), subject to the users' individual quality of service (QoS) constraints. The formulated stochastic optimization problem is non-convex and difficult to solve since the long-term and short-term design variables are complicatedly coupled in the QoS constraints. To tackle this problem, we propose an efficient algorithm, called the primal-dual decomposition based TTS joint active and passive beamforming (PDD-TJAPB), where the original problem is decomposed into a long-term problem and a family of short-term problems, and the deep unfolding technique is employed to extract gradient information from the short-term problems to construct a convex surrogate problem for the long-term problem. The proposed algorithm is proved to converge to a stationary solution of the original problem almost surely. Simulation results are presented which demonstrate the advantages and effectiveness of the proposed algorithm as compared to benchmark schemes.Comment: 16 pages, 10 figures, accepted by IEEE Transactions on Wireless communication

    A P300-speller based on event-related spectral perturbation (ERSP)

    Get PDF
    A brain-computer interface (BCI) P300 speller is a novel technique that helps people spell words using the electroencephalography (EEG) without the involvement of muscle activities. However, only time domain ERP features (P300) are used for controlling of the BCI speller. In this paper, we investigated the time-frequency EEG features for the P300-based brain-computer interface speller. A signal preprocessing method integrated ensemble average, principal component analysis, and independent component analysis to remove noise and artifacts in the EEG data. A time-frequency analysis based on wavelet transform was carried out to extract event-related spectral perturbation (ERSP) and inter-trial coherence (ITC) features. Results showed that the proposed signal processing method can effectively extract EEG time-frequency features in the P300 speller, suggesting that ERSP and ITC may be useful for improving the performance of BCI P300 speller. © 2012 IEEE.published_or_final_versio
    • …
    corecore