Search CORE

1,591 research outputs found

Low Rank and Sparsity Analysis Applied to Speech Enhancement via Online Estimated Dictionary

Author: Qin Jun
Sun Pengfei
Publication venue: OpenSIUC
Publication date: 29/09/2016
Field of study

In this letter, we propose an online estimated local dictionary based single-channel speech enhancement algorithm, which focuses on low-rank and sparse matrix decomposition. In the proposed algorithm, a noisy speech spectrogram can be decomposed into low-rank background noise components and an activation of the online speech dictionary, on which both low-rank and sparsity constraints are imposed. This decomposition takes the advantage of local estimated exemplar’s high expressiveness on speech components and also accommodates nonstationary background noise. The local dictionary can be obtained through estimating the speech presence probability (SPP) by applying expectation–maximal algorithm, in which a generalized Gamma prior for speech magnitude spectrum is used. The proposed algorithm is evaluated using signal-to-distortion ratio, and perceptual evaluation of speech quality. The results show that the proposed algorithm achieves significant improvements at various SNRs when compared to four other speech enhancement algorithms, including improved Karhunen–Loeve transform approach, SPP-based MMSE, nonnegative matrix factorization-based robust principal component analysis (RPCA), and RPCA

arXiv.org e-Print Archive

OpenSIUC

Deep Learning Topological Invariants of Band Insulators

Author: Shen Huitao
Sun Ning
Yi Jinmin
Zhai Hui
Zhang Pengfei
Publication venue: 'American Physical Society (APS)'
Publication date: 09/06/2018
Field of study

In this work we design and train deep neural networks to predict topological invariants for one-dimensional four-band insulators in AIII class whose topological invariant is the winding number, and two-dimensional two-band insulators in A class whose topological invariant is the Chern number. Given Hamiltonians in the momentum space as the input, neural networks can predict topological invariants for both classes with accuracy close to or higher than 90%, even for Hamiltonians whose invariants are beyond the training data set. Despite the complexity of the neural network, we find that the output of certain intermediate hidden layers resembles either the winding angle for models in AIII class or the solid angle (Berry curvature) for models in A class, indicating that neural networks essentially capture the mathematical formula of topological invariants. Our work demonstrates the ability of neural networks to predict topological invariants for complicated models with local Hamiltonians as the only input, and offers an example that even a deep neural network is understandable.Comment: 8 pages, 5 figure

arXiv.org e-Print Archive

DSpace@MIT

Wavelet Packet Transform based Speech Enhancement via Two-Dimensional SPP Estimator with Generalized Gamma Priors

Author: Qin Jun
Sun Pengfei
Publication venue: OpenSIUC
Publication date: 01/01/2016
Field of study

Despite various speech enhancement techniques have been developed for different applications, existing methods are limited in noisy environments with high ambient noise levels. Speech presence probability (SPP) estimation is a speech enhancement technique to reduce speech distortions, especially in low signal-to-noise ratios (SNRs) scenario. In this paper, we propose a new two-dimensional (2D) Teager-energyoperators (TEOs) improved SPP estimator for speech enhancement in time-frequency (T-F) domain. Wavelet packet transform (WPT) as a multiband decomposition technique is used to concentrate the energy distribution of speech components. A minimum mean-square error (MMSE) estimator is obtained based on the generalized gamma distribution speech model in WPT domain. In addition, the speech samples corrupted by environment and occupational noises (i.e., machine shop, factory and station) at different input SNRs are used to validate the proposed algorithm. Results suggest that the proposed method achieves a significant enhancement on perceptual quality, compared with four conventional speech enhancement algorithms (i.e., MMSE-84, MMSE-04, Wiener-96, and BTW)

Biblioteka Nauki - repozytorium artykuÅÃ³w

OpenSIUC

Applications and Comparison of Continuous Wavelets Transforms on Analysis of A-wave Impulse Noise

Author: Qin Jun
Sun Pengfei
Publication venue: OpenSIUC
Publication date: 01/01/2015
Field of study

Noise induced hearing loss (NIHL) is a serious occupational related health problem worldwide. The A-wave impulse noise could cause severe hearing loss, and characteristics of such kind of impulse noise in the joint time-frequency (T-F) domain are critical for evaluation of auditory hazard level. This study focuses on the analysis of A-wave impulse noise in the T-F domain using continual wavelet transforms. Three different wavelets, referring to Morlet, Mexican hat, and Meyer wavelets, were investigated and compared based on theoretical analysis and applications to experimental generated A-wave impulse noise signals. The underlying theory of continuous wavelet transform was given and the temporal and spectral resolutions were theoretically analyzed. The main results showed that the Mexican hat wavelet demonstrated significant advantages over the Morlet and Meyer wavelets for the characterization and analysis of the A-wave impulse noise. The results of this study provide useful information for applying wavelet transform on signal processing of the A-wave impulse noise

Biblioteka Nauki - repozytorium artykuÅÃ³w

OpenSIUC

Meromorphic Parahoric Higgs Torsors and Filtered Stokes G-local Systems on Curves

Author: Huang Pengfei
Sun Hao
Publication venue
Publication date: 09/12/2022
Field of study

In this paper, we consider the wild nonabelian Hodge correspondence for principal

G

-bundles on curves, where

G

is a connected complex reductive group. We first give a version of Kobayashi--Hitchin correspondence, which induces a one-to-one correspondence between stable meromorphic parahoric Higgs torsors of degree zero (Dolbeault side) and stable meromorphic parahoric connections of degree zero (de Rham side). Then, by introducing a notion of stability condition on filtered Stokes local systems, we prove a one-to-one correspondence between stable meromorphic parahoric connections of degree zero (de Rham side) and stable filtered Stokes

G

-local systems of degree zero (Betti side). When

G={\rm GL}_n(\mathbb{C})

, the main result in this paper reduces to Biquad--Boalch's result.Comment: 26 page

arXiv.org e-Print Archive

MAPM: MULTI-MODAL AUTO-AGILE PROJECT RISK MANAGEMENT AND PREDICTION FOR COLLABORATION PLATFORM USING AI/ML

Author: Shao Qihong
Sun Pengfei
Publication venue: Technical Disclosure Commons
Publication date: 22/05/2023
Field of study

Agile software development tools have been created to assist project management and enhance productivity. However, it may be challenging to properly employ those tools, especially in a hybrid work environment. Techniques are presented herein, which may be referred to herein as a MaPM system, that leverage a conferencing platform to offer a real-time agile project risk management and prediction framework that utilizes multi-modal collaboration data sources. Aspects of the presented techniques encompass an artificial intelligence (AI)-backed summarization model that may be utilized to extract project details and an auto-agile that model may consume those loggings and generate the predicted project sprint backlogs and their risks. Further aspects of the presented techniques support an optimization module that may jointly update the predicted sprint backlogs and the estimated task risks to realize the finalized backlog sequences. In summary, an MaPM system, according to the presented techniques, offers four novelties compared with conventional agile project management tools – a conferencing platform-centralized solution for automatic agile project risk prediction and management, a real-time multi-modal-based project risk monitoring and prediction system, the generation of sprint backlogs based on fully evaluated contexts that are collected from all of a project’s participants, and the liberation of project contributors from having to conduct manual project tracking and recording

Technical Disclosure Common