1,591 research outputs found
Low Rank and Sparsity Analysis Applied to Speech Enhancement via Online Estimated Dictionary
In this letter, we propose an online estimated local dictionary based single-channel speech enhancement algorithm, which focuses on low-rank and sparse matrix decomposition. In the proposed algorithm, a noisy speech spectrogram can be decomposed into low-rank background noise components and an activation of the online speech dictionary, on which both low-rank and sparsity constraints are imposed. This decomposition takes the advantage of local estimated exemplar’s high expressiveness on speech components and also accommodates nonstationary background noise. The local dictionary can be obtained through estimating the speech presence probability (SPP) by applying expectation–maximal algorithm, in which a generalized Gamma prior for speech magnitude spectrum is used. The proposed algorithm is evaluated using signal-to-distortion ratio, and perceptual evaluation of speech quality. The results show that the proposed algorithm achieves significant improvements at various SNRs when compared to four other speech enhancement algorithms, including improved Karhunen–Loeve transform approach, SPP-based MMSE, nonnegative matrix factorization-based robust principal component analysis (RPCA), and RPCA
Deep Learning Topological Invariants of Band Insulators
In this work we design and train deep neural networks to predict topological
invariants for one-dimensional four-band insulators in AIII class whose
topological invariant is the winding number, and two-dimensional two-band
insulators in A class whose topological invariant is the Chern number. Given
Hamiltonians in the momentum space as the input, neural networks can predict
topological invariants for both classes with accuracy close to or higher than
90%, even for Hamiltonians whose invariants are beyond the training data set.
Despite the complexity of the neural network, we find that the output of
certain intermediate hidden layers resembles either the winding angle for
models in AIII class or the solid angle (Berry curvature) for models in A
class, indicating that neural networks essentially capture the mathematical
formula of topological invariants. Our work demonstrates the ability of neural
networks to predict topological invariants for complicated models with local
Hamiltonians as the only input, and offers an example that even a deep neural
network is understandable.Comment: 8 pages, 5 figure
Wavelet Packet Transform based Speech Enhancement via Two-Dimensional SPP Estimator with Generalized Gamma Priors
Despite various speech enhancement techniques have been developed for different applications, existing methods are limited in noisy environments with high ambient noise levels. Speech presence probability (SPP) estimation is a speech enhancement technique to reduce speech distortions, especially in low signal-to-noise ratios (SNRs) scenario. In this paper, we propose a new two-dimensional (2D) Teager-energyoperators (TEOs) improved SPP estimator for speech enhancement in time-frequency (T-F) domain. Wavelet packet transform (WPT) as a multiband decomposition technique is used to concentrate the energy distribution of speech components. A minimum mean-square error (MMSE) estimator is obtained based on the generalized gamma distribution speech model in WPT domain. In addition, the speech samples corrupted by environment and occupational noises (i.e., machine shop, factory and station) at different input SNRs are used to validate the proposed algorithm. Results suggest that the proposed method achieves a significant enhancement on perceptual quality, compared with four conventional speech enhancement algorithms (i.e., MMSE-84, MMSE-04, Wiener-96, and BTW)
Applications and Comparison of Continuous Wavelets Transforms on Analysis of A-wave Impulse Noise
Noise induced hearing loss (NIHL) is a serious occupational related health problem worldwide. The A-wave impulse noise could cause severe hearing loss, and characteristics of such kind of impulse noise in the joint time-frequency (T-F) domain are critical for evaluation of auditory hazard level. This study focuses on the analysis of A-wave impulse noise in the T-F domain using continual wavelet transforms. Three different wavelets, referring to Morlet, Mexican hat, and Meyer wavelets, were investigated and compared based on theoretical analysis and applications to experimental generated A-wave impulse noise signals. The underlying theory of continuous wavelet transform was given and the temporal and spectral resolutions were theoretically analyzed. The main results showed that the Mexican hat wavelet demonstrated significant advantages over the Morlet and Meyer wavelets for the characterization and analysis of the A-wave impulse noise. The results of this study provide useful information for applying wavelet transform on signal processing of the A-wave impulse noise
Meromorphic Parahoric Higgs Torsors and Filtered Stokes G-local Systems on Curves
In this paper, we consider the wild nonabelian Hodge correspondence for
principal -bundles on curves, where is a connected complex reductive
group. We first give a version of Kobayashi--Hitchin correspondence, which
induces a one-to-one correspondence between stable meromorphic parahoric Higgs
torsors of degree zero (Dolbeault side) and stable meromorphic parahoric
connections of degree zero (de Rham side). Then, by introducing a notion of
stability condition on filtered Stokes local systems, we prove a one-to-one
correspondence between stable meromorphic parahoric connections of degree zero
(de Rham side) and stable filtered Stokes -local systems of degree zero
(Betti side). When , the main result in this paper
reduces to Biquad--Boalch's result.Comment: 26 page
MAPM: MULTI-MODAL AUTO-AGILE PROJECT RISK MANAGEMENT AND PREDICTION FOR COLLABORATION PLATFORM USING AI/ML
Agile software development tools have been created to assist project management and enhance productivity. However, it may be challenging to properly employ those tools, especially in a hybrid work environment. Techniques are presented herein, which may be referred to herein as a MaPM system, that leverage a conferencing platform to offer a real-time agile project risk management and prediction framework that utilizes multi-modal collaboration data sources. Aspects of the presented techniques encompass an artificial intelligence (AI)-backed summarization model that may be utilized to extract project details and an auto-agile that model may consume those loggings and generate the predicted project sprint backlogs and their risks. Further aspects of the presented techniques support an optimization module that may jointly update the predicted sprint backlogs and the estimated task risks to realize the finalized backlog sequences. In summary, an MaPM system, according to the presented techniques, offers four novelties compared with conventional agile project management tools – a conferencing platform-centralized solution for automatic agile project risk prediction and management, a real-time multi-modal-based project risk monitoring and prediction system, the generation of sprint backlogs based on fully evaluated contexts that are collected from all of a project’s participants, and the liberation of project contributors from having to conduct manual project tracking and recording
- …