CheapNET: Improving Light-weight speech enhancement network by projected loss function
Noise suppression and echo cancellation are critical in speech enhancement
and essential for smart devices and real-time communication. Deployed in voice
processing front-ends and edge devices, these algorithms must ensure efficient
real-time inference with low computational demands. Traditional edge-based
noise suppression often uses MSE-based amplitude spectrum mask training, but
this approach has limitations. We introduce a novel projection loss function,
diverging from MSE, to enhance noise suppression. This method uses projection
techniques to isolate key audio components from noise, significantly improving
model performance. For echo cancellation, the loss function enables direct
predictions on outputs pre-processed by linear acoustic echo cancellation
(LAEC), substantially improving performance. Our noise suppression model
achieves near state-of-the-art results with only 3.1M parameters and a
computational load of 0.4 GFLOPs. Moreover, our echo cancellation model
outperforms replicated industry-leading models, offering a new perspective
on speech enhancement.
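
The abstract does not spell out the projection loss itself. As a rough, non-authoritative sketch: one way to realize a projection loss is to decompose the enhanced signal into a component along the clean target and an orthogonal residual, then penalize the residual energy, similar in spirit to the projection step in SI-SDR. The function name `projection_loss` and the waveform-domain setup are assumptions, not the authors' formulation.

```python
import torch

def projection_loss(est: torch.Tensor, target: torch.Tensor,
                    eps: float = 1e-8) -> torch.Tensor:
    """Hypothetical projection-style loss (not the paper's exact formulation).

    Decomposes the estimate into a component along the clean target and an
    orthogonal residual, then maximizes the ratio of projected energy to
    residual energy. est, target: (batch, time) waveforms.
    """
    # Scale factor of the orthogonal projection of `est` onto `target`.
    dot = torch.sum(est * target, dim=-1, keepdim=True)
    energy = torch.sum(target * target, dim=-1, keepdim=True) + eps
    s_proj = (dot / energy) * target   # component explained by the clean target
    e_noise = est - s_proj             # component orthogonal to the target
    # Negative log energy ratio: minimizing this suppresses the residual.
    ratio = torch.sum(s_proj ** 2, dim=-1) / (torch.sum(e_noise ** 2, dim=-1) + eps)
    return -10.0 * torch.log10(ratio + eps).mean()
```
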
ACQ: Improving Generative Data-free Quantization Via Attention Correction
Data-free quantization aims to achieve model quantization without accessing
any authentic sample. It is significant in an application-oriented context
involving data privacy. Converting noise vectors into synthetic samples through
a generator is a popular data-free quantization method, which is called
generative data-free quantization. However, synthetic samples differ from
authentic samples in attention, a discrepancy that is usually ignored and that
restricts quantization performance. First, since synthetic samples of the same
class are prone to homogeneous attention, the quantized network can only learn
limited modes of attention. Second, synthetic samples exhibit different
attention in eval mode and training mode, so batch-normalization statistics
matching tends to be inaccurate. In this paper, we propose ACQ to correct the
attention of synthetic samples. To counter the homogenization of intra-class
attention, we establish a generator conditioned on attention center positions.
Constrained by an attention center matching loss, the attention center position
serves as the generator's condition input, guiding synthetic samples toward
diverse attention. Moreover, we design an adversarial loss over paired
synthetic samples generated under the same condition, preventing the generator
from attending excessively to the condition, which could cause mode collapse.
To improve the attention similarity of synthetic samples across network modes,
we introduce a consistency penalty that guarantees accurate BN statistics
matching. Experimental results demonstrate that ACQ effectively alleviates the
attention problems of synthetic samples. Under various training settings, ACQ
achieves the best quantization performance. For 4-bit quantization of
ResNet-18 and ResNet-50, ACQ reaches 67.55% and 72.23% accuracy, respectively.
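
As a hedged illustration of the attention-center idea (the paper's exact definitions are not given in the abstract): a spatial attention map can be taken as the channel-summed squared activations, its center as the attention-weighted mean coordinate, and the matching loss as an L2 penalty toward the center position fed to the generator as its condition. All helper names below are hypothetical.

```python
import torch

def attention_map(feat: torch.Tensor) -> torch.Tensor:
    """Spatial attention as channel-summed squared activations.
    feat: (batch, channels, H, W) -> (batch, H, W), normalized to sum to 1."""
    a = feat.pow(2).sum(dim=1)
    return a / (a.flatten(1).sum(dim=-1).view(-1, 1, 1) + 1e-8)

def attention_center(feat: torch.Tensor) -> torch.Tensor:
    """Attention-weighted mean (row, col) coordinate, shape (batch, 2).
    Hypothetical reading of the 'attention center position' in the abstract."""
    a = attention_map(feat)
    _, h, w = a.shape
    rows = torch.arange(h, dtype=a.dtype, device=a.device).view(1, h, 1)
    cols = torch.arange(w, dtype=a.dtype, device=a.device).view(1, 1, w)
    cy = (a * rows).flatten(1).sum(dim=-1)
    cx = (a * cols).flatten(1).sum(dim=-1)
    return torch.stack([cy, cx], dim=-1)

def center_matching_loss(feat: torch.Tensor,
                         condition_center: torch.Tensor) -> torch.Tensor:
    """Penalize deviation of a synthetic sample's attention center from the
    center position given to the generator as its condition (assumed L2)."""
    return torch.norm(attention_center(feat) - condition_center, dim=-1).mean()
```
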
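The BN statistics matching that the consistency penalty is meant to keep accurate is a standard objective in generative data-free quantization: the batch statistics of synthetic samples at every BatchNorm layer are matched to that layer's stored running statistics. A minimal sketch, assuming forward hooks and an L2 penalty (the attention-consistency term across training/eval modes is not shown):

```python
import torch
import torch.nn as nn

def bn_statistics_loss(model: nn.Module, synth: torch.Tensor) -> torch.Tensor:
    """Match batch statistics of synthetic samples to each BN layer's stored
    running statistics. A generic sketch, not ACQ's exact implementation."""
    losses, hooks = [], []

    def make_hook(bn: nn.BatchNorm2d):
        def hook(module, inputs, output):
            x = inputs[0]
            mean = x.mean(dim=(0, 2, 3))
            var = x.var(dim=(0, 2, 3), unbiased=False)
            losses.append(torch.norm(mean - bn.running_mean)
                          + torch.norm(var - bn.running_var))
        return hook

    # Attach a hook to every BN layer, run the synthetic batch, then clean up.
    for m in model.modules():
        if isinstance(m, nn.BatchNorm2d):
            hooks.append(m.register_forward_hook(make_hook(m)))
    model(synth)
    for h in hooks:
        h.remove()
    return torch.stack(losses).sum()
```
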