Search CORE

276 research outputs found

Evaluation of Wind Turbine Operation Status Based on ACO + FAHP

Author: Wang Xueyao
Zhang Ruizhe
Publication venue: 'Universe Scientific Publishing Pte. Ltd.'
Publication date: 29/05/2023
Field of study

Aiming at the shortcomings of the fuzzy analytic hierarchy process (FAHP) in the comprehensive evaluation of wind power projects, such as the diffi culty of satisfying and modifying the consistency of the judgment matrix and the high computational complexity, a fuzzy analytic hierarchy process based on ant colony optimization (ACO+FAHP) is proposed. Firstly, the proposed fuzzy analytic hierarchy process based on ant colony optimization algorithm overcomes the disadvantages that the weight and consistency cannot be improved once the judgment matrix is given. The comparison chart of the consistency ratio calculated according to this method shows that the consistency ratio B, C1-C5 all have diff erent degrees of reduction. Then, in view of the fact that various qualitative indicators cannot be accurately calculated, the wind turbine operating status evaluation model is established by using the fuzzy comprehensive evaluation method. In this paper, the evaluation score of a certain wind farm is 0.731, which means that the operators need to carry out high-level maintenance at this time

Electronics Science Technology and Application (E-Journal)

BiHRNet: A Binary high-resolution network for Human Pose Estimation

Author: Dang Yonghao
Sun Xueyao
Yin Jianqin
Zhang Zhicheng
Publication venue
Publication date: 16/11/2023
Field of study

Human Pose Estimation (HPE) plays a crucial role in computer vision applications. However, it is difficult to deploy state-of-the-art models on resouce-limited devices due to the high computational costs of the networks. In this work, a binary human pose estimator named BiHRNet(Binary HRNet) is proposed, whose weights and activations are expressed as

\pm

1. BiHRNet retains the keypoint extraction ability of HRNet, while using fewer computing resources by adapting binary neural network (BNN). In order to reduce the accuracy drop caused by network binarization, two categories of techniques are proposed in this work. For optimizing the training process for binary pose estimator, we propose a new loss function combining KL divergence loss with AWing loss, which makes the binary network obtain more comprehensive output distribution from its real-valued counterpart to reduce information loss caused by binarization. For designing more binarization-friendly structures, we propose a new information reconstruction bottleneck called IR Bottleneck to retain more information in the initial stage of the network. In addition, we also propose a multi-scale basic block called MS-Block for information retention. Our work has less computation cost with few precision drop. Experimental results demonstrate that BiHRNet achieves a PCKh of 87.9 on the MPII dataset, which outperforms all binary pose estimation networks. On the challenging of COCO dataset, the proposed method enables the binary neural network to achieve 70.8 mAP, which is better than most tested lightweight full-precision networks.Comment: 12 pages, 6 figure

arXiv.org e-Print Archive

Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder

Author: Gu Yicheng
Wu Zhizheng
Xue Liumeng
Zhang Xueyao
Publication venue
Publication date: 25/11/2023
Field of study

Generative Adversarial Network (GAN) based vocoders are superior in inference speed and synthesis quality when reconstructing an audible waveform from an acoustic representation. This study focuses on improving the discriminator to promote GAN-based vocoders. Most existing time-frequency-representation-based discriminators are rooted in Short-Time Fourier Transform (STFT), whose time-frequency resolution in a spectrogram is fixed, making it incompatible with signals like singing voices that require flexible attention for different frequency bands. Motivated by that, our study utilizes the Constant-Q Transform (CQT), which owns dynamic resolution among frequencies, contributing to a better modeling ability in pitch accuracy and harmonic tracking. Specifically, we propose a Multi-Scale Sub-Band CQT (MS-SB-CQT) Discriminator, which operates on the CQT spectrogram at multiple scales and performs sub-band processing according to different octaves. Experiments conducted on both speech and singing voices confirm the effectiveness of our proposed method. Moreover, we also verified that the CQT-based and the STFT-based discriminators could be complementary under joint training. Specifically, enhanced by the proposed MS-SB-CQT and the existing MS-STFT Discriminators, the MOS of HiFi-GAN can be boosted from 3.27 to 3.87 for seen singers and from 3.40 to 3.78 for unseen singers

arXiv.org e-Print Archive

Zoom Out and Observe: News Environment Perception for Fake News Detection

Author: Cao Juan
Li Rundong
Sheng Qiang
Wang Danding
Zhang Xueyao
Zhu Yongchun
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 27/10/2022
Field of study

Fake news detection is crucial for preventing the dissemination of misinformation on social media. To differentiate fake news from real ones, existing methods observe the language patterns of the news post and "zoom in" to verify its content with knowledge sources or check its readers' replies. However, these methods neglect the information in the external news environment where a fake news post is created and disseminated. The news environment represents recent mainstream media opinion and public attention, which is an important inspiration of fake news fabrication because fake news is often designed to ride the wave of popular events and catch public attention with unexpected novel content for greater exposure and spread. To capture the environmental signals of news posts, we "zoom out" to observe the news environment and propose the News Environment Perception Framework (NEP). For each post, we construct its macro and micro news environment from recent mainstream news. Then we design a popularity-oriented and a novelty-oriented module to perceive useful signals and further assist final prediction. Experiments on our newly built datasets show that the NEP can efficiently improve the performance of basic fake news detectors.Comment: ACL 2022 Main Conference (Long Paper

arXiv.org e-Print Archive

SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion

Author: Han Jun
Wang Chaoren
Wang Mingxuan
Wu Zhizheng
Xue Liumeng
Zhang Xueyao
Publication venue
Publication date: 19/02/2024
Field of study

In this study, we present SingVisio, an interactive visual analysis system that aims to explain the diffusion model used in singing voice conversion. SingVisio provides a visual display of the generation process in diffusion models, showcasing the step-by-step denoising of the noisy spectrum and its transformation into a clean spectrum that captures the desired singer's timbre. The system also facilitates side-by-side comparisons of different conditions, such as source content, melody, and target timbre, highlighting the impact of these conditions on the diffusion generation process and resulting conversions. Through comprehensive evaluations, SingVisio demonstrates its effectiveness in terms of system design, functionality, explainability, and user-friendliness. It offers users of various backgrounds valuable learning experiences and insights into the diffusion model for singing voice conversion

arXiv.org e-Print Archive

Refined drift chamber simulation in the CEPC experiment

Author: Fang Wenxing
Huang Xingtao
Li Weidong
Lin Tao
Liu Mengyao
Yuan Ye
Zhang Xueyao
Zhang Yao
Publication venue: EDP Sciences
Publication date: 01/01/2024
Field of study

The Circular Electron Positron Collider (CEPC) is a future experiment aimed at studying the properties of the Higgs boson with high precision. This requires excellent track reconstruction and particle identification (PID) performance, which is achieved in the 4th conceptual detector design of the CEPC experiments by combining a silicon tracker and a drift chamber. The drift chamber not only improves track reconstruction but also provides excellent PID with the cluster counting method. To evaluate the performance of this design accurately, a detailed simulation is necessary. In this paper, we present a refined drift chamber simulation by combining Geant4 and Garfield++. However, traditional waveform simulation using Garfield++ is extremely time-consuming, which motivates us to develop a fast waveform simulation method using a neural network. We validate the method using real data from the BESIII experiment. The results demonstrate the effectiveness of our approach and provide valuable insights for future experiments

Directory of Open Access Journals