281 research outputs found
Evaluation of Wind Turbine Operation Status Based on ACO + FAHP
Aiming at the shortcomings of the fuzzy analytic hierarchy process (FAHP) in the comprehensive evaluation of wind power
projects, such as the diffi culty of satisfying and modifying the consistency of the judgment matrix and the high computational complexity, a
fuzzy analytic hierarchy process based on ant colony optimization (ACO+FAHP) is proposed. Firstly, the proposed fuzzy analytic hierarchy
process based on ant colony optimization algorithm overcomes the disadvantages that the weight and consistency cannot be improved once
the judgment matrix is given. The comparison chart of the consistency ratio calculated according to this method shows that the consistency
ratio B, C1-C5 all have diff erent degrees of reduction. Then, in view of the fact that various qualitative indicators cannot be accurately
calculated, the wind turbine operating status evaluation model is established by using the fuzzy comprehensive evaluation method. In this
paper, the evaluation score of a certain wind farm is 0.731, which means that the operators need to carry out high-level maintenance at this
time
BiHRNet: A Binary high-resolution network for Human Pose Estimation
Human Pose Estimation (HPE) plays a crucial role in computer vision
applications. However, it is difficult to deploy state-of-the-art models on
resouce-limited devices due to the high computational costs of the networks. In
this work, a binary human pose estimator named BiHRNet(Binary HRNet) is
proposed, whose weights and activations are expressed as 1. BiHRNet
retains the keypoint extraction ability of HRNet, while using fewer computing
resources by adapting binary neural network (BNN). In order to reduce the
accuracy drop caused by network binarization, two categories of techniques are
proposed in this work. For optimizing the training process for binary pose
estimator, we propose a new loss function combining KL divergence loss with
AWing loss, which makes the binary network obtain more comprehensive output
distribution from its real-valued counterpart to reduce information loss caused
by binarization. For designing more binarization-friendly structures, we
propose a new information reconstruction bottleneck called IR Bottleneck to
retain more information in the initial stage of the network. In addition, we
also propose a multi-scale basic block called MS-Block for information
retention. Our work has less computation cost with few precision drop.
Experimental results demonstrate that BiHRNet achieves a PCKh of 87.9 on the
MPII dataset, which outperforms all binary pose estimation networks. On the
challenging of COCO dataset, the proposed method enables the binary neural
network to achieve 70.8 mAP, which is better than most tested lightweight
full-precision networks.Comment: 12 pages, 6 figure
Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder
Generative Adversarial Network (GAN) based vocoders are superior in inference
speed and synthesis quality when reconstructing an audible waveform from an
acoustic representation. This study focuses on improving the discriminator to
promote GAN-based vocoders. Most existing time-frequency-representation-based
discriminators are rooted in Short-Time Fourier Transform (STFT), whose
time-frequency resolution in a spectrogram is fixed, making it incompatible
with signals like singing voices that require flexible attention for different
frequency bands. Motivated by that, our study utilizes the Constant-Q Transform
(CQT), which owns dynamic resolution among frequencies, contributing to a
better modeling ability in pitch accuracy and harmonic tracking. Specifically,
we propose a Multi-Scale Sub-Band CQT (MS-SB-CQT) Discriminator, which operates
on the CQT spectrogram at multiple scales and performs sub-band processing
according to different octaves. Experiments conducted on both speech and
singing voices confirm the effectiveness of our proposed method. Moreover, we
also verified that the CQT-based and the STFT-based discriminators could be
complementary under joint training. Specifically, enhanced by the proposed
MS-SB-CQT and the existing MS-STFT Discriminators, the MOS of HiFi-GAN can be
boosted from 3.27 to 3.87 for seen singers and from 3.40 to 3.78 for unseen
singers
Zoom Out and Observe: News Environment Perception for Fake News Detection
Fake news detection is crucial for preventing the dissemination of
misinformation on social media. To differentiate fake news from real ones,
existing methods observe the language patterns of the news post and "zoom in"
to verify its content with knowledge sources or check its readers' replies.
However, these methods neglect the information in the external news environment
where a fake news post is created and disseminated. The news environment
represents recent mainstream media opinion and public attention, which is an
important inspiration of fake news fabrication because fake news is often
designed to ride the wave of popular events and catch public attention with
unexpected novel content for greater exposure and spread. To capture the
environmental signals of news posts, we "zoom out" to observe the news
environment and propose the News Environment Perception Framework (NEP). For
each post, we construct its macro and micro news environment from recent
mainstream news. Then we design a popularity-oriented and a novelty-oriented
module to perceive useful signals and further assist final prediction.
Experiments on our newly built datasets show that the NEP can efficiently
improve the performance of basic fake news detectors.Comment: ACL 2022 Main Conference (Long Paper
SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion
In this study, we present SingVisio, an interactive visual analysis system
that aims to explain the diffusion model used in singing voice conversion.
SingVisio provides a visual display of the generation process in diffusion
models, showcasing the step-by-step denoising of the noisy spectrum and its
transformation into a clean spectrum that captures the desired singer's timbre.
The system also facilitates side-by-side comparisons of different conditions,
such as source content, melody, and target timbre, highlighting the impact of
these conditions on the diffusion generation process and resulting conversions.
Through comprehensive evaluations, SingVisio demonstrates its effectiveness in
terms of system design, functionality, explainability, and user-friendliness.
It offers users of various backgrounds valuable learning experiences and
insights into the diffusion model for singing voice conversion
Refined drift chamber simulation in the CEPC experiment
The Circular Electron Positron Collider (CEPC) is a future experiment aimed at studying the properties of the Higgs boson with high precision. This requires excellent track reconstruction and particle identification (PID) performance, which is achieved in the 4th conceptual detector design of the CEPC experiments by combining a silicon tracker and a drift chamber. The drift chamber not only improves track reconstruction but also provides excellent PID with the cluster counting method. To evaluate the performance of this design accurately, a detailed simulation is necessary. In this paper, we present a refined drift chamber simulation by combining Geant4 and Garfield++. However, traditional waveform simulation using Garfield++ is extremely time-consuming, which motivates us to develop a fast waveform simulation method using a neural network. We validate the method using real data from the BESIII experiment. The results demonstrate the effectiveness of our approach and provide valuable insights for future experiments
- …