Search CORE

66 research outputs found

Spectrally resolved two-photon interference in a modified Hong-Ou-Mandel interferometer

Author: Chen Changhua
Dong Ruifang
Jin Ruibo
Li Baihong
Quan Runai
Xiang Xiao
Yuan Boxin
Zhang Shougang
Publication venue
Publication date: 19/07/2022
Field of study

A modified Hong-Ou-Mandel(HOM) interference reveals that the two-photon interference phenomenon can be explained only by the concept of a two-photon wave packet rather than the single-photon one. Previously, the measurements for such interference were usually performed in the time domain where the spectral information of the involved photons was integrated and lost during the measurement. Here, we theoretically explore the spectrally resolved two-photon interference for the modified HOM interferometer both in the cases of CW pump and pulse pump. It is found that, in the CW-pumped case, a one-dimensional (1D) temporal interferogram can be directly recovered by projecting a 2D spectrally resolved interferogram at different phases, without a standard delay-scanning. In the pulse-pumped case, the joint spectral intensity is phase-dependent and can be modulated by the time delay along the directions of both frequency sum and frequency difference between signal and idler photons, which may provide a versatile way to generate high-dimensional frequency entanglement and engineer high-dimensional quantum states. These results not only show more rich spectral information that cannot be extracted from the time domain, but also shed new light on a comprehensive understanding of the two-photon interference phenomenon in the frequency domain.Comment: 13 pages, 6 figure

arXiv.org e-Print Archive

Analysis of adverse drug reactions of Denosumab (Prolia) in osteoporosis based on FDA adverse event reporting system (FAERS)

Author: Jin Chen
Ruibo Li
Xi Chen
Xingyue Yuan
Yili Ou
Publication venue: Frontiers Media S.A.
Publication date: 01/06/2024
Field of study

ObjectiveTo comprehensively analyze the ADRs associated with Denosumab (Prolia) in the treatment of osteoporosis using data from the FAERS database, and gain a better understanding of the potential risks and side effects of Denosumab (Prolia) therapy.MethodsData of Denosumab (Prolia) were collected from the FAERS database covering the period from first quarter of 2010 to the third quarter of 2023. Disproportionality analysis was performed by calculating the reporting odds ratios (ROR), proportional reporting ratio (PRR), and Bayesian analysis confidence propagation neural network (BCPNN) to detect positive signals.ResultsTotally, 17,985,365 reports were collected from the FAERS database, 1,97,807 reports of Denosumab (Prolia) were identified as the “primary suspected (PS)” ADRs. Denosumab (Prolia) induced ADRs occurred in 27 organ systems. 38 significant disproportionality PTs satisfying with the three algorithms were retained at the same time. Unexpected significant ADRs such as bone density abnormal and immobile also occur. The majority of the ADRs occurred within the first 30 days after Denosumab (Prolia) initiation.ConclusionBased on the American FAERS database, the high frequency ADRs of Denosumab (Prolia) were hypocalcaemia, bone density abnormal, eczema, rebound effect, spinal deformity, etc. Clinical use of this drug should focus on this part of ADRs. Attention should also be paid to newly discovered ADRs, such as immobile, menopausal symptoms, etc., to avoid more serious consequences. Cohort studies, more detailed and comprehensive case information, and long-term clinical investigations are needed to confirm these results and to further understand the safety profile of Denosumab (Prolia)

Directory of Open Access Journals

Chinese Open Instruction Generalist: A Preliminary Release

Author: Dong Siwei
Fu Jie
Huang Wenhao
Li Yizhi
Li Zhaoqun
Lin Chenghua
Liu Ruibo
Shi Yemin
Shu Yu
Wang Zekun
Yuan Ruibin
Zhang Ge
Publication venue
Publication date: 18/04/2023
Field of study

Instruction tuning is widely recognized as a key technique for building generalist language models, which has attracted the attention of researchers and the public with the release of InstructGPT~\citep{ouyang2022training} and ChatGPT\footnote{\url{https://chat.openai.com/}}. Despite impressive progress in English-oriented large-scale language models (LLMs), it is still under-explored whether English-based foundation LLMs can perform similarly on multilingual tasks compared to English tasks with well-designed instruction tuning and how we can construct the corpora needed for the tuning. To remedy this gap, we propose the project as an attempt to create a Chinese instruction dataset by various methods adapted to the intrinsic characteristics of 4 sub-tasks. We collect around 200k Chinese instruction tuning samples, which have been manually checked to guarantee high quality. We also summarize the existing English and Chinese instruction corpora and briefly describe some potential applications of the newly constructed Chinese instruction corpora. The resulting \textbf{C}hinese \textbf{O}pen \textbf{I}nstruction \textbf{G}eneralist (\textbf{COIG}) corpora are available in Huggingface\footnote{\url{https://huggingface.co/datasets/BAAI/COIG}} and Github\footnote{\url{https://github.com/FlagOpen/FlagInstruct}}, and will be continuously updated

arXiv.org e-Print Archive

On the Effectiveness of Speech Self-supervised Learning for Music

Author: Benetos Emmanouil
Chen Xingran
Dannenberg Roger
Fu Jie
Guo Yike
Gyenge Norbert
Li Yizhi
Lin Chenghua
Liu Ruibo
Ma Yinghao
Ragni Anton
Xia Gus
Yin Hanzhi
Yuan Ruibin
Zhang Ge
Publication venue
Publication date: 11/07/2023
Field of study

Self-supervised learning (SSL) has shown promising results in various speech and natural language processing applications. However, its efficacy in music information retrieval (MIR) still remains largely unexplored. While previous SSL models pre-trained on music recordings may have been mostly closed-sourced, recent speech models such as wav2vec2.0 have shown promise in music modelling. Nevertheless, research exploring the effectiveness of applying speech SSL models to music recordings has been limited. We explore the music adaption of SSL with two distinctive speech-related models, data2vec1.0 and Hubert, and refer to them as music2vec and musicHuBERT, respectively. We train

12

SSL models with 95M parameters under various pre-training configurations and systematically evaluate the MIR task performances with 13 different MIR tasks. Our findings suggest that training with music data can generally improve performance on MIR tasks, even when models are trained using paradigms designed for speech. However, we identify the limitations of such existing speech-oriented designs, especially in modelling polyphonic information. Based on the experimental results, empirical suggestions are also given for designing future musical SSL strategies and paradigms

arXiv.org e-Print Archive

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training

Author: Benetos Emmanouil
Chen Wenhu
Chen Xingran
Dannenberg Roger
Fu Jie
Guo Yike
Gyenge Norbert
Huang Wenhao
Li Yizhi
Lin Chenghua
Liu Ruibo
Ma Yinghao
Ragni Anton
Shi Yemin
Xia Gus
Yin Hanzhi
Yuan Ruibin
Zhang Ge
Publication venue
Publication date: 31/05/2023
Field of study

Self-supervised learning (SSL) has recently emerged as a promising paradigm for training generalisable models on large-scale data in the fields of vision, text, and speech. Although SSL has been proven effective in speech and audio, its application to music audio has yet to be thoroughly explored. This is primarily due to the distinctive challenges associated with modelling musical knowledge, particularly its tonal and pitched characteristics of music. To address this research gap, we propose an acoustic Music undERstanding model with large-scale self-supervised Training (MERT), which incorporates teacher models to provide pseudo labels in the masked language modelling (MLM) style acoustic pre-training. In our exploration, we identified a superior combination of teacher models, which outperforms conventional speech and audio approaches in terms of performance. This combination includes an acoustic teacher based on Residual Vector Quantization - Variational AutoEncoder (RVQ-VAE) and a musical teacher based on the Constant-Q Transform (CQT). These teachers effectively guide our student model, a BERT-style transformer encoder, to better model music audio. In addition, we introduce an in-batch noise mixture augmentation to enhance the representation robustness. Furthermore, we explore a wide range of settings to overcome the instability in acoustic language model pre-training, which allows our designed paradigm to scale from 95M to 330M parameters. Experimental results indicate that our model can generalise and perform well on 14 music understanding tasks and attains state-of-the-art (SOTA) overall scores. The code and models are online: https://github.com/yizhilll/MERT

arXiv.org e-Print Archive

ADD 2023: the Second Audio Deepfake Detection Challenge

Author: Fu Ruibo
Gu Hao
Li Haizhou
Lian Zheng
Liang Shan
Nie Shuai
Ren Yong
Tao Jianhua
Wang Chenglong
Wang Tao
Wen Zhengqi
Xu Le
Yan Xinrui
Yi Jiangyan
Zhang Chu Yuan
Zhang Xiaohui
Zhao Yan
Zhou Junzuo
Publication venue
Publication date: 23/05/2023
Field of study

Audio deepfake detection is an emerging topic in the artificial intelligence community. The second Audio Deepfake Detection Challenge (ADD 2023) aims to spur researchers around the world to build new innovative technologies that can further accelerate and foster research on detecting and analyzing deepfake speech utterances. Different from previous challenges (e.g. ADD 2022), ADD 2023 focuses on surpassing the constraints of binary real/fake classification, and actually localizing the manipulated intervals in a partially fake speech as well as pinpointing the source responsible for generating any fake audio. Furthermore, ADD 2023 includes more rounds of evaluation for the fake audio game sub-challenge. The ADD 2023 challenge includes three subchallenges: audio fake game (FG), manipulation region location (RL) and deepfake algorithm recognition (AR). This paper describes the datasets, evaluation metrics, and protocols. Some findings are also reported in audio deepfake detection tasks

arXiv.org e-Print Archive

Real-Time Detection of Mango Based on Improved YOLOv4

Author: Ruibo Yuan
Zhipeng Cao
Publication venue: 'MDPI AG'
Publication date: 01/11/2022
Field of study

Agricultural mechanization occupies a key position in modern agriculture. Aiming at the fruit recognition target detection part of the picking robot, a mango recognition method based on an improved YOLOv4 network structure is proposed, which can quickly and accurately identify and locate mangoes. The method improves the recognition accuracy of the width adjustment network, then reduces the ResNet (Residual Networks) module to adjust the neck network to improve the prediction speed, and finally adds CBAM (Convolutional Block Attention Module) to improve the prediction accuracy of the network. The newly improved network model is YOLOv4-LightC-CBAM. The training results show that the mAP (mean Average Precision) obtained by YOLOV4-LightC-CBAM is 95.12%, which is 3.93% higher than YOLOv4. Regarding detection speed, YOLOV4-LightC-CBAM is up to 45.4 frames, which is 85.3% higher than YOLOv4. The results show that the modified network can recognize mangoes better, faster, and more accurately

Directory of Open Access Journals

Application of electro-hydraulic proportional control in cathode rod pulling out system of lead electrolysis

Author: Bo Song
Dajun Yang
Ruibo Yuan
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 01/10/2018
Field of study

In the process of lead electrolysis, due to its production characteristics, it is necessary to extract and collect the cathode conductive rod on the cathode plate after electrolysis, so as to make it reusable. At present, most of the lead electrolytic manufacturers in China still rely on manual extraction in this process. In this study, a rapid, stable and effective cathode rod extraction equipment for lead electrolysis is designed by means of electro-hydraulic proportional technology. AMESim simulation and experimental research on the equipment are also carried out

Directory of Open Access Journals

Performance prediction of tobacco flavouring using response surface methodology and artificial neural network

Author: Lin Chen
Ruibo Yuan
Ze Liu
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 01/10/2018
Field of study

This study was to predict the optimum condition for leaf flavouring in cigarette manufacturing. To this purpose, an integrated research was used by using response surface and artificial neural network. A series of tobacco flavouring experiment's factors were designed by Experimental Design software. The MATLAB software's Neural Network function was used to forecast the responses, and the optimal solution configuration was coming out from the Response Surface Analysis Method. In the optimum condition, moisture removal opening, roller speed and tobacco process flow, pressure and feed liquid gas ejector flow are 18.60%, 10.74 rpm, 5314.11 kg/h, 3.70 bar and 243.63 kg/h, uniformity of the evaluation index and the utilization rate of material liquid distribution are 93.088% and 98.694%. With the corresponding experimental, results are consistent, under the condition of the error to less 7%, the test results show that through a few experimental data of predictive results of the neural network and response surface design has a certain practicability

Directory of Open Access Journals