Search CORE

58 research outputs found

Conversational Co-Speech Gesture Generation via Modeling Dialog Intention, Emotion, and Context with Diffusion Models

Author: Dai Zonghong
Li Minglei
Meng Helen
Wu Zhiyong
Xue Haiwei
Yang Sicheng
Zhang Zhensong
Publication venue
Publication date: 10/01/2024
Field of study

Audio-driven co-speech human gesture generation has made remarkable advancements recently. However, most previous works only focus on single person audio-driven gesture generation. We aim at solving the problem of conversational co-speech gesture generation that considers multiple participants in a conversation, which is a novel and challenging task due to the difficulty of simultaneously incorporating semantic information and other relevant features from both the primary speaker and the interlocutor. To this end, we propose CoDiffuseGesture, a diffusion model-based approach for speech-driven interaction gesture generation via modeling bilateral conversational intention, emotion, and semantic context. Our method synthesizes appropriate interactive, speech-matched, high-quality gestures for conversational motions through the intention perception module and emotion reasoning module at the sentence level by a pretrained language model. Experimental results demonstrate the promising performance of the proposed method.Comment: 5 pages,2 figures, Accepted for publication at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024

arXiv.org e-Print Archive

Effect of Natural Nanostructured Rods and Platelets on Mechanical and Water Resistance Properties of Alginate-Based Nanocomposites

Author: Dajian Huang
Qiling Quan
Zhuo Zhang
Zonghong Ma
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

A series of biopolymer-based nanocomposite films were prepared by incorporating natural one-dimensional (1D) palygorskite (PAL) nanorods, and two-dimensional (2D) montmorillonite (MMT) nanoplatelets into sodium alginate (SA) film by a simple solution casting method. The effect of different dimensions of nanoclays on the mechanical, water resistance, and light transmission properties of the SA/PAL or MMT nanocomposite films were studied. The field-emission scanning electron microscopy (FE-SEM) result showed that PAL can disperse better than MMT in the SA matrix in the case of the same addition amount. The incorporation of both PAL and MMT into the SA film can enhance the tensile strength (TS) and water resistance capability of the film. At a high content of nanoclays, the SA/PAL nanocomposite film shows relatively higher TS, and better water resistance than the SA/MMT nanocomposite film. The SA/MMT nanocomposite films have better light transmission than SA/PAL nanocomposite film at the same loading amount of nanoclays. These results demonstrated that 1D PAL nanorods are more suitable candidate of inorganic filler to improve the mechanical and water resistance properties of biopolymers/nanoclays nanocomposites

Directory of Open Access Journals

Frontiers - Publisher Connector

A note on Cauchy-Lipschitz-Picard theorem

Author: Fengying Li
H Brezis
HG Thomas
Shiqing Zhang
Ying Lv
Zonghong Feng
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons

Author: Dai Zonghong
Hao Lei
Huang Qiaochu
Li Minglei
Wang Zilin
Wu Xiaofei
Wu Zhiyong
Xu Songcen
yang changpeng
Yang Sicheng
Zhang Zhensong
Publication venue
Publication date: 13/09/2023
Field of study

The automatic co-speech gesture generation draws much attention in computer animation. Previous works designed network structures on individual datasets, which resulted in a lack of data volume and generalizability across different motion capture standards. In addition, it is a challenging task due to the weak correlation between speech and gestures. To address these problems, we present UnifiedGesture, a novel diffusion model-based speech-driven gesture synthesis approach, trained on multiple gesture datasets with different skeletons. Specifically, we first present a retargeting network to learn latent homeomorphic graphs for different motion capture standards, unifying the representations of various gestures while extending the dataset. We then capture the correlation between speech and gestures based on a diffusion model architecture using cross-local attention and self-attention to generate better speech-matched and realistic gestures. To further align speech and gesture and increase diversity, we incorporate reinforcement learning on the discrete gesture units with a learned reward function. Extensive experiments show that UnifiedGesture outperforms recent approaches on speech-driven gesture generation in terms of CCA, FGD, and human-likeness. All code, pre-trained models, databases, and demos are available to the public at https://github.com/YoungSeng/UnifiedGesture.Comment: 16 pages, 11 figures, ACM MM 202

arXiv.org e-Print Archive

Sub-second periodic radio oscillations in a microquasar

Author: Chen Jiashi
Chen Xiao
Dai Zigao
Gan Hengqian
Jiang Peng
Li Di
Liu Jifeng
Liu Qingzhong
Pan Zhichen
Sai Na
Sun Xiaohui
Tian Pengfu
Wang Pei
Wang Wei
Wu Xuefeng
Yuan Feng
Zhang Bing
Zhang Ping
Zhang Shuangnan
Zheng Zheng
Zhu Zonghong
Publication venue
Publication date: 26/07/2023
Field of study

Powerful relativistic jets are one of the ubiquitous features of accreting black holes in all scales. GRS 1915+105 is a well-known fast-spinning black-hole X-ray binary with a relativistic jet, termed as a ``microquasar'', as indicated by its superluminal motion of radio emission. It exhibits persistent x-ray activity over the last 30 years, with quasi-periodic oscillations of

\sim 1-10

Hz and 34 and 67 Hz in the x-ray band. These oscillations likely originate in the inner accretion disk, but other origins have been considered. Radio observations found variable light curves with quasi-periodic flares or oscillations with periods of

\sim 20-50

minutes. Here we report two instances of

\sim

5 Hz transient periodic oscillation features from the source detected in the 1.05-1.45 GHz radio band that occurred in January 2021 and June 2022, respectively. Circular polarization was also observed during the oscillation phase.Comment: The author version of the article which will appear in Nature on 26 July 2023, 32 pages including the extended data. The online publication version can be found at the following URL: https://www.nature.com/articles/s41586-023-06336-

arXiv.org e-Print Archive

Transposable elements cause the loss of self-incompatibility in citrus

Author: Bosch Maurice
Cao Zonghong
Chai Lijun
Chen Peng
Deng Xiuxin
Du Zezhen
Franklin‐tong Vernonica e.
Guo Furong
Hu Jianbing
Jiang Jingdong
Larkin Robert m.
Lin Zongcheng
Liu Chenchen
Shi Chunmei
Song Dan
Wang Nan
Wei Zhuangmin
Xu Qiang
Ye Junli
Zhang Siqi
Zhu Chenqiao
Publication venue
Publication date: 01/12/2023
Field of study

Self-incompatibility (SI) is a widespread prezygotic mechanism for flowering plants to avoid inbreeding depression and promote genetic diversity. Citrus has an S-RNase-based SI system, which was frequently lost during evolution. We previously identified a single nucleotide mutation in Sm-RNase, which is responsible for the loss of SI in mandarin and its hybrids. However, little is known about other mechanisms responsible for conversion of SI to self-compatibility (SC) and we identify a completely different mechanism widely utilized by citrus. Here, we found a 786-bp miniature inverted-repeat transposable element (MITE) insertion in the promoter region of the FhiS2-RNase in Fortunella hindsii Swingle (a model plant for citrus gene function), which does not contain the Sm-RNase allele but are still SC. We demonstrate that this MITE plays a pivotal role in the loss of SI in citrus, providing evidence that this MITE insertion prevents expression of the S-RNase; moreover, transgenic experiments show that deletion of this 786-bp MITE insertion recovers the expression of FhiS2-RNase and restores SI. This study identifies the first evidence for a role for MITEs at the S-locus affecting the SI phenotype. A family-wide survey of the S-locus revealed that MITE insertions occur frequently adjacent to S-RNase alleles in different citrus genera, but only certain MITEs appear to be responsible for the loss of SI. Our study provides evidence that insertion of MITEs into a promoter region can alter a breeding strategy and suggests that this phenomenon may be broadly responsible for SC in species with the S-RNase system

University of Birmingham Research Portal

Yi: Open Foundation Models by 01.AI

Author: :
AI 01.
Cai Yuxuan
Chang Jing
Chen Bei
Chen Jianqun
Dai Zonghong
Gu Zhenyu
Hu Xiaohui
Huang Chengen
Huang Wenhao
Li Chao
Li Heng
Liu Peng
Liu Qiang
Liu Yudong
Liu Zhiyuan
Nie Pengcheng
Niu Xinyao
Ren Xiaoyi
Wang Yue
Xie Wen
Xu Yuchi
Yang Senbin
Yang Shiming
Young Alex
Yu Kaidong
Yu Tao
Yue Shawn
Zhang Ge
Zhang Guanwei
Zhu Jiangcheng
Publication venue
Publication date: 07/03/2024
Field of study

We introduce the Yi model family, a series of language and multimodal models that demonstrate strong multi-dimensional capabilities. The Yi model family is based on 6B and 34B pretrained language models, then we extend them to chat models, 200K long context models, depth-upscaled models, and vision-language models. Our base models achieve strong performance on a wide range of benchmarks like MMLU, and our finetuned chat models deliver strong human preference rate on major evaluation platforms like AlpacaEval and Chatbot Arena. Building upon our scalable super-computing infrastructure and the classical transformer architecture, we attribute the performance of Yi models primarily to its data quality resulting from our data-engineering efforts. For pretraining, we construct 3.1 trillion tokens of English and Chinese corpora using a cascaded data deduplication and quality filtering pipeline. For finetuning, we polish a small scale (less than 10K) instruction dataset over multiple iterations such that every single instance has been verified directly by our machine learning engineers. For vision-language, we combine the chat language model with a vision transformer encoder and train the model to align visual representations to the semantic space of the language model. We further extend the context length to 200K through lightweight continual pretraining and demonstrate strong needle-in-a-haystack retrieval performance. We show that extending the depth of the pretrained checkpoint through continual pretraining further improves performance. We believe that given our current results, continuing to scale up model parameters using thoroughly optimized data will lead to even stronger frontier models

arXiv.org e-Print Archive

A note on Cauchy-Lipschitz-Picard theorem

Author: Fengying Li
Shiqing Zhang
Ying Lv
Zonghong Feng
Publication venue: SpringerOpen
Publication date: 01/11/2016
Field of study

Abstract In this note, we try to generalize the classical Cauchy-Lipschitz-Picard theorem on the global existence and uniqueness for the Cauchy initial value problem of the ordinary differential equation with global Lipschitz condition, and we try to weaken the global Lipschitz condition. We can also get the global existence and uniqueness

Springer - Publisher Connector

Directory of Open Access Journals

Reducing Water Sensitivity of Chitosan Biocomposite Films Using Gliadin Particles Made by In Situ Method

Author: Dajian Huang
Qiling Quan
Zhuo Zhang
Zonghong Ma
Publication venue: 'MDPI AG'
Publication date: 01/11/2017
Field of study

In order to sustain rapid expansion in the field of biocomposites, it is necessary to develop novel fillers that are biodegradable, and easy to disperse and obtain. In this work, gliadin particles (GPs) fabricated through an in situ method have been reported as fillers for creating chitosan (CS)-based biocomposite films. In general, the particles tend to agglomerate in the polymer matrix at high loading (approximately >10%) in the biopolymer/particles composites prepared by the traditional solution-blending method. However, the micrographs of biocomposites confirmed that the GPs are well dispersed in the CS matrix in all CS/GPs composites even at a high loading of 30% in this study. It was found that the GPs could improve the mechanical properties of the biocomposites. In addition, the results of moisture uptake and solubility in water of biocomposites showed that water resistance of biocomposites was enhanced by the introduction of GPs. These results suggested that GPs fabricated through an in situ method could be a good candidate for use in biopolymer-based composites

Directory of Open Access Journals