Search CORE

28 research outputs found

Exploiting Prompt Caption for Video Grounding

Author: Cao Meng
Cheng Xuxin
Li Hongxiang
Li Yaowei
Zhu Zhihong
Zou Yuexian
Publication venue
Publication date: 28/03/2023
Field of study

Video grounding aims to locate a moment of interest matching the given query sentence from an untrimmed video. Previous works ignore the \emph{sparsity dilemma} in video annotations, which fails to provide the context information between potential events and query sentences in the dataset. In this paper, we contend that exploiting easily available captions which describe general actions \ie, prompt captions (PC) defined in our paper, will significantly boost the performance. To this end, we propose a Prompt Caption Network (PCNet) for video grounding. Specifically, we first introduce dense video captioning to generate dense captions and then obtain prompt captions by Non-Prompt Caption Suppression (NPCS). To capture the potential information in prompt captions, we propose Caption Guided Attention (CGA) project the semantic relations between prompt captions and query sentences into temporal space and fuse them into visual representations. Considering the gap between prompt captions and ground truth, we propose Asymmetric Cross-modal Contrastive Learning (ACCL) for constructing more negative pairs to maximize cross-modal mutual information. Without bells and whistles, extensive experiments on three public datasets (\ie, ActivityNet Captions, TACoS and ActivityNet-CG) demonstrate that our method significantly outperforms state-of-the-art methods

arXiv.org e-Print Archive

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory

Author: Cao Meng
Cheng Xuxin
Li Hongxiang
Li Yaowei
Zhu Zhihong
Zou Yuexian
Publication venue
Publication date: 26/07/2023
Field of study

The recent video grounding works attempt to introduce vanilla contrastive learning into video grounding. However, we claim that this naive solution is suboptimal. Contrastive learning requires two key properties: (1) \emph{alignment} of features of similar samples, and (2) \emph{uniformity} of the induced distribution of the normalized features on the hypersphere. Due to two annoying issues in video grounding: (1) the co-existence of some visual entities in both ground truth and other moments, \ie semantic overlapping; (2) only a few moments in the video are annotated, \ie sparse annotation dilemma, vanilla contrastive learning is unable to model the correlations between temporally distant moments and learned inconsistent video representations. Both characteristics lead to vanilla contrastive learning being unsuitable for video grounding. In this paper, we introduce Geodesic and Game Localization (G2L), a semantically aligned and uniform video grounding framework via geodesic and game theory. We quantify the correlations among moments leveraging the geodesic distance that guides the model to learn the correct cross-modal representations. Furthermore, from the novel perspective of game theory, we propose semantic Shapley interaction based on geodesic distance sampling to learn fine-grained semantic alignment in similar moments. Experiments on three benchmarks demonstrate the effectiveness of our method.Comment: ICCV202

arXiv.org e-Print Archive

ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding

Author: Cao Bowen
Cheng Xuxin
Li Hongxiang
Ye Qichen
Zhu Zhihong
Zou Yuexian
Publication venue
Publication date: 19/11/2023
Field of study

Spoken language understanding (SLU) is a fundamental task in the task-oriented dialogue systems. However, the inevitable errors from automatic speech recognition (ASR) usually impair the understanding performance and lead to error propagation. Although there are some attempts to address this problem through contrastive learning, they (1) treat clean manual transcripts and ASR transcripts equally without discrimination in fine-tuning; (2) neglect the fact that the semantically similar pairs are still pushed away when applying contrastive learning; (3) suffer from the problem of Kullback-Leibler (KL) vanishing. In this paper, we propose Mutual Learning and Large-Margin Contrastive Learning (ML-LMCL), a novel framework for improving ASR robustness in SLU. Specifically, in fine-tuning, we apply mutual learning and train two SLU models on the manual transcripts and the ASR transcripts, respectively, aiming to iteratively share knowledge between these two models. We also introduce a distance polarization regularizer to avoid pushing away the intra-cluster pairs as much as possible. Moreover, we use a cyclical annealing schedule to mitigate KL vanishing issue. Experiments on three datasets show that ML-LMCL outperforms existing models and achieves new state-of-the-art performance

arXiv.org e-Print Archive

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation

Author: Cheng Xuxin
Li Hongxiang
Li Yaowei
Yang Bang
Zhu Zhihong
Zou Yuexian
Publication venue
Publication date: 29/03/2023
Field of study

Automatic radiology report generation has attracted enormous research interest due to its practical value in reducing the workload of radiologists. However, simultaneously establishing global correspondences between the image (e.g., Chest X-ray) and its related report and local alignments between image patches and keywords remains challenging. To this end, we propose an Unify, Align and then Refine (UAR) approach to learn multi-level cross-modal alignments and introduce three novel modules: Latent Space Unifier (LSU), Cross-modal Representation Aligner (CRA) and Text-to-Image Refiner (TIR). Specifically, LSU unifies multimodal data into discrete tokens, making it flexible to learn common knowledge among modalities with a shared network. The modality-agnostic CRA learns discriminative features via a set of orthonormal basis and a dual-gate mechanism first and then globally aligns visual and textual representations under a triplet contrastive loss. TIR boosts token-level local alignment via calibrating text-to-image attention with a learnable mask. Additionally, we design a two-stage training procedure to make UAR gradually grasp cross-modal alignments at different levels, which imitates radiologists' workflow: writing sentence by sentence first and then checking word by word. Extensive experiments and analyses on IU-Xray and MIMIC-CXR benchmark datasets demonstrate the superiority of our UAR against varied state-of-the-art methods.Comment: 8 pages,6 figures,4 table

arXiv.org e-Print Archive

Processing of nanostructured polymers and advanced polymeric based nanocomposites

Author: Ab Rahman
Abitbol
Abu-Sharkh
Acharya
Adhikari
Adhikari
Adhikari
Afzal
Agrawal
Ahn
Ajayan
Al-Saleh
Alamusi
Alateyah
Albdiry
Albuerne
Alexandre
Alzari
Alzari
Andrews
Armentano
Askari
Auad
Auvergne
Azeez
Azeredo
Azizi Samir
Bae
Bae
Bahloul
Bahramian
Balazs
Banaszak
Bandyopadhyay
Baney
Barcikowski
Bartholomai
Baskaran
Bates
Bates
Bates
Bauhofer
Beardsley
Beardsley
Becheri
Beck-Candanedo
Becker
Becker
Bekyarova
Bell
Bendahou
Benjaminn
Bennett
Bhatnagar
Bikiaris
Billingham
Billingham
Bittmann
Bittolo Bon
Bittolo Bon
Bittolo Bon
Bittolo Bon
Bockstaller
Boday
Bondeson
Bonduel
Bourbigot
Bourlinos
Bourlinos
Bronstein
Brostow
Bryning
Builes
Builes
Bunch
Butchosa
Buzdugan
Cai
Cai
Cakmak
Camino
Cao
Cao
Cao
Capadona
Capadona
Capretti
Cardinali
Chaim
Chakoli
Chandrasekaran
Chang
Chang
Chang
Charlier
Chen
Chen
Chen
Chen
Chen
Chen
Chen
Chen
Chen
Cheng
Chiacchiarelli
Chiacchiarelli
Chin
Cho
Choi
Choi
Choi
Chou
Chowdhury
Chung
Cioffi
Cohn
Coleman
Coleman
Colson
Compton
Cote
Crooks
Curl
Dallas
Dallas
Das
Datsyuk
Datta
Davidson
Davis
De Menezes
Dean
Debora Puglia
Dennis
Dennis
Dervaux
Dikin
Dixon
Dong
Dove
Dresselhaus
Du
Dubois
Dubois
Dufresne
Dufresne
Durkop
Eda
Eda
Edens
Elliniadis
Enotiadis
Ermanni
Esawi
Fang
Fang
Favier
Felten
Fermeglia
Ferrari
Fiedler
Fina
Fine
Finnigan
Finnigan
Fischer
Fleury
Flory
Floudas
Fornes
Fortunati
Fortunati
Fortunati
Fortunati
Fortunati
Fortunati
Fowler
Fragouli
Frielinghaus
Frielinghaus
Frømyr
Fu
Gangopadhyay
Ganguli
Ganguly
Ganss
Gao
Gao
Garate
Garces
Garcia
Garcia
Garnweitner
Geim
Geim
Gemma
Georgakilas
George
Giacomelli
Giannelis
Gilberto
Gilje
Gilman
Gilmore
Gojny
Gojny
Gomoll
Gong
Gorga
Gorrasi
Gou
Goussé
Green
Groenendaal
Grubbs
Grubbs
Grunert
Gua
Guo
Guo
Guo
Guo
Gutierrez
Gutierrez
Gutierrez
Ha
Ha
Habibi
Habibi
Habibi
Hadjichristidis
Haldane
Hameed
Hamley
Hamley
Hamley
Hamley
Han
Han
Han
Hanley
Haque
Harrison
Hasani
Hasegawa
Hasegawa
Hasell
Hashimoto
Hatori
Haupt
Helfand
Helfand
Helfand
Henriksson
Hernandez
Hernandez
Hernández
Heux
Hild
Hill
Hillmyer
Hillmyer
Hillmyer
Hirasawa
Hirata
Ho
Holden
Holden
Hong
Hongxiang
Hore
Horechyy
Horiuchi
Horsch
Hossain
Houdayer
Hsiao
Hsiue
Hu
Hu
Hua
Huang
Huang
Huanga
Hubbe
Hugouvieux
Huh
Hussain
Hussain
Iguchi
Iijima
Ishida
Islam
Iwahori
Jang
Javadi
Jeong
Jiankun
Jimenez
Jonoobi
Joshi
José M. Kenny
Kahraman
Kaiser
Kalaitzidou
Kalia
Kan
Kandola
Kane
Kardar
Karwa
Kashiwagi
Kashiwagi
Kashiwagi
Kashiwagi
Kato
Kato
Katsnelson
Kawasumi
Kaya
Kchit
Khabashesku
Khare
Khattab
Khokhlov
Kiliaris
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kim
Kishore
Koizumi
Koizumi
Koo
Koo
Kornmann
Kornmann
Kornmann
Kroto
Kruis
Kumar
Kuo
Kvien
Laborie
Lagaly
Lan
Lan
Lan
Lange
Laoutid
Larranaga
Laura Peponi
Lavoine
Lazzari
LeBaron
Leblanc
Lee
Lee
Lee
Lee
Lee
Lee
Lee
Leibler
Lekakou
Lelli
Lentz
Lerf
Levchik
Levchik
Levitt
Levy
Levy
Lewandowski
Lewin
Li
Li
Li
Li
Li
Li
Li
Liang
Lin
Lin
Lin
Lin
Linn
Lipic
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Livi
Ljungberg
Ljungberg
Lodge
Lodge
Lodge
Lodge
Longo
Lonkar
Lopez-Rodrıguez
Lotya
Lu
Luca Valentini
Luigi Torre
Lönnberg
Ma
Ma
Ma
Mahaling
Mai
Mak
Malik
Mallick
Malmsten
Malucelli
Manias
Manias
Marcovich
Mariani
Mariani
Martinez-Veracoechea
Martinez-Veracoechea
Martone
Martínez-Hernández
Marzouk
Masoodi
Mathew
Matos Ruiz
Matsen
Matsen
Matsen
Matsen
Matsumoto
Matsuo
Matsuo
Matsuo
Matsuo
Matyjaszewski
Mc Clory
McAllister
McNally
McNally
McNally
Mecerreyes
Meguid
Meijer
Meng
Messersmith
Messori
Mi
Miaudet
Min
Min
Missoum
Mittal
Mittal
Miyagawa
Mo
Monteiro Cordeiro de Azeredo
Monti
Monti
Monti
Monti
Monticelli
Moon
Morandi
Morgan
Mueller
Mädler
Nakagaito
Nakagaito
Nakashima
Natali
Natali
Ngece
Nguyen
Nielsen
Niu
Njuguna
Nogi
Norkhairunnisa
Novak
Novoselov
Nuvoli
Nuvoli
Ocando
Ocando
Ocando
Ogata
Ogata
Ogata
Oh
Oke
Okubo
Oshima
Osman
Osman
Osuna
Oueiny
O’Connell
O’Mullane
Pachfule
Padalkar
Panaitescu
Pappas
Park
Park
Park
Park
Park
Parkin
Pascault
Patel
Paul
Paul
Pauling
Pavlidou
Pehrsson
Peng
Peponi
Peponi
Peponi
Peponi
Peponi
Peponi
Peponi
Peponi
Peponi
Peponi
Peponi
Peponi
Perineau
Petersson
Pham
Pham
Pielichowski
Plank
Poologasundarampillai
Potts
Potts
Potts
Pranger
Pranger
Pratsinis
Puglia
Pääkkö
Qi
Qi
Rafiee
Raghu
Ragosta
Rahatekar
Rallini
Ramanathan
Rao
Raravikar
Ratner
Raveendran
Ray
Raza
Rebouillat
Reddy
Rehab
Reverchon
Riley
Ritzenthaler
Ritzenthaler
Ritzhaupt-Kleissl
Rizis
Roman
Romeo
Roohani
Rueda
Ruiz
Ruiz-Pérez
Ryan
Saavedra
Sabri
Sadasivuni
Sahoo
Saito
Saito
Sajti
Salavagione
Samir
Sanchez
Sandler
Sandler
Sangermano
Saralegi
Sassi
Saxena
Schartel
Schmidt
Schmolka
Schniepp
Scwarc
Sene
Sengupta
Seppala
Seregina
Serrano
Serrano
Serrano
Serrano
Serrano
Shartel
Shchukin
She
Shen
Shibata
Shul’ga
Siddiqui
Sih
Sim
Simoes
Singh
Sinha Ray
Sinturel
Siqueira
Siqueira
Siro
Siró
Skaltsas
Smalley
Snow
Sofo
Solomon
Song
Song
Sorrentino
Spector
Spindler-Ranta
Spitalsky
Spoljaric
Srikant
Sriraman
Stamatopoulou
Stankovich
Stankovich
Stankovich
Stankovich
Stanley
Stepaneka
Steurer
Stevens
Strong
Su
Subrahmanyam
Sun
Suntivich
Swihart
Szleifer
Takagi
Tamburrano
Tan
Tanaka
Tang
Tao
Tate
Ten
Tercjak
Terenzi
Terrones
Thakkar
Thompson
Thompson
Thostenson
Thostenson
Thostenson
Thostenson
Thostenson
Tibiletti
Tien
Tien
Tiitua
Tingaut
Titelman
Tkalya
Tomasko
Torre
Torre
Tortora
Toshiaki Enoki
Troitzsch
Tseng
Tsimpliaraki
Tudor
Turcova
Uddin
Uddin
Usuki
Utracki
Vaia
Vaia
Vaia
Vaia
Valentini
Valentini
Valentini
Valentini
Valentini
Valentini
Vallé
Van den Berg
Varlot
Vazquez
Veca
Veedu
Velusamy
Vendamme
Vennerberg
Viswanathan
Vollath
Wakabayashi
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Wang
Watcharotone
Weng
Wetzel
Wetzel
Wichmann
Wichmann
Widawski
Wik
Williams
Wilson
Wintmire
Wissert
Woloszczuk
Wu
Wu
Wu
Wu
Wu
Wu
Xiao
Xie
Xie
Xie
Xu
Xu
Xu
Xue
Yamaguchi
Yan
Yang
Yang
Yang
Yao
Yasmin
Yeltik
Yildirim
Yokozeki
Young
Yurekli
Zammarano
Zanetti
Zanetti
Zerda
Zhang
Zhang
Zhang
Zhang
Zhang
Zhang
Zhao
Zhao
Zhao
Zhao
Zhao
Zhao
Zheng
Zhou
Zhou
Zhou
Zhou
Zhu
Zhu
Zhu
Zhu
Zhu
Zipfel
Zou
Zou
Zou
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Effect on Treatment of the Landfill Leachate with the Furrow Irrigation in Onland Planting Reed (<i>Phragmites</i>)

Author: Hongxiang Cai
Kun Shi
Ming Zou
Publication venue: 'Scientific Research Publishing, Inc.'
Publication date: 01/01/2012
Field of study

Crossref

Low-order mixed finite element analysis of progressive failure in pressure-dependent materials within the framework of the Cosserat continuum

Author: Degao Zou
Hongxiang Tang
Xue Zhang
Yuhui Guan
Publication venue: 'Emerald'
Publication date
Field of study

Crossref

Transition-Layer Implantation for Improving Magnetoelectric Response in Co-fired Laminated Composite

Author: Bo Qin
Hongxiang Zou
Lianwen Deng
Sheng Liu
Sihua Liao
Publication venue: 'MDPI AG'
Publication date: 01/02/2023
Field of study

Magnetoelectric (ME) laminated composites with strong ME coupling are becoming increasingly prevalent in the electron device field. In this paper, an enhancement of the ME coupling effect via transition-layer implantation for co-fired lead-free laminated composite (80Bi0.5Na0.5TiO3-20Bi0.5K0.5TiO3)/(Ni0.8Zn0.2)Fe2O4 (BNKT/NZFO) was demonstrated. A transition layer composed of particulate ME composite 0.5BNKT-0.5NZFO was introduced between the BNKT piezoelectric layer and the NZFO magnetostrictive layer, effectively connecting the two-phase interface and strengthening interface stress transfer. In particular, an optimal ME voltage coefficients (αME) of 144 mV/(cm·Oe) at 1 kHz and 1.05 V/(cm·Oe) at the resonant frequency in the composite was achieved, with a layer thickness ratio (BNKT:0.5BNKT-0.5NZFO:NZFO) of 3:1:6. The static elastic model was used to determine strong interface coupling. A large magnetodielectric (MD) response of 3.95% was found under a magnetic field excitation of 4 kOe. These results demonstrate that transition-layer implantation provides a new path to enhance the ME response in co-fired laminated composite, which can play an important role in developing magnetic field-tuned electronic devices

Directory of Open Access Journals

Self‐biased magnetoelectric composite for energy harvesting

Author: Hongxiang Zou
Kexiang Wei
Lianwen Deng
Linchuan Zhao
Sheng Liu
Sihua Liao
Publication venue: Wiley
Publication date: 01/09/2023
Field of study

Abstract The wireless sensor network energy supply technology for the Internet of things has progressed substantially, but attempts to provide sustainable and environmentally friendly energy for sensor networks remain limited and considerably cumbersome for practical application. Energy harvesting devices based on the magnetoelectric (ME) coupling effect have promising prospects in the field of self‐powered devices due to their advantages of small size, fast response, and low power consumption. Driven by application requirements, the development of composite with a self‐biased magnetoelectric (SME) coupling effect provides effective strategies for the miniaturized and high‐precision design of energy harvesting devices. This review summarizes the work mechanism, research status, characteristics, and structures of SME composites, with emphasis on the application and development of SME devices for vibration and magnetic energy harvesting. The main challenges and future development directions for the design and implementation of energy harvesting devices based on the SME effect are presented

Directory of Open Access Journals