Search CORE

212 research outputs found

AWTE-BERT: Attending to Wordpiece Tokenization Explicitly on BERT for Joint Intent Classification and Slot Filling

Author: Chen Xingyan
Guo Yu
Wang Leilei
Wu Gang
Xie Zhilong
Zhao Yu
Publication date: 29/11/2022

Intent classification and slot filling are two core tasks in natural language understanding (NLU). Because the two tasks interact, joint models often outperform single-task designs. One promising solution, BERT (Bidirectional Encoder Representations from Transformers), jointly optimizes the two tasks. BERT uses wordpiece tokenization to split each input token into multiple sub-tokens, which causes a mismatch between the lengths of the token and label sequences. Previous methods use the hidden state of the first sub-token as input to the classifier, which limits performance because some hidden semantic information is discarded during fine-tuning. To address this issue, we propose a novel joint model based on BERT that explicitly models the features of the multiple sub-tokens produced by wordpiece tokenization, thereby generating context features that contribute to slot filling. Specifically, we encode the hidden states corresponding to multiple sub-tokens into a context vector via the attention mechanism. Then, we feed each context vector into the slot filling encoder, which preserves the integrity of the sentence. Experimental results demonstrate that our proposed model achieves significant improvement on intent classification accuracy, slot filling F1, and sentence-level semantic frame accuracy on two public benchmark datasets. In particular, the slot filling F1 score improves from 96.1 to 98.2 (2.1% absolute) on the ATIS dataset.

arXiv.org e-Print Archive
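
The sub-token attention step described in the AWTE-BERT abstract above can be illustrated with a minimal sketch. The abstract says only that the hidden states of a word's sub-tokens are encoded into one context vector via attention; the dot-product scoring against a `query` vector, the function name `attention_pool`, and the toy values below are all assumptions made for illustration, not the authors' implementation.

```python
import math


def attention_pool(sub_token_states, query):
    """Collapse the hidden states of one word's sub-tokens into one context vector.

    Assumption: scores are dot products with a query vector. The abstract
    does not specify the scoring function.
    """
    # One scalar score per sub-token.
    scores = [sum(h * q for h, q in zip(state, query)) for state in sub_token_states]
    # Numerically stable softmax over the sub-tokens of this word.
    max_score = max(scores)
    exp_scores = [math.exp(s - max_score) for s in scores]
    total = sum(exp_scores)
    weights = [e / total for e in exp_scores]
    # Weighted sum of the sub-token states, dimension by dimension.
    dim = len(sub_token_states[0])
    context = [
        sum(w * state[d] for w, state in zip(weights, sub_token_states))
        for d in range(dim)
    ]
    return context, weights


# Toy example: a word split into three sub-tokens with 2-dimensional states.
states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [0.0, 0.0]  # a zero query gives uniform attention weights
context, weights = attention_pool(states, query)
```

Each word then contributes exactly one vector, so the sequence passed to the slot classifier has the same length as the label sequence, unlike first-sub-token selection, which discards the remaining states.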

Taming Gradient Variance in Federated Learning with Networked Control Variates

Author: Chen Xingyan
Du Huaming
Liu Yaling
Wang Mu
Zhao Yu
Publication date: 26/10/2023

Federated learning, a decentralized approach to machine learning, faces significant challenges such as extensive communication overheads, slow convergence, and unstable improvements. These challenges primarily stem from the gradient variance due to heterogeneous client data distributions. To address this, we introduce a novel Networked Control Variates (FedNCV) framework for Federated Learning. We adopt the REINFORCE Leave-One-Out (RLOO) as a fundamental control variate unit in the FedNCV framework, implemented at both client and server levels. At the client level, the RLOO control variate is employed to optimize local gradient updates, mitigating the variance introduced by data samples. Once relayed to the server, the RLOO-based estimator further provides an unbiased and low-variance aggregated gradient, leading to robust global updates. This dual-side application is formalized as a linear combination of composite control variates. We provide a mathematical expression capturing this integration of double control variates within FedNCV and present three theoretical results with corresponding proofs. This unique dual structure equips FedNCV to address data heterogeneity and scalability issues, thus potentially paving the way for large-scale applications. Moreover, we tested FedNCV on six diverse datasets under a Dirichlet distribution with α = 0.1, and benchmarked its performance against six SOTA methods, demonstrating its superiority. Comment: 14 pages

arXiv.org e-Print Archive
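
The leave-one-out idea behind the RLOO control variate named in the FedNCV abstract above can be sketched generically: each sample is centered by the mean of the other samples, so the baseline never depends on the sample it corrects. This is only the textbook leave-one-out baseline on scalar values; the function names are hypothetical, and the client/server composite formulation of FedNCV is not reproduced here.

```python
def rloo_baselines(values):
    """Return, for each value, the mean of all the *other* values."""
    k = len(values)
    if k < 2:
        raise ValueError("leave-one-out needs at least two samples")
    total = sum(values)
    return [(total - v) / (k - 1) for v in values]


def rloo_centered(values):
    """Subtract each sample's leave-one-out baseline from the sample itself."""
    return [v - b for v, b in zip(values, rloo_baselines(values))]


# Toy example with three scalar samples (illustrative numbers only).
samples = [1.0, 2.0, 3.0]
baselines = rloo_baselines(samples)  # [2.5, 2.0, 1.5]
centered = rloo_centered(samples)    # [-1.5, 0.0, 1.5]
```

Because each baseline is computed without its own sample, subtracting it leaves the expected value of the estimator unchanged while reducing variance, which is the property the abstract relies on at both the client and the server.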

Graph Learning and Its Applications: A Holistic Survey

Author: Chen Xingyan
Kou Gang
Li Qing
Liu Ji
Wei Shaopeng
Zhao Yu
Zhuang Fuzhen
Publication date: 11/03/2023

Graph learning is a prevalent domain that endeavors to learn the intricate relationships among nodes and the topological structure of graphs. These relationships endow graphs with uniqueness compared to conventional tabular data, as nodes rely on non-Euclidean space and encompass rich information to exploit. Over the years, graph learning has progressed from graph theory to graph data mining. With the advent of representation learning, it has attained remarkable performance in diverse scenarios, including text, image, chemistry, and biology. Owing to its extensive application prospects, graph learning attracts copious attention from the academic community. Despite numerous works proposed to tackle different problems in graph learning, there is a demand to survey previous valuable works. While some researchers have perceived this phenomenon and accomplished impressive surveys on graph learning, they failed to connect related objectives, methods, and applications in a more coherent way. As a result, they did not cover the many current scenarios and challenging problems that have arisen from the rapid expansion of graph learning. Different from previous surveys on graph learning, we provide a holistic review that analyzes current works from the perspective of graph structure, and discusses the latest applications, trends, and challenges in graph learning. Specifically, we commence by proposing a taxonomy from the perspective of the composition of graph data and then summarize the methods employed in graph learning. We then provide a detailed elucidation of mainstream applications. Finally, based on the current trend of techniques, we propose future directions. Comment: 20 pages, 7 figures, 3 tables

arXiv.org e-Print Archive

Mortality predicting models for patients with infective endocarditis: a machine learning approach

Author: Li Haolin
Liu Xingyan
Qi Wang
Wang Shouhong
Wei Xuebiao
Yu Danqing
Zi-yang Yang
Publication venue: Springer Nature
Publication date: 01/07/2025

Background: Infective endocarditis (IE) is a fatal cardiovascular disease with varied clinical manifestations but rapid progression. A series of existing risk models helped identify IE patients with high risk, but the imperfect predictive performance and limited application called for better predictive systems. Methods: The single-center, retrospective observational study applied four machine learning methods for predictive model construction: LASSO logistic regression, random forest (RF), support vector machine (SVM), and k-nearest neighbors (KNN). A 10-fold cross-validated area under the receiver operating characteristic curve (AUC-ROC) was used for performance evaluation. Results: A total of 1705 patients with IE were enrolled in the study, with 119 in-hospital deaths and 178 deaths after 6-month follow-up. RF achieved the highest AUC-ROCs for in-hospital and six-month mortality prediction (in-hospital: 0.83, 6-month: 0.85). RF was also applied to assess variable importance. The following variables were selected by RF as the most important predictors for both in-hospital and six-month mortality prediction: total bilirubin, N-terminal pro-B-type natriuretic peptide, albumin, diastolic blood pressure, fasting blood glucose, uric acid, and age. Conclusions: A risk model based on a machine learning approach was constructed for prognosis prediction in IE patients, supporting rapid risk stratification and timely clinical management. Clinical trial number: Not applicable.

Carolina Digital Repository

Theoretical analysis of a regenerative supercritical carbon dioxide Brayton cycle/organic Rankine cycle dual loop for waste heat recovery of a diesel/natural gas dual-fuel engine

Author: Ban Zhibo
Bian Xingyan
Liang Youcai
Pan Mingzhang
Qian Weiwei
Yu Zhibin
Publication venue: Elsevier BV
Publication date: 01/10/2019

The supercritical carbon dioxide Brayton cycle is considered one of the most promising systems for waste heat recovery of engines because of its compactness and high energy efficiency. To further improve the fuel utilization ratio and solve the difficulties of waste heat recovery of high temperature exhaust gas, a regenerative supercritical carbon dioxide Brayton cycle/organic Rankine cycle dual loop is proposed for cascade utilization of exhaust heat from a dual-fuel engine. The regenerative supercritical carbon dioxide Brayton cycle of the proposed system is powered by the waste heat contained in the exhaust gas. The working fluid in the organic Rankine cycle is pre-heated by CO2 exiting the regenerator and then further heated by the residual heat of the exhaust gas. The flow rates of the working fluids in both sub-cycles are adjusted so that the waste heat recovery system can respond to the changing operating conditions of the dual-fuel engine. The results revealed that the maximum net power output of this system is up to 40.88 kW, thus improving the dual-fuel engine power output by 6.78%. Therefore, such a regenerative supercritical carbon dioxide Brayton cycle/organic Rankine cycle dual loop system design enables the thorough recovery of high temperature exhaust heat, leading to higher energy efficiency and lower fuel consumption of the engine.

Enlighten

Highly-stable, flexible delivery of microjoule-level ultrafast pulses in vacuumized anti-resonant hollow-core fibers for active synchronization

Author: Huang Zhiyuan
Jiang Xin
Leng Yuxin
Li Hongyang
Liu Donghan
Liu Xingyan
Luo Zhuozhao
Pan Jinyu
Pang Meng
Song Liwei
Wang Xinliang
Yan Chuanchuan
Yang Fei
Yin Ruochen
Yu Haihu
Zheng Yu
Publication venue: The Optical Society
Publication date: 01/02/2023

We demonstrate the stable and flexible light delivery of multi-μJ, sub-200-fs pulses over a ~10-m-long vacuumized anti-resonant hollow-core fiber (AR-HCF), which was successfully used for high-performance pulse synchronization. Compared with the pulse train launched into the AR-HCF, the transmitted pulse train out of the fiber exhibits excellent stability in pulse power and spectrum, with pointing stability largely improved. The walk-off between the fiber-delivery and the other free-space-propagation pulse trains, in an open loop, was measured to be <6 fs root-mean-square (RMS) over 90 minutes, corresponding to a relative optical-path variation of <2 × 10⁻⁷. This walk-off can be further suppressed to ~2 fs RMS simply by using an active control loop, highlighting the great application potential of this AR-HCF set-up in large-scale laser and accelerator facilities.

arXiv.org e-Print Archive
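
The relation between the timing walk-off and the relative optical-path variation quoted in the AR-HCF abstract above can be checked with a few lines of arithmetic. This sketch assumes a fiber length of exactly 10 m (the abstract says ~10 m) and a group index of 1 for the evacuated hollow core; both values and the function name are assumptions for illustration.

```python
SPEED_OF_LIGHT = 299_792_458.0  # metres per second, in vacuum


def relative_path_variation(walk_off_seconds, fiber_length_m, group_index=1.0):
    """Convert a timing walk-off into a fractional optical-path change.

    Assumption: the optical path is group_index * fiber_length_m, with a
    group index of 1.0 for an evacuated hollow-core fiber.
    """
    path_change_m = SPEED_OF_LIGHT * walk_off_seconds  # optical-path change
    optical_path_m = group_index * fiber_length_m       # total optical path
    return path_change_m / optical_path_m


# 6 fs RMS walk-off over an assumed 10 m fiber.
ratio = relative_path_variation(6e-15, 10.0)
print(f"{ratio:.2e}")
```

Under these assumptions the ratio comes out at roughly 1.8 × 10⁻⁷, consistent with the bound of <2 × 10⁻⁷ stated in the abstract.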

Effect of Chitosan Coating with Different Molecular Weights on the Storage Quality of Postharvest Passion Fruit (Passiflora edulis Sims)

Author: Hanmei LIU
Lijin HUANG
Shoukui TAO
Xingyan LI
Yezhen CHEN
Yonggui PAN
Yue JIANG
Zhiqian YU
Publication venue: The editorial department of Science and Technology of Food Industry
Publication date: 01/11/2023

To study the preservation effect of chitosan coating with different molecular weights on postharvest passion fruit, the "Qinmi No.9" cultivar was coated with chitosan of molecular weights of 30, 50, 100, 150 and 200 kDa (1.5%, w/v) to determine the quality of passion fruit during storage. The results showed that chitosan coating with different molecular weights was able to delay the shrinkage and yellowing, reduce the weight loss rate and inhibit the decay of passion fruit. Moreover, chitosan with a larger molecular weight was more conducive to delaying the ripening and senescence of passion fruit, as well as reducing shrinkage and decay. At the end of storage, the weight loss of fruits coated with 200 kDa chitosan was nearly 10% less than that of fruits coated with 30 kDa chitosan, and the fruits coated with 150 and 200 kDa chitosan did not decay. The lower molecular weight (30 and 50 kDa) and higher molecular weight (150 kDa) chitosan were more effective in inhibiting weight loss, total soluble solids and soluble sugar metabolism, and maintaining titratable acid, flavonoid and total phenol contents of fruit during storage. The 150 kDa chitosan had the best effect in maintaining the vitamin C content, which was 1.12 times higher than the control group at the end of storage. In conclusion, chitosan of different molecular weights effectively delayed senescence, slowed water loss and shrinkage of passion fruit, and maintained quality; 150 kDa chitosan was more suitable for maintaining the quality of postharvest passion fruit.

Directory of Open Access Journals

Genetic diversity fuels gene discovery for tobacco and alcohol use

Author: Addison Clifton
Akiyama Masato
Albert Christine M.
Aliev Fazil
Alonso Alvaro
Arnett Donna K.
Ashley-Koch Allison E.
Ashrani Aneel A.
Barnes Kathleen C.
Barr R. Graham
Bartz Traci M.
Batini Chiara
Becker Diane M.
Benjamin Emelia J.
Bergen Andrew W.
Bielak Lawrence F.
Bierut Laura J.
Bis Joshua C.
Bjornsdottir Gyda
Blangero John
Bleecker Eugene R.
Boardman Jason D.
Boerwinkle Eric
Boomsma Dorret I.
Boorgula Meher Preethi
Bowden Donald W.
Brody Jennifer A.
Cade Brian E.
Chasman Daniel I.
Chavan Sameer
Chen Fang
Chen Yii-Der Ida
Chen Zhengming
Cheng Iona
Cho Michael H.
Choquet Hélène
Cole John W.
Cornelis Marilyn C.
Cucca Francesco
Curran Joanne E.
David Sean P.
de Andrade Mariza
Dick Danielle M.
Docherty Anna R.
Duggirala Ravindranath
Eaton Charles B.
Ehringer Marissa A.
Esko Tõnu
Faul Jessica D.
Fiorillo Edoardo
Fornage Myriam
Freedman Barry I.
Gabrielsen Maiken E.
Gagliano Taliun Sarah A.
Gao Shuang
Garrett Melanie E.
Gharib Sina A.
Gieger Christian
Gillespie Nathan
Glahn David C.
Gordon Scott D.
Gu Charles C.
Gu Dongfeng
Gudbjartsson Daniel F.
Guo Xiuqing
Haessler Jeffrey
Hall Michael E.
Haller Toomas
Hancock Dana B.
Harris Kathleen Mullan
He Jiang
Herd Pamela
Hewitt John K.
Hickie Ian
Hidalgo Bertha
Hokanson John E.
Hopfer Christian
Hottenga JoukeJan
Hou Lifang
Huang Hongyan
Hung Yi-Jen
Hunter David J.
Hveem Kristian
Hwang Shih-Jen
Hwu Chii-Min
Iacono William
Irvin Marguerite R.
Jang Seon-Kyeong
Jee Yon Ho
Jiang Bibo
Jiang Yu
Johnson Eric O.
Joo Yoonjung Y.
Jorgenson Eric
Justice Anne E.
Kamatani Yoichiro
Kaplan Robert C.
Kaprio Jaakko
Kardia Sharon L. R.
Keller Matthew C.
Kelly Tanika N.
Khunsriraksakul Chachrit
Kooperberg Charles
Korhonen Tellervo
Kraft Peter
Krauter Kenneth
Kuusisto Johanna
Laakso Markku
Lasky-Su Jessica
Lee James J.
Lee Wen-Jane
Levy Daniel
Li Kevin
Li Liming
Li Yuqing
Lin Kuang
Lind Penelope A.
Liu Chunyu
Liu Dajiang J.
Liu Mengzhen
Lloyd-Jones Donald M.
Lutz Sharon M.
Ma Jiantao
Manichaikul Ani
Martin Nicholas G.
Mathur Ravi
Matoba Nana
McArdle Patrick F.
McGue Matt
McQueen Matthew B.
Medland Sarah E.
Metspalu Andres
Meyers Deborah A.
Millwood Iona Y.
Mitchell Braxton D.
Mohlke Karen L.
Moll Matthew
Montasser May E.
Morrison Alanna C.
Mulas Antonella
Munafò Marcus R.
Mägi Reedik
Nielsen Jonas B.
North Kari E.
Oelsner Elizabeth C.
Okada Yukinori
Orrù Valeria
Otto Jacqueline M.
Palmer Nicholette D.
Palviainen Teemu
Pandit Anita
Park S. Lani
Peters Annette
Peters Ulrike
Peyser Patricia A.
Polderman Tinca J. C.
Rafaels Nicholas
Redline Susan
Reed Robert M.
Reiner Alex P.
Rice John P.
Rich Stephen S.
Richmond Nicole E.
Roan Carol
Rotter Jerome I.
Rueschman Michael N.
Runarsdottir Valgerdur
Saccone Nancy L.
Saunders Gretchen R. B.
Schwartz David A.
Shadyab Aladdin H.
Shi Jingchunzi
Shringarpure Suyash S.
Sicinski Kamil
Silva Lilian Fernandes
Skogholt Anne Heidi
Smith Jennifer A.
Smith Nicholas L.
Sotoodehnia Nona
Stallings Michael C.
Stefansson Hreinn
Stefansson Kari
Stitzel Jerry A.
Sun Xiao
Syed Moin
Tal-Singer Ruth
Taylor Amy E.
Taylor Kent D.
Telen Marilyn J.
Thai Khanh K.
Thorgeirsson Thorgeir E.
Tiwari Hemant
Turman Constance
Tyrfingsson Thorarinn
Vrieze Scott
Wall Tamara L.
Walters Robin G.
Wang Chen
Wang Xingyan
Weir David R.
Weiss Scott T.
White Wendy B.
Whitfield John B.
Wiggins Kerri L.
Willemsen Gonneke
Willer Cristen J.
Winsvold Bendik S.
Xu Huichun
Yanek Lisa R.
Yin Jie
Young Kendra A.
Young Kristin L.
Yu Bing
Zhao Wei
Zhou Wei
Zuccolo Luisa
Zöllner Sebastian
Publication date: 01/01/2022

Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury (1-4). These substances are used across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries (5). Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction. Peer reviewed

Crossref

DukeSpace (Duke Univ.)

Open Access LMU ( Ludwig-Maximilians-Univ. München)

PubMed Central

Queensland University of Technology ePrints Archive

Helsingin yliopiston digitaalinen arkisto

DigitalCommons@The Texas Medical Center