25 research outputs found

Feedback-Based Self-Learning in Large-Scale Conversational AI Agents

Author: Ghias Alireza Roshan
Guo Chenlei
Ponnusamy Pragaash
Sarikaya Ruhi
Publication date: 06/11/2019

Today, most large-scale conversational AI agents (e.g. Alexa, Siri, or Google Assistant) are built using manually annotated data to train the different components of the system. Typically, the accuracy of the ML models in these components is improved by manually transcribing and annotating data. As the scope of these systems increases to cover more scenarios and domains, manual annotation to improve the accuracy of these components becomes prohibitively costly and time-consuming. In this paper, we propose a system that leverages user-system interaction feedback signals to automate learning without any manual annotation. Users tend to modify a previous query in hopes of fixing an error in the previous turn to get the right results. These reformulations are often preceded by defective experiences caused by errors in ASR, NLU, ER or the application. In some cases, users may not properly formulate their requests (e.g. providing a partial title of a song), but gleaning across a wider pool of users and sessions reveals the underlying recurrent patterns. Our proposed self-learning system automatically detects the errors, generates reformulations and deploys fixes to the runtime system to correct different types of errors occurring in different components of the system. In particular, we propose leveraging an absorbing Markov Chain model as a collaborative filtering mechanism in a novel attempt to mine these patterns. We show that our approach is highly scalable, and able to learn reformulations that reduce Alexa-user errors by pooling anonymized data across millions of customers. The proposed self-learning system achieves a win/loss ratio of 11.8 and effectively reduces the defect rate by more than 30% on utterance level reformulations in our production A/B tests. To the best of our knowledge, this is the first self-learning large-scale conversational AI system in production. Comment: 8 pages, 2 figures

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Knowledge Distillation from Internal Representations

Author: Aguilar Gustavo
Fan Xing
Guo Chenlei
Ling Yuan
Yao Benjamin
Zhang Yu
Publication date: 16/01/2020

Knowledge distillation is typically conducted by training a small model (the student) to mimic a large and cumbersome model (the teacher). The idea is to compress the knowledge from the teacher by using its output probabilities as soft-labels to optimize the student. However, when the teacher is considerably large, there is no guarantee that the internal knowledge of the teacher will be transferred into the student; even if the student closely matches the soft-labels, its internal representations may be considerably different. This internal mismatch can undermine the generalization capabilities originally intended to be transferred from the teacher to the student. In this paper, we propose to distill the internal representations of a large model such as BERT into a simplified version of it. We formulate two ways to distill such representations and various algorithms to conduct the distillation. We experiment with datasets from the GLUE benchmark and consistently show that adding knowledge distillation from internal representations is a more powerful method than only using soft-label distillation. Comment: To appear in AAAI-202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Impact of geographic diversity on citation of collaborative research

Author: Guo Weisi
Larivière Vincent
Leng Chenlei
Naik Cian
Sugimoto Cassidy R.
Publication venue: 'MIT Press - Journals'
Publication date: 25/04/2022

Diversity in human capital is widely seen as critical to creating holistic and high-quality research, especially in areas that engage with diverse cultures, environments, and challenges. Quantification of diverse academic collaborations and their effect on research quality is lacking, especially at international scale and across different domains. Here, we present the first effort to measure the impact of geographic diversity in coauthorships on the citation of their papers across different academic domains. Our results unequivocally show that geographic coauthor diversity improves paper citation, but very long distance collaborations have variable impact. We also discover “well-trodden” collaboration circles that yield much less impact than similar travel distances. These relationships are observed to exist across different subject areas, but with varying strengths. These findings can help academics identify new opportunities from a diversity perspective, as well as inform funders on areas that require additional mobility support

arXiv.org e-Print Archive

Cranfield CERES

The Function of MoGlk1 in Integration of Glucose and Ammonium Utilization in Magnaporthe oryzae

Author: AJ Foster
Alexander Idnurm
B Moore
CA Michels
Chenlei Hua
D Ahuatzi
DJ Cove
F Moreno
F Rolland
FK Zimmermann
GM Santangelo
GN Harrington
H Panneman
Haifeng Zhang
HF Zhang
IC Oliveira
J Huth
J Sambrook
J Stulke
J Sweigard
JA Diderich
JC de Jong
JC Jang
JD Thompson
JE Wilson
JI Castrillo
K Jernejc
KD Entian
KJ Livak
Lisha Zhang
LS Kraakman
M Flipphi
M Guo
M Johnston
M Yi
NJ Talbot
O Rui
PG Bertram
R Lagos
R Serrano
RA Dean
RA Wilson
RJ Howard
Ruili Lv
S Kumer
S Rossell
S Vaulont
WW Song
Xianying Dou
Xiaobo Zheng
YH Cho
Z Lobo
Zhengguang Zhang
Zhengyi Wang
Zhongqiang Qi
ZY Wang
Publication venue: Public Library of Science
Publication date: 01/01/2011

Hexokinases are conserved proteins functioning in glucose sensing and signaling. The rice blast fungus Magnaporthe oryzae contains several hexokinases, including MoHxk1 (hexokinase) and MoGlk1 (glucokinase), encoded respectively by the MoHXK1 and MoGLK1 genes. The heterologous expression of MoGlk1 and MoHxk1 in Saccharomyces cerevisiae confirmed their conserved functions. Disruption of MoHXK1 resulted in growth reduction in medium containing fructose as the sole carbon source, whereas disruption of MoGLK1 did not cause a similar defect. However, the ΔMoglk1 mutant displayed decreased proton extrusion and a lower biomass in the presence of ammonium, suggesting a decline in the utilization of ammonium. Additionally, the MoGLK1 allele lacking catalytic activity restored growth to the ΔMoglk1 mutant. Moreover, the expression of MoPMA1 encoding a plasma membrane H+-ATPase decreased in the ΔMoglk1 mutant that can be suppressed by glucose and G-6-P. Thus, MoGlk1, but not MoHxk1, regulates ammonium utilization through a mechanism that is independent of its catalytic activity.

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Targeted delivery of DOX by transferrin conjugated DSPE-PEG nanoparticles in leukemia therapy

Author: Chenlei Cai
Deming Xie
Guo L.
Rui Guo
Ruifang Fan
Xiaomeng H.
Yanling Sun
Zhigang Fang
Publication venue: 'Informa UK Limited'

Crossref

Clinicopathological Features and Prognosis of Papillary Thyroid Microcarcinoma for Surgery and Relationships with the BRAFV600E Mutational Status and Expression of Angiogenic Factors

Author: Abiyasi Nanding
Chenlei Shi
Huadong Qin
Jianjun He
Tiefeng Shi
Yichen Lv
Yong Guo
Publication date: 09/12/2016

Objective: To investigate the clinicopathological characteristics of papillary thyroid microcarcinoma (PTMC) for surgery by comparing the difference between PTMC and larger papillary thyroid carcinoma (LPTC). Methods: We analyzed the differences in the clinicopathological characteristics, prognosis, B-type RAF kinase (BRAF)V600E mutational status and expression of angiogenic factors, including pigment epithelium-derived factor (PEDF), Vascular Endothelial Growth Factor (VEGF), and hypoxia-inducible factor alpha subunit (HIF-1α), between PTMC and LPTC by retrospectively reviewing the records of 251 patients with papillary thyroid carcinoma, 169 with PTMC, and 82 with LPTC (diameter >1 cm). Results: There were no significant differences in the gender, age, multifocality, Hashimoto’s thyroiditis, TNM stage, PEDF protein expression, rate of recurrence, or mean follow-up duration between patients with PTMC or LPTC. The prevalence of extrathyroidal invasion (EI), lymph node metastasis (LNM), and BRAF mutation in patients with PTMC was significantly lower than in patients with LPTC. In addition, in PTMC patients with EI and/or LNM and/or positive BRAF (high-risk PTMC patients), the prevalence of extrathyroidal invasion, Hashimoto's disease, lymph node metastasis, tumor TNM stage, PEDF positive protein expression, the rate of recurrent disease, and the mRNA expression of anti-angiogenic factors was almost as high as in patients with larger PTC, but with no significant difference. Conclusions: Extrathyroid invasion, lymph node metastases, and BRAFV600E mutation were the high risk factors of PTMC. PTMC should be considered for the same treatment strategy as LPTC when any of these factors is found. Particularly, PTMC with BRAFV600E gene mutations needed earlier surgical treatment. In addition, the high cell subtype of PTMC with BRAFV600E gene mutation is recommended for total thyroidectomy in primary surgery to reduce the risk of recurrence.

Directory of Open Access Journals

PubMed Central

FigShare