Search CORE

51 research outputs found

Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning

Author: Banerjee Satanjeev
Chen Chunyang
Feng Ruitao
Juan
Kingma Diederik P
Papineni Kishore
Park Kyudong
Spearman Ch
Wilcoxon Frank
Yan Shunguo
Zhang Xiaoyi
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 02/07/2020
Field of study

According to the World Health Organization(WHO), it is estimated that approximately 1.3 billion people live with some forms of vision impairment globally, of whom 36 million are blind. Due to their disability, engaging these minority into the society is a challenging problem. The recent rise of smart mobile phones provides a new solution by enabling blind users' convenient access to the information and service for understanding the world. Users with vision impairment can adopt the screen reader embedded in the mobile operating systems to read the content of each screen within the app, and use gestures to interact with the phone. However, the prerequisite of using screen readers is that developers have to add natural-language labels to the image-based components when they are developing the app. Unfortunately, more than 77% apps have issues of missing labels, according to our analysis of 10,408 Android apps. Most of these issues are caused by developers' lack of awareness and knowledge in considering the minority. And even if developers want to add the labels to UI components, they may not come up with concise and clear description as most of them are of no visual issues. To overcome these challenges, we develop a deep-learning based model, called LabelDroid, to automatically predict the labels of image-based buttons by learning from large-scale commercial apps in Google Play. The experimental results show that our model can make accurate predictions and the generated labels are of higher quality than that from real Android developers.Comment: Accepted to 42nd International Conference on Software Engineerin

arXiv.org e-Print Archive

Crossref

Discovery of Novel Term Associations in a Document Collection

Author: G. Salton
H.P. Luhn
I. Petrič
Jim Cowie
K. Spärck Jones
M. Segond
M.F. Porter
R.L. Cilibrasi
S. Deerwester
Satanjeev Banerjee
T. Kötter
Publication venue: Springer-Verlag
Publication date: 01/01/2012
Field of study

Non peer reviewe

Crossref

Springer - Publisher Connector

Helsingin yliopiston digitaalinen arkisto

Unsupervised Paraphrasing via Deep Reinforcement Learning

Author: Banerjee Satanjeev
Barzilay Regina
Dolan William B
Fader Anthony
Gupta Ankush
Heafield Kenneth
Hovy Eduard H
Knight Kevin
Papineni Kishore
Ross Stéphane
Silver David
Snover Matthew
Zhao Shiqi
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 05/07/2020
Field of study

Paraphrasing is expressing the meaning of an input sentence in different wording while maintaining fluency (i.e., grammatical and syntactical correctness). Most existing work on paraphrasing use supervised models that are limited to specific domains (e.g., image captions). Such models can neither be straightforwardly transferred to other domains nor generalize well, and creating labeled training data for new domains is expensive and laborious. The need for paraphrasing across different domains and the scarcity of labeled training data in many such domains call for exploring unsupervised paraphrase generation methods. We propose Progressive Unsupervised Paraphrasing (PUP): a novel unsupervised paraphrase generation method based on deep reinforcement learning (DRL). PUP uses a variational autoencoder (trained using a non-parallel corpus) to generate a seed paraphrase that warm-starts the DRL model. Then, PUP progressively tunes the seed paraphrase guided by our novel reward function which combines semantic adequacy, language fluency, and expression diversity measures to quantify the quality of the generated paraphrases in each iteration without needing parallel sentences. Our extensive experimental evaluation shows that PUP outperforms unsupervised state-of-the-art paraphrasing techniques in terms of both automatic metrics and user studies on four real datasets. We also show that PUP outperforms domain-adapted supervised algorithms on several datasets. Our evaluation also shows that PUP achieves a great trade-off between semantic similarity and diversity of expression

arXiv.org e-Print Archive

Crossref

Poet: Product-oriented Video Captioner for E-commerce

Author: Banerjee Satanjeev
Das Pradipto
David
Kipf Thomas N
Kipf Thomas N
Lin Chin-Yew
Liu Jingyuan
Liu Ziwei
Lu Jiasen
Papineni Kishore
Regneri Michaela
Sigurdsson Gunnar A.
Speer Robyn
Wang Bairui
Weston Jason
Whitehead Spencer
Yao Li
Zeng Kuo-Hao
Zhang Junchao
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 16/08/2020
Field of study

In e-commerce, a growing number of user-generated videos are used for product promotion. How to generate video descriptions that narrate the user-preferred product characteristics depicted in the video is vital for successful promoting. Traditional video captioning methods, which focus on routinely describing what exists and happens in a video, are not amenable for product-oriented video captioning. To address this problem, we propose a product-oriented video captioner framework, abbreviated as Poet. Poet firstly represents the videos as product-oriented spatial-temporal graphs. Then, based on the aspects of the video-associated product, we perform knowledge-enhanced spatial-temporal inference on those graphs for capturing the dynamic change of fine-grained product-part characteristics. The knowledge leveraging module in Poet differs from the traditional design by performing knowledge filtering and dynamic memory modeling. We show that Poet achieves consistent performance improvement over previous methods concerning generation quality, product aspects capturing, and lexical diversity. Experiments are performed on two product-oriented video captioning datasets, buyer-generated fashion video dataset (BFVD) and fan-generated fashion video dataset (FFVD), collected from Mobile Taobao. We will release the desensitized datasets to promote further investigations on both video captioning and general video analysis problems.Comment: 10 pages, 3 figures, to appear in ACM MM 2020 proceeding

arXiv.org e-Print Archive

Crossref

Leveraging Code Generation to Improve Code Retrieval and Summarization via Dual Learning

Author: Allamanis Miltiadis
Allamanis Miltiadis
Alon Uri
Banerjee Satanjeev
Caruana A.
Di He Wei Chen Yuanzhi Li
He Di
Lu Meili
Moreno Laura
Movshovitz-Attias Dana
Papineni Kishore
Su Shang-Yu
Voorhees M.
Wang Yijun
Xia Yingce
Yao Ziyu
Ye Hai
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 25/02/2020
Field of study

Code summarization generates brief natural language description given a source code snippet, while code retrieval fetches relevant source code given a natural language query. Since both tasks aim to model the association between natural language and programming language, recent studies have combined these two tasks to improve their performance. However, researchers have yet been able to effectively leverage the intrinsic connection between the two tasks as they train these tasks in a separate or pipeline manner, which means their performance can not be well balanced. In this paper, we propose a novel end-to-end model for the two tasks by introducing an additional code generation task. More specifically, we explicitly exploit the probabilistic correlation between code summarization and code generation with dual learning, and utilize the two encoders for code summarization and code generation to train the code retrieval task via multi-task learning. We have carried out extensive experiments on an existing dataset of SQL and Python, and results show that our model can significantly improve the results of the code retrieval task over the-state-of-art models, as well as achieve competitive performance in terms of BLEU score for the code summarization task.Comment: Published at The Web Conference (WWW) 2020, full pape

arXiv.org e-Print Archive

Crossref

Taking MT evaluation metrics to extremes : beyond correlation with human judgments

Author: Bahdanau Dzmitry
Baig Taimur
Banerjee Satanjeev
Berentsen Geir Drage
Bojar Ondřej
Callison-Burch Chris
Callison-Burch Chris
Coughlin Deborah
Culy Christopher
Denkowski Michael
Fomicheva Marina
Giménez Jesús
Graham Yvette
Hjort Nils Lid
Junczys-Dowmunt Marcin
Levene Howard
Liu Ding
Lucia Specia
Marina Fomicheva
Moore Robert C.
Nießen Sonja
Papineni Kishore
Snover Matthew
Specia Lucia
Specia Lucia
Specia Lucia
Specia Lucia
Sutskever Ilya
Tillmann Christoph
Williams Evan James
Publication venue: 'MIT Press - Journals'
Publication date: 12/06/2019
Field of study

Automatic Machine Translation (MT) evaluation is an active field of research, with a handful of new metrics devised every year. Evaluation metrics are generally benchmarked against manual assessment of translation quality, with performance measured in terms of overall correlation with human scores. Much work has been dedicated to the improvement of evaluation metrics to achieve a higher correlation with human judgments. However, little insight has been provided regarding the weaknesses and strengths of existing approaches and their behavior in different settings. In this work we conduct a broad meta-evaluation study of the performance of a wide range of evaluation metrics focusing on three major aspects. First, we analyze the performance of the metrics when faced with different levels of translation quality, proposing a local dependency measure as an alternative to the standard, global correlation coefficient. We show that metric performance varies significantly across different levels of MT quality: Metrics perform poorly when faced with low-quality translations and are not able to capture nuanced quality distinctions. Interestingly, we show that evaluating low-quality translations is also more challenging for humans. Second, we show that metrics are more reliable when evaluating neural MT than the traditional statistical MT systems. Finally, we show that the difference in the evaluation accuracy for different metrics is maintained even if the gold standard scores are based on different criteria

Crossref

Spiral - Imperial College Digital Repository

White Rose Research Online