Search CORE

35,583 research outputs found

Semi-Supervised Learning for Neural Machine Translation

Author: Cheng Yong
He Wei
He Zhongjun
Liu Yang
Sun Maosong
Wu Hua
Xu Wei
Publication venue
Publication date: 01/01/2016
Field of study

While end-to-end neural machine translation (NMT) has made remarkable progress recently, NMT systems only rely on parallel corpora for parameter estimation. Since parallel corpora are usually limited in quantity, quality, and coverage, especially for low-resource languages, it is appealing to exploit monolingual corpora to improve NMT. We propose a semi-supervised approach for training NMT models on the concatenation of labeled (parallel corpora) and unlabeled (monolingual corpora) data. The central idea is to reconstruct the monolingual corpora using an autoencoder, in which the source-to-target and target-to-source translation models serve as the encoder and decoder, respectively. Our approach can not only exploit the monolingual corpora of the target language, but also of the source language. Experiments on the Chinese-English dataset show that our approach achieves significant improvements over state-of-the-art SMT and NMT systems.Comment: Corrected a typ

arXiv.org e-Print Archive

Crossref

A review of parallel computing for large-scale remote sensing image mosaicking

Author: Chen Lajiao
He Jijun
Jie Wei
Liu Peng
Ma Yan
Wei Jingbo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/04/2015
Field of study

Interest in image mosaicking has been spurred by a wide variety of research and management needs. However, for large-scale applications, remote sensing image mosaicking usually requires significant computational capabilities. Several studies have attempted to apply parallel computing to improve image mosaicking algorithms and to speed up calculation process. The state of the art of this field has not yet been summarized, which is, however, essential for a better understanding and for further research of image mosaicking parallelism on a large scale. This paper provides a perspective on the current state of image mosaicking parallelization for large scale applications. We firstly introduce the motivation of image mosaicking parallel for large scale application, and analyze the difficulty and problem of parallel image mosaicking at large scale such as scheduling with huge number of dependent tasks, programming with multiple-step procedure, dealing with frequent I/O operation. Then we summarize the existing studies of parallel computing in image mosaicking for large scale applications with respect to problem decomposition and parallel strategy, parallel architecture, task schedule strategy and implementation of image mosaicking parallelization. Finally, the key problems and future potential research directions for image mosaicking are addressed

Crossref

UWL Repository

Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics

Author: Bao Linchao
He Shengfeng
Jiao Jianbo
Liu Wei
Liu Yunhui
Wang Jiangliu
Publication venue
Publication date: 01/01/2019
Field of study

We address the problem of video representation learning without human-annotated labels. While previous efforts address the problem by designing novel self-supervised tasks using video data, the learned features are merely on a frame-by-frame basis, which are not applicable to many video analytic tasks where spatio-temporal features are prevailing. In this paper we propose a novel self-supervised approach to learn spatio-temporal features for video representation. Inspired by the success of two-stream approaches in video classification, we propose to learn visual features by regressing both motion and appearance statistics along spatial and temporal dimensions, given only the input video data. Specifically, we extract statistical concepts (fast-motion region and the corresponding dominant direction, spatio-temporal color diversity, dominant color, etc.) from simple patterns in both spatial and temporal domains. Unlike prior puzzles that are even hard for humans to solve, the proposed approach is consistent with human inherent visual habits and therefore easy to answer. We conduct extensive experiments with C3D to validate the effectiveness of our proposed approach. The experiments show that our approach can significantly improve the performance of C3D when applied to video classification tasks. Code is available at https://github.com/laura-wang/video_repres_mas.Comment: CVPR 201

arXiv.org e-Print Archive

Crossref

University of Birmingham Research Portal

Institutional Knowledge at Singapore Management University

A Game-theoretic Machine Learning Approach for Revenue Maximization in Sponsored Search

Author: Chen Wei
He Di
Liu Tie-Yan
Wang Liwei
Publication venue
Publication date: 04/06/2014
Field of study

Sponsored search is an important monetization channel for search engines, in which an auction mechanism is used to select the ads shown to users and determine the prices charged from advertisers. There have been several pieces of work in the literature that investigate how to design an auction mechanism in order to optimize the revenue of the search engine. However, due to some unrealistic assumptions used, the practical values of these studies are not very clear. In this paper, we propose a novel \emph{game-theoretic machine learning} approach, which naturally combines machine learning and game theory, and learns the auction mechanism using a bilevel optimization framework. In particular, we first learn a Markov model from historical data to describe how advertisers change their bids in response to an auction mechanism, and then for any given auction mechanism, we use the learnt model to predict its corresponding future bid sequences. Next we learn the auction mechanism through empirical revenue maximization on the predicted bid sequences. We show that the empirical revenue will converge when the prediction period approaches infinity, and a Genetic Programming algorithm can effectively optimize this empirical revenue. Our experiments indicate that the proposed approach is able to produce a much more effective auction mechanism than several baselines.Comment: Twenty-third International Conference on Artificial Intelligence (IJCAI 2013

arXiv.org e-Print Archive

CiteSeerX

Efficiency and power of minimally nonlinear irreversible heat engines with broken time-reversal symmetry

Author: He Jizhou
Li Wei
Liu Qin
Wang Jianhui
Zhang Min
Publication venue: 'MDPI AG'
Publication date: 19/02/2018
Field of study

We study the minimally nonlinear irreversible heat engines in which the time-reversal symmetry for the systems may b e broken. The expressions for the power and the efficiency are derived, in which the effects of the nonlinear terms due to dissipations are included. We show that, as within the linear responses, the minimally nonlinear irreversible heat engines enable attainment of Carnot efficiency at positive power. We also find that the Curzon-Ahlborn limit imposed on the efficiency at maximum power can be overcomed if the time-reversal symmetry is broken

arXiv.org e-Print Archive

Multidisciplinary Digital Publishing Institute