Search CORE

6 research outputs found

End-to-end Learning for Short Text Expansion

Author: Mei Qiaozhu
Tang Jian
Wang Yue
Zheng Kai
Publication venue
Publication date: 30/08/2017
Field of study

Effectively making sense of short texts is a critical task for many real world applications such as search engines, social media services, and recommender systems. The task is particularly challenging as a short text contains very sparse information, often too sparse for a machine learning algorithm to pick up useful signals. A common practice for analyzing short text is to first expand it with external information, which is usually harvested from a large collection of longer texts. In literature, short text expansion has been done with all kinds of heuristics. We propose an end-to-end solution that automatically learns how to expand short text to optimize a given learning task. A novel deep memory network is proposed to automatically find relevant information from a collection of longer documents and reformulate the short text through a gating mechanism. Using short text classification as a demonstrating task, we show that the deep memory network significantly outperforms classical text expansion methods with comprehensive experiments on real world data sets.Comment: KDD'201

arXiv.org e-Print Archive

Crossref

Neural Document Expansion with User Feedback

Author: Liu Zhiyuan
Luo Cheng
Xiong Chenyan
Yin Yue
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 08/08/2019
Field of study

This paper presents a neural document expansion approach (NeuDEF) that enriches document representations for neural ranking models. NeuDEF harvests expansion terms from queries which lead to clicks on the document and weights these expansion terms with learned attention. It is plugged into a standard neural ranker and learned end-to-end. Experiments on a commercial search log demonstrate that NeuDEF significantly improves the accuracy of state-of-the-art neural rankers and expansion methods on queries with different frequencies. Further studies show the contribution of click queries and learned expansion weights, as well as the influence of document popularity of NeuDEF's effectiveness.Comment: The 2019 ACM SIGIR International Conference on the Theory of Information Retrieva

arXiv.org e-Print Archive

Crossref

Joint Modeling of Local and Global Temporal Dynamics for Multivariate Time Series Forecasting with Missing Values

Author: Aggarwal Charu
Mitra Prasenjit
Sun Yiwei
Tang Xianfeng
Wang Suhang
Yao Huaxiu
Publication venue
Publication date: 22/11/2019
Field of study

Multivariate time series (MTS) forecasting is widely used in various domains, such as meteorology and traffic. Due to limitations on data collection, transmission, and storage, real-world MTS data usually contains missing values, making it infeasible to apply existing MTS forecasting models such as linear regression and recurrent neural networks. Though many efforts have been devoted to this problem, most of them solely rely on local dependencies for imputing missing values, which ignores global temporal dynamics. Local dependencies/patterns would become less useful when the missing ratio is high, or the data have consecutive missing values; while exploring global patterns can alleviate such problems. Thus, jointly modeling local and global temporal dynamics is very promising for MTS forecasting with missing values. However, work in this direction is rather limited. Therefore, we study a novel problem of MTS forecasting with missing values by jointly exploring local and global temporal dynamics. We propose a new framework LGnet, which leverages memory network to explore global patterns given estimations from local perspectives. We further introduce adversarial training to enhance the modeling of global temporal distribution. Experimental results on real-world datasets show the effectiveness of LGnet for MTS forecasting with missing values and its robustness under various missing ratios.Comment: Accepted by AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

A Survey of Natural Language Generation

Author: Chen Miaoxin
Dong Chenhe
Gong Haifan
Li Junxin
Li Yinghui
Shen Ying
Yang Min
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 02/08/2022
Field of study

This paper offers a comprehensive review of the research on Natural Language Generation (NLG) over the past two decades, especially in relation to data-to-text generation and text-to-text generation deep learning methods, as well as new applications of NLG technology. This survey aims to (a) give the latest synthesis of deep learning research on the NLG core tasks, as well as the architectures adopted in the field; (b) detail meticulously and comprehensively various NLG tasks and datasets, and draw attention to the challenges in NLG evaluation, focusing on different evaluation methods and their relationships; (c) highlight some future emphasis and relatively recent research issues that arise due to the increasing synergy between NLG and other artificial intelligence areas, such as computer vision, text and computational creativity.Comment: Accepted by ACM Computing Survey (CSUR) 202

arXiv.org e-Print Archive

Klasifikasi Berita Pada Twitter Dengan Menggunakan Metode Naϊve Bayes Dan Feature Expansion Berbasis Cosine Similarity

Author: Febriana Resti
Publication venue
Publication date: 30/06/2018
Field of study

Informasi telah menjadi hal yang sangat dibutuhkan di era modern ini, terlebih dengan adanya berbagai media sosial yang mendukung perbaruan informasi. Twitter sebagai salah satu media sosial yang aktif digunakan untuk memperbarui informasi yang tergolong dalam short text atau berita pendek yang memiliki beberapa kesulitan ketika dilakukan klasifikasi, seperti kata yang ambigu, kata yang terdapat dalam data uji tidak pernah muncul dalam data latih dan sebagainya. Penelitian ini dilakukan untuk mengetahui pengaruh penggunaan feature expansion atau penambahan kata pada short text dalam hasil klasifikasi. Sebelum dilakukan klasifikasi, terlebih dahulu data yang akan diujikan ditambahkan dengan daftar kata yang telah dibuat sebelumnya sebagai sumber eksternal atau kamus dengan batasan tertentu yang telah ditetapkan. Batasan ini bertujuan untuk mengetahui nilai batasan minimal yang paling optimal dalam menghasilkan akurasi tertinggi dalam proses klasifikasi. Dalam proses pembuatan sumber eksternal dilakukan proses cosine similarity untuk mencari kedekatan antar kata. Hasil penelitian berupa akurasi yang menunjukkan adanya pengaruh penambahan feature expansion dalam hasil klasifikasi, hasil akurasi sebesar 83% pada klasifikasi tanpa penggunaan feature expansion dan meningkat menjadi 87% pada penggunaan feature expansion dengan nilai threshold 0,9

bkg