6 research outputs found

    End-to-end Learning for Short Text Expansion

    Full text link
    Effectively making sense of short texts is a critical task for many real world applications such as search engines, social media services, and recommender systems. The task is particularly challenging as a short text contains very sparse information, often too sparse for a machine learning algorithm to pick up useful signals. A common practice for analyzing short text is to first expand it with external information, which is usually harvested from a large collection of longer texts. In literature, short text expansion has been done with all kinds of heuristics. We propose an end-to-end solution that automatically learns how to expand short text to optimize a given learning task. A novel deep memory network is proposed to automatically find relevant information from a collection of longer documents and reformulate the short text through a gating mechanism. Using short text classification as a demonstrating task, we show that the deep memory network significantly outperforms classical text expansion methods with comprehensive experiments on real world data sets.Comment: KDD'201

    Neural Document Expansion with User Feedback

    Full text link
    This paper presents a neural document expansion approach (NeuDEF) that enriches document representations for neural ranking models. NeuDEF harvests expansion terms from queries which lead to clicks on the document and weights these expansion terms with learned attention. It is plugged into a standard neural ranker and learned end-to-end. Experiments on a commercial search log demonstrate that NeuDEF significantly improves the accuracy of state-of-the-art neural rankers and expansion methods on queries with different frequencies. Further studies show the contribution of click queries and learned expansion weights, as well as the influence of document popularity of NeuDEF's effectiveness.Comment: The 2019 ACM SIGIR International Conference on the Theory of Information Retrieva

    Joint Modeling of Local and Global Temporal Dynamics for Multivariate Time Series Forecasting with Missing Values

    Full text link
    Multivariate time series (MTS) forecasting is widely used in various domains, such as meteorology and traffic. Due to limitations on data collection, transmission, and storage, real-world MTS data usually contains missing values, making it infeasible to apply existing MTS forecasting models such as linear regression and recurrent neural networks. Though many efforts have been devoted to this problem, most of them solely rely on local dependencies for imputing missing values, which ignores global temporal dynamics. Local dependencies/patterns would become less useful when the missing ratio is high, or the data have consecutive missing values; while exploring global patterns can alleviate such problems. Thus, jointly modeling local and global temporal dynamics is very promising for MTS forecasting with missing values. However, work in this direction is rather limited. Therefore, we study a novel problem of MTS forecasting with missing values by jointly exploring local and global temporal dynamics. We propose a new framework LGnet, which leverages memory network to explore global patterns given estimations from local perspectives. We further introduce adversarial training to enhance the modeling of global temporal distribution. Experimental results on real-world datasets show the effectiveness of LGnet for MTS forecasting with missing values and its robustness under various missing ratios.Comment: Accepted by AAAI 202

    A Survey of Natural Language Generation

    Full text link
    This paper offers a comprehensive review of the research on Natural Language Generation (NLG) over the past two decades, especially in relation to data-to-text generation and text-to-text generation deep learning methods, as well as new applications of NLG technology. This survey aims to (a) give the latest synthesis of deep learning research on the NLG core tasks, as well as the architectures adopted in the field; (b) detail meticulously and comprehensively various NLG tasks and datasets, and draw attention to the challenges in NLG evaluation, focusing on different evaluation methods and their relationships; (c) highlight some future emphasis and relatively recent research issues that arise due to the increasing synergy between NLG and other artificial intelligence areas, such as computer vision, text and computational creativity.Comment: Accepted by ACM Computing Survey (CSUR) 202

    Klasifikasi Berita Pada Twitter Dengan Menggunakan Metode NaÏŠve Bayes Dan Feature Expansion Berbasis Cosine Similarity

    Get PDF
    Informasi telah menjadi hal yang sangat dibutuhkan di era modern ini, terlebih dengan adanya berbagai media sosial yang mendukung perbaruan informasi. Twitter sebagai salah satu media sosial yang aktif digunakan untuk memperbarui informasi yang tergolong dalam short text atau berita pendek yang memiliki beberapa kesulitan ketika dilakukan klasifikasi, seperti kata yang ambigu, kata yang terdapat dalam data uji tidak pernah muncul dalam data latih dan sebagainya. Penelitian ini dilakukan untuk mengetahui pengaruh penggunaan feature expansion atau penambahan kata pada short text dalam hasil klasifikasi. Sebelum dilakukan klasifikasi, terlebih dahulu data yang akan diujikan ditambahkan dengan daftar kata yang telah dibuat sebelumnya sebagai sumber eksternal atau kamus dengan batasan tertentu yang telah ditetapkan. Batasan ini bertujuan untuk mengetahui nilai batasan minimal yang paling optimal dalam menghasilkan akurasi tertinggi dalam proses klasifikasi. Dalam proses pembuatan sumber eksternal dilakukan proses cosine similarity untuk mencari kedekatan antar kata. Hasil penelitian berupa akurasi yang menunjukkan adanya pengaruh penambahan feature expansion dalam hasil klasifikasi, hasil akurasi sebesar 83% pada klasifikasi tanpa penggunaan feature expansion dan meningkat menjadi 87% pada penggunaan feature expansion dengan nilai threshold 0,9
    corecore