Neural Networks for Information Retrieval

Author: Bahdanau D.
Bordes A.
Goodfellow I.
Hermann K. M.
Hu B.
Kingma D.
Krizhevsky A.
Kusner M. J.
Lin Y.
Lu Z.
Mikolov T.
Robertson S. E.
Srivastava N.
Sutskever I.
Vinyals O.
Weston J.
Publication date: 01/01/2017

Machine learning plays a role in many aspects of modern IR systems, and deep learning is applied in all of them. The fast pace of modern-day research has given rise to many different approaches for many different IR problems. The amount of information available can be overwhelming both for junior students and for experienced researchers looking for new research topics and directions. Additionally, it is interesting to see what key insights into IR problems the new technologies are able to give us. The aim of this full-day tutorial is to give a clear overview of current tried-and-trusted neural methods in IR and how they benefit IR research. It covers key architectures, as well as the most promising future directions. Comment: Overview of full-day tutorial at SIGIR 201

arXiv.org e-Print Archive

Crossref

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Informative sample generation using class aware generative adversarial networks for classification of chest Xrays

Author: Bozorgtabar Behzad
Ebner Lukas
Mahapatra Dwarikanath
Pollinger Alexander
Reyes Mauricio
Thiran Jean-Phillipe
von Teng Hendrik
Publication date: 01/01/2019

Training robust deep learning (DL) systems for disease detection from medical images is challenging due to limited images covering different disease types and severity. The problem is especially acute where there is severe class imbalance. We propose an active learning (AL) framework to select the most informative samples for training our model using a Bayesian neural network. Informative samples are then used within a novel class aware generative adversarial network (CAGAN) to generate realistic chest X-ray images for data augmentation by transferring characteristics from one class label to another. Experiments show our proposed AL framework is able to achieve state-of-the-art performance by using about 35% of the full dataset, thus saving significant time and effort over conventional methods.
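The sample-selection step described in this abstract can be illustrated with a minimal sketch. The abstract does not say which informativeness measure the authors use, so the example assumes predictive entropy over several stochastic forward passes of a Bayesian network; the function names and the toy probabilities are hypothetical.

```python
import math


def predictive_entropy(mc_probs):
    """Entropy of the mean class distribution over stochastic forward passes.

    mc_probs is a list of per-pass class-probability lists for ONE sample.
    """
    n_passes = len(mc_probs)
    n_classes = len(mc_probs[0])
    mean = [sum(p[c] for p in mc_probs) / n_passes for c in range(n_classes)]
    return -sum(p * math.log(p) for p in mean if p > 0)


def select_most_informative(pool, k):
    """Return indices of the k pool samples with the highest predictive entropy."""
    ranked = sorted(
        range(len(pool)),
        key=lambda i: predictive_entropy(pool[i]),
        reverse=True,
    )
    return ranked[:k]


# Toy pool (assumed values): two forward passes per sample, two classes.
pool = [
    [[0.99, 0.01], [0.98, 0.02]],  # model is confident
    [[0.55, 0.45], [0.40, 0.60]],  # model is uncertain
    [[0.80, 0.20], [0.70, 0.30]],  # in between
]
selected = select_most_informative(pool, 2)
```

In an active learning loop of this kind, the selected samples would be labeled and added to the training set before the next round; here they would also feed the generative augmentation step the abstract mentions.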

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Serveur académique lausannois

Bern Open Repository and Information System (BORIS)

Neural Ranking Models with Weak Supervision

Author: Bing Lidong
Bromley Jane
Diaz Fernando
Han Xianpei
Hoffmann Raphael
Huang Po-Sen
Kingma Diederik
Lu Zhengdong
Quoc
Severyn Aliaksei
Shen Yelong
Wauthier Fabian L.
Zamani Hamed
Publication date: 01/01/2017

Despite the impressive improvements achieved by unsupervised deep neural networks in computer vision and NLP tasks, such improvements have not yet been observed in ranking for information retrieval. The reason may be the complexity of the ranking problem, as it is not obvious how to learn from queries and documents when no supervised signal is available. Hence, in this paper, we propose to train a neural ranking model using weak supervision, where labels are obtained automatically without human annotators or any external resources (e.g., click data). To this aim, we use the output of an unsupervised ranking model, such as BM25, as a weak supervision signal. We further train a set of simple yet effective ranking models based on feed-forward neural networks. We study their effectiveness under various learning scenarios (point-wise and pair-wise models) and using different input representations (i.e., from encoding query-document pairs into dense/sparse vectors to using word embedding representation). We train our networks using tens of millions of training instances and evaluate them on two standard collections: a homogeneous news collection (Robust) and a heterogeneous large-scale web collection (ClueWeb). Our experiments indicate that employing proper objective functions and letting the networks learn the input representation based on weakly supervised data leads to impressive performance, with over 13% and 35% MAP improvements over the BM25 model on the Robust and the ClueWeb collections, respectively. Our findings also suggest that supervised neural ranking models can greatly benefit from pre-training on large amounts of weakly labeled data that can be easily obtained from unsupervised IR models. Comment: In proceedings of The 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR2017)
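The weak-supervision idea in this abstract, using BM25 output as training labels for a pair-wise ranker, can be sketched as follows. This is an illustration rather than the authors' pipeline: the BM25 variant, the parameter values `k1` and `b`, the function names, and the toy documents are all assumptions.

```python
import math
from collections import Counter
from itertools import combinations


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document against a tokenized query with BM25."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def pairwise_weak_labels(query, docs):
    """Build (i, j, label) pairs; label is 1 if BM25 ranks doc i above doc j."""
    scores = bm25_scores(query, docs)
    pairs = []
    for i, j in combinations(range(len(docs)), 2):
        if scores[i] == scores[j]:
            continue  # ties carry no preference signal
        pairs.append((i, j, 1 if scores[i] > scores[j] else 0))
    return pairs


# Toy collection (assumed), already tokenized.
docs = [
    "neural ranking models for retrieval".split(),
    "weather report for tomorrow".split(),
    "ranking documents with neural networks".split(),
]
query = "neural ranking".split()
pairs = pairwise_weak_labels(query, docs)
```

Each resulting pair could serve as one training instance for a pair-wise neural ranker; no human relevance judgments are involved at any point.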

arXiv.org e-Print Archive

Crossref

UvA-DARE

International Migration, Integration and Social Cohesion online publications