Efficient Document Re-Ranking for Transformers by Precomputing Term Representations
Deep pretrained transformer networks are effective at various ranking tasks, such as question answering and ad-hoc document ranking. However, their computational cost makes them prohibitively expensive in practice. Our proposed approach, called PreTTR (Precomputing Transformer Term Representations), considerably reduces the query-time latency of deep transformer networks (up to a 42x speedup on web document ranking), making these networks more practical to use in a real-time ranking scenario. Specifically, we precompute part of the document term representations at indexing time (without a query), and merge them with the query representation at query time to compute the final ranking score. Because the token representations are large, we also propose an effective approach that reduces the storage requirement by training a compression layer to match attention scores. Our compression technique reduces the required storage by up to 95% and can be applied without substantial degradation in ranking performance.
Comment: Accepted at SIGIR 2020 (long paper)
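To make the precompute-then-merge idea concrete, here is a minimal sketch using a stock HuggingFace BERT split at an arbitrary layer: document tokens pass through the lower layers once at indexing time, and at query time the stored states are concatenated with the query's lower-layer states before running the upper layers jointly. The split point, the toy scoring head, and the omission of PreTTR's position-embedding handling and trained compression layer are all simplifications for illustration, not the authors' released implementation.

```python
# Hedged sketch of the PreTTR idea: precompute document term representations
# with the lower transformer layers, merge with the query at ranking time.
import torch
from transformers import BertModel, BertTokenizerFast

tok = BertTokenizerFast.from_pretrained("bert-base-uncased")
bert = BertModel.from_pretrained("bert-base-uncased").eval()
SPLIT = 6  # assumed split point: layers [0, 6) run separately, [6, 12) jointly

def run_layers(hidden, mask, layers):
    # BertLayer expects an additive attention mask of shape [B, 1, 1, T]
    ext = (1.0 - mask[:, None, None, :].to(hidden.dtype)) * torch.finfo(hidden.dtype).min
    for layer in layers:
        hidden = layer(hidden, attention_mask=ext)[0]
    return hidden

@torch.no_grad()
def precompute_doc(doc_text):
    """Indexing time: run the document through the lower layers, no query."""
    enc = tok(doc_text, return_tensors="pt", truncation=True)
    hidden = bert.embeddings(enc["input_ids"])
    hidden = run_layers(hidden, enc["attention_mask"], bert.encoder.layer[:SPLIT])
    return hidden, enc["attention_mask"]  # these states would be stored in the index

@torch.no_grad()
def score(query_text, doc_hidden, doc_mask):
    """Query time: lower layers on the query alone, upper layers on the merged sequence."""
    enc = tok(query_text, return_tensors="pt")
    q_hidden = bert.embeddings(enc["input_ids"])
    q_hidden = run_layers(q_hidden, enc["attention_mask"], bert.encoder.layer[:SPLIT])
    hidden = torch.cat([q_hidden, doc_hidden], dim=1)           # query tokens + stored doc tokens
    mask = torch.cat([enc["attention_mask"], doc_mask], dim=1)
    hidden = run_layers(hidden, mask, bert.encoder.layer[SPLIT:])
    return hidden[:, 0].mean().item()  # toy relevance score from the [CLS] state
```

Because the lower layers never cross-attend between query and document, the expensive document-side computation happens once per document rather than once per query-document pair, which is where the reported speedup comes from.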
Learning and Transferring IDs Representation in E-commerce
Many machine intelligence techniques are developed in E-commerce, and one of the most essential components is the representation of IDs, including user ID, item ID, product ID, store ID, brand ID, category ID, etc. Classical encoding-based methods (like one-hot encoding) are inefficient: they suffer from sparsity problems due to their high dimensionality, and they cannot reflect the relationships among IDs, whether homogeneous or heterogeneous. In this paper, we propose an embedding-based framework to learn and transfer the representation of IDs. As implicit feedback from users, a tremendous number of item ID sequences can be easily collected from interactive sessions. By jointly using these informative sequences and the structural connections among IDs, all types of IDs can be embedded into one low-dimensional semantic space. Subsequently, the learned representations are utilized and transferred in four scenarios: (i) measuring the similarity between items, (ii) transferring from seen items to unseen items, (iii) transferring across different domains, and (iv) transferring across different tasks. We deploy and evaluate the proposed approach in the Hema App, and the results validate its effectiveness.
Comment: KDD'18, 9 pages
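A minimal sketch of the core idea follows: treating user session item-ID sequences as sentences and training a skip-gram model over them yields one embedding per item ID, which supports the similarity and transfer scenarios above. The session data, item and brand IDs, and the brand-to-items lookup are invented for illustration; the paper additionally embeds attribute IDs (store, brand, category) jointly via their structural links to items, which this sketch approximates by simple averaging.

```python
# Hedged sketch: skip-gram embeddings over item-ID session sequences.
from gensim.models import Word2Vec
import numpy as np

# Each session is an ordered list of item IDs a user interacted with (toy data).
sessions = [
    ["item_1", "item_2", "item_3"],
    ["item_2", "item_3", "item_4"],
    ["item_1", "item_4", "item_5"],
]
model = Word2Vec(sessions, vector_size=32, window=2, min_count=1, sg=1, epochs=50)

# (i) similarity between items: nearest neighbors in the learned space
print(model.wv.most_similar("item_2", topn=2))

# (iii)-style transfer: represent a higher-level ID (e.g. a brand) as the
# average of its items' vectors, so it lives in the same semantic space.
brand_items = {"brand_A": ["item_1", "item_2"], "brand_B": ["item_4", "item_5"]}
brand_vec = {b: np.mean([model.wv[i] for i in items], axis=0)
             for b, items in brand_items.items()}
```

Averaging item vectors is one simple way to place heterogeneous IDs in a shared space; the paper's joint training over structural connections is the more principled version of the same intuition.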
Evaluating Semantic Vectors for Norwegian
In this article, we present two benchmark data sets for evaluating models of semantic word similarity for Norwegian. While such resources are available for English, they did not exist for Norwegian prior to this work. Furthermore, we produce large-coverage semantic vectors trained on the Norwegian Newspaper Corpus using several popular word embedding frameworks. Finally, we demonstrate the usefulness of the created resources by evaluating the performance of different word embedding models on the tasks of analogical reasoning and synonym detection. The benchmark data sets and word embeddings are all made freely available.
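The two evaluation tasks reduce to simple vector arithmetic and nearest-neighbor queries over the trained vectors. Below is a hedged sketch of both; the vector file name, the Norwegian example words, and the candidate lists are placeholders, not the format of the released benchmark files.

```python
# Hedged sketch of the two evaluation tasks over pretrained word vectors.
from gensim.models import KeyedVectors

# Placeholder path; the actual vectors are those released by the authors.
wv = KeyedVectors.load_word2vec_format("norwegian_vectors.bin", binary=True)

# Analogical reasoning: konge - mann + kvinne should rank dronning highest
# ("king" - "man" + "woman" ~ "queen").
pred = wv.most_similar(positive=["konge", "kvinne"], negative=["mann"], topn=1)
correct = pred[0][0] == "dronning"

# Synonym detection: pick the candidate closest to the target word.
target, candidates = "bil", ["kjøretøy", "hus", "fjell"]  # "car": vehicle/house/mountain
best = max(candidates, key=lambda c: wv.similarity(target, c))
```

Accuracy over a benchmark is then just the fraction of analogy questions or synonym items answered correctly in this way.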