Search CORE

5,111 research outputs found

Stochastic Query Covering for Fast Approximate Document Retrieval

Author: Anagnostopoulos Aristidis
Becchetti Luca
Ida Mele
Ilaria Bordino
Leonardi Stefano
Piotr Sankowski
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2015
Field of study

We design algorithms that, given a collection of documents and a distribution over user queries, return a small subset of the document collection in such a way that we can efficiently provide high-quality answers to user queries using only the selected subset. This approach has applications when space is a constraint or when the query-processing time increases significantly with the size of the collection. We study our algorithms through the lens of stochastic analysis and prove that even though they use only a small fraction of the entire collection, they can provide answers to most user queries, achieving a performance close to the optimal. To complement our theoretical findings, we experimentally show the versatility of our approach by considering two important cases in the context of Web search. In the first case, we favor the retrieval of documents that are relevant to the query, whereas in the second case we aim for document diversification. Both the theoretical and the experimental analysis provide strong evidence of the potential value of query covering in diverse application scenarios

Archivio della ricerca- Università di Roma La Sapienza

MPG.PuRe

Query Understanding in the Age of Large Language Models

Author: Anand Abhijit
Anand Avishek
Setty Vinay
V Venktesh
Publication venue
Publication date: 28/06/2023
Field of study

Querying, conversing, and controlling search and information-seeking interfaces using natural language are fast becoming ubiquitous with the rise and adoption of large-language models (LLM). In this position paper, we describe a generic framework for interactive query-rewriting using LLMs. Our proposal aims to unfold new opportunities for improved and transparent intent understanding while building high-performance retrieval systems using LLMs. A key aspect of our framework is the ability of the rewriter to fully specify the machine intent by the search engine in natural language that can be further refined, controlled, and edited before the final retrieval phase. The ability to present, interact, and reason over the underlying machine intent in natural language has profound implications on transparency, ranking performance, and a departure from the traditional way in which supervised signals were collected for understanding intents. We detail the concept, backed by initial experiments, along with open questions for this interactive query understanding framework.Comment: Accepted to GENIR(SIGIR'23

arXiv.org e-Print Archive

Introduction to the special issue on search as learning

Author: Eickhoff C. (Carsten)
Gwizdka J. (Jacek)
Hauff C. (Claudia)
He J. (Jiyin)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2017
Field of study

CWI's Institutional Repository

Explicit diversification of event aspects for temporal summarization

Author: Macdonald Craig
McCreadie Richard
Ounis Iadh
Santos Rodrygo L.T.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 02/02/2018
Field of study

During major events, such as emergencies and disasters, a large volume of information is reported on newswire and social media platforms. Temporal summarization (TS) approaches are used to automatically produce concise overviews of such events by extracting text snippets from related articles over time. Current TS approaches rely on a combination of event relevance and textual novelty for snippet selection. However, for events that span multiple days, textual novelty is often a poor criterion for selecting snippets, since many snippets are textually unique but are semantically redundant or non-informative. In this article, we propose a framework for the diversification of snippets using explicit event aspects, building on recent works in search result diversification. In particular, we first propose two techniques to identify explicit aspects that a user might want to see covered in a summary for different types of event. We then extend a state-of-the-art explicit diversification framework to maximize the coverage of these aspects when selecting summary snippets for unseen events. Through experimentation over the TREC TS 2013, 2014, and 2015 datasets, we show that explicit diversification for temporal summarization significantly outperforms classical novelty-based diversification, as the use of explicit event aspects reduces the amount of redundant and off-topic snippets returned, while also increasing summary timeliness

Crossref

Enlighten

Strategic flexibility, rigidity and barriers to the development of absorptive capacity in business markets: Themes and research perspectives.

Author: Matthyssens Paul
Pauwels Pieter
Vandenbempt Koen
Publication venue
Publication date
Field of study

Research Papers in Economics

A Survey on Automatically Mining Facets for Web Queries

Author: M. Lomte Vina
Pawar Duhita
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/12/2017
Field of study

In this paper, a detailed survey on different facet mining techniques, their advantages and disadvantages is carried out. Facets are any word or phrase which summarize an important aspect about the web query. Researchers proposed different efficient techniques which improves the user’s web query search experiences magnificently. Users are happy when they find the relevant information to their query in the top results. The objectives of their research are: (1) To present automated solution to derive the query facets by analyzing the text query; (2) To create taxonomy of query refinement strategies for efficient results; and (3) To personalize search according to user interest

IAES journal

Crossref

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Institute of Advanced Engineering and Science

Fusion-based Methods for result diversification in web search

Author: Crestani Fabio
Huang Chunlan
Li Liang
Wu Shengli
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

Crossref

Ulster University's Research Portal

DIR 2011: Dutch_Belgian Information Retrieval Workshop Amsterdam

Author: Boscarino C.
de Rijke M.
Hofmann K.
Jijkoun V.
Meij E.
Weerkamp W.
Publication venue: University of Amsterdam, Information and Language Processing group
Publication date: 01/01/2011
Field of study

International Migration, Integration and Social Cohesion online publications