Search CORE

125 research outputs found

On crowdsourcing relevance magnitudes for information retrieval evaluation

Author: Maddalena E
Mizzaro S
Scholer F
Turpin A
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

4siMagnitude estimation is a psychophysical scaling technique for the measurement of sensation, where observers assign numbers to stimuli in response to their perceived intensity. We investigate the use of magnitude estimation for judging the relevance of documents for information retrieval evaluation, carrying out a large-scale user study across 18 TREC topics and collecting over 50,000 magnitude estimation judgments using crowdsourcing. Our analysis shows that magnitude estimation judgments can be reliably collected using crowdsourcing, are competitive in terms of assessor cost, and are, on average, rank-aligned with ordinal judgments made by expert relevance assessors. We explore the application of magnitude estimation for IR evaluation, calibrating two gain-based effectiveness metrics, nDCG and ERR, directly from user-reported perceptions of relevance. A comparison of TREC system effectiveness rankings based on binary, ordinal, and magnitude estimation relevance shows substantial variation; in particular, the top systems ranked using magnitude estimation and ordinal judgments differ substantially. Analysis of the magnitude estimation scores shows that this effect is due in part to varying perceptions of relevance: different users have different perceptions of the impact of relative differences in document relevance. These results have direct implications for IR evaluation, suggesting that current assumptions about a single view of relevance being sufficient to represent a population of users are unlikely to hold.partially_openopenMaddalena, Eddy; Mizzaro, Stefano; Scholer, Falk; Turpin, AndrewMaddalena, Eddy; Mizzaro, Stefano; Scholer, Falk; Turpin, Andre

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Udine

RMIT Research Repository

Multidimensional news quality: A comparison of crowdsourcing and nichesourcing

Author: Ceolin D. (Davide)
Maddalena E. (Eddy)
Mizzaro S. (Stefano)
Publication venue
Publication date: 22/10/2018
Field of study

In the age of fake news and of filter bubbles, assessing the quality of information is a compelling issue: it is important for users to understand the quality of the information they consume online. We report on our experiment aimed at understanding if workers from the crowd can be a suitable alternative to

CWI's Institutional Repository

Towards building a standard dataset for Arabic keyphrase extraction evaluation

Author: Basaldella M.
Demartini G.
Helmy M.
Maddalena E.
Mizzaro S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

Keyphrases are short phrases that best represent a document content. They can be useful in a variety of applications, including document summarization and retrieval models. In this paper, we introduce the first dataset of keyphrases for an Arabic document collection, obtained by means of crowdsourcing. We experimentally evaluate different crowdsourced answer aggregation strategies and validate their performances against expert annotations to evaluate the quality of our dataset. We report about our experimental results, the dataset features

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Udine

White Rose Research Online

UQ eSpace (University of Queensland)

Visual exploration and retrieval of XML document collections with the generic system X2

Author: Felix Weigel
François Bry
H Meuss
Holger Meuss
Klaus U. Schulz
S Ceri
S Mizzaro
Simone Leonardi
T Catarci
T Schlieder
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2005
Field of study

This article reports on the XML retrieval system X2 which has been developed at the University of Munich over the last five years. In a typical session with X2, the user first browses a structural summary of the XML database in order to select interesting elements and keywords occurring in documents. Using this intermediate result, queries combining structure and textual references are composed semiautomatically. After query evaluation, the full set of answers is presented in a visual and structured way. X2 largely exploits the structure found in documents, queries and answers to enable new interactive visualization and exploration techniques that support mixed IR and database-oriented querying, thus bridging the gap between these three views on the data to be retrieved. Another salient characteristic of X2 which distinguishes it from other visual query systems for XML is that it supports various degrees of detailedness in the presentation of answers, as well as techniques for dynamically reordering and grouping retrieved elements once the complete answer set has been computed

Crossref

Open Access LMU ( Ludwig-Maximilians-Univ. München)

Causal Text-to-Text Transformers for Water Pollution Forecasting

Author: Della Mea V.
Gattazzo C.
Mizzaro S.
Roitero K.
Zancola A.
Publication venue: CEUR-WS
Publication date: 01/01/2022
Field of study

We propose a novel approach based on large language causal models to perform the task of time-series forecasting, and we use the proposed approach to effectively forecast the concentration of polluting substances in a water treatment plant; we address both short- and mid-term forecasting. As opposed to the classical state-of-the-art approaches for time-series forecasting, that handle numerical and categorical features following a standard deep learning approach, we transform the input features into a textual form and we then feed them to a standard causal model pre-trained on natural language tasks. Our empirical results provide evidence that large language models are more effective than state-of-the-art forecasting systems, and that they can be practically used in time-series forecasting tasks. We also show promising results on zero-shot learning. The results of this study open up to a wide range of works aimed at predicting future temporal values by leveraging natural language paradigms and models

Archivio istituzionale della ricerca - Università degli Studi di Udine

Detection of Wastewater Pollution through Natural Language Generation with a Low-Cost Sensing Platform

Author: Cerro G.
Mea V. D.
Mizzaro S.
Molinara M.
Portelli B.
Roitero K.
Serra G.
Vitelli M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2023
Field of study

IRIS Unicas (Università degli Studi di Cassino e del Lazio Meridionale)

Supporting Fair and Efficient Emergency Medical Services in a Large Heterogeneous Region

Author: Da Ros F.
Della Mea V.
Deroma L.
Di Gaspero L.
La Barbera D.
Mizzaro S.
Roitero K.
Valent F.
Publication venue
Publication date: 01/01/2024
Field of study

Emergency Medical Services (EMS) are crucial in delivering timely and effective medical care to patients in need. However, the complex and dynamic nature of operations poses challenges for decision-making processes at strategic, tactical, and operational levels. This paper proposes an action-driven strategy for EMS management, employing a multi-objective optimizer and a simulator to evaluate potential outcomes of decisions. The approach combines historical data with dynamic simulations and multi-objective optimization techniques to inform decision-makers and improve the overall performance of the system. The research focuses on the Friuli Venezia Giulia region in north-eastern Italy. The region encompasses various landscapes and demographic situations that challenge fairness and equity in service access. Similar challenges are faced in other regions with comparable characteristics. The Decision Support System developed in this work accurately models the real-world system and provides valuable feedback and suggestions to EMS professionals, enabling them to make informed decisions and enhance the efficiency and fairness of the system

Archivio istituzionale della ricerca - Università degli Studi di Udine

A social approach to context-aware retrieval

Author: H Chen
HL Truong
K Sungrim
K Sungrim
Luca Vassena
M Sanderson
MC Gonzalez
MP Papazoglou
P Coppola
P Korpipää
S Ceri
SA Golder
Stefano Mizzaro
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Efficiency Theory: a Unifying Theory for Information, Computation and Intelligence

The paper serves as the first contribution towards the development of the theory of efficiency: a unifying framework for the currently disjoint theories of information, complexity, communication and computation. Realizing the defining nature of the brute force approach in the fundamental concepts in all of the above mentioned fields, the paper suggests using efficiency or improvement over the brute force algorithm as a common unifying factor necessary for the creation of a unified theory of information manipulation. By defining such diverse terms as randomness, knowledge, intelligence and computability in terms of a common denominator we are able to bring together contributions from Shannon, Levin, Kolmogorov, Solomonoff, Chaitin, Yao and many others under a common umbrella of the efficiency theory

arXiv.org e-Print Archive

Crossref

University of Louisville