Search CORE

30 research outputs found

L'utilisation des POMDP pour les résumés multi-documents orientés par une thématique

Author: Chali Yllias
Hasan Sadid A.
Mojahid Mustapha
Publication venue: HAL CCSD
Publication date: 01/01/2013
Field of study

National audienceL’objectif principal du résumé multi-documents orienté par une thématique est de générer un résumé à partir de documents sources en réponse à une requête formulée par l’utilisateur. Cette tâche est difficile car il n’existe pas de méthode efficace pour mesurer la satisfaction de l’utilisateur. Cela introduit ainsi une incertitude dans le processus de génération de résumé. Dans cet article, nous proposons une modélisation de l’incertitude en formulant notre système de résumé comme un processus de décision markovien partiellement observables (POMDP) car dans de nombreux domaines on a montré que les POMDP permettent de gérer efficacement les incertitudes. Des expériences approfondies sur les jeux de données du banc d’essai DUC ont démontré l’efficacité de notre approche

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

A reinforcement learning formulation to the complex question answering problem

Author: Chali Yllias
Hasan Sadid A.
Mojahid Mustapha
Publication venue: 'Elsevier BV'
Publication date: 01/05/2015
Field of study

International audienceWe use extractive multi-document summarization techniques to perform complex question answering and formulate it as a reinforcement learning problem. Given a set of complex questions, a list of relevant documents per question, and the corresponding human generated summaries (i.e. answers to the questions) as training data, the reinforcement learning module iteratively learns a number of feature weights in order to facilitate the automatic generation of summaries i.e. answers to previously unseen complex questions. A reward function is used to measure the similarities between the candidate (machine generated) summary sentences and the abstract summaries. In the training stage, the learner iteratively selects the important document sentences to be included in the candidate summary, analyzes the reward function and updates the related feature weights accordingly. The final weights are used to generate summaries as answers to unseen complex questions in the testing stage. Evaluation results show the effectiveness of our system. We also incorporate user interaction into the reinforcement learner to guide the candidate summary sentence selection process. Experiments reveal the positive impact of the user interaction component on the reinforcement learning framework

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

HAL Descartes

SemClinBr -- a multi institutional and multi specialty semantically annotated corpus for Portuguese clinical NLP tasks

Author: Carvalho Deborah Ribeiro
Cintho Lilian Mie Mukai
da Silva Adalniza Moura Pucca
Gebeluca Caroline P.
Gumiel Yohan Bonescki
Hasan Sadid A.
Moro Claudia Maria Cabral
Oliveira Lucas Emanuel Silva e
Peters Ana Carolina
Publication venue
Publication date: 27/01/2020
Field of study

The high volume of research focusing on extracting patient's information from electronic health records (EHR) has led to an increase in the demand for annotated corpora, which are a very valuable resource for both the development and evaluation of natural language processing (NLP) algorithms. The absence of a multi-purpose clinical corpus outside the scope of the English language, especially in Brazilian Portuguese, is glaring and severely impacts scientific progress in the biomedical NLP field. In this study, we developed a semantically annotated corpus using clinical texts from multiple medical specialties, document types, and institutions. We present the following: (1) a survey listing common aspects and lessons learned from previous research, (2) a fine-grained annotation schema which could be replicated and guide other annotation initiatives, (3) a web-based annotation tool focusing on an annotation suggestion feature, and (4) both intrinsic and extrinsic evaluation of the annotations. The result of this work is the SemClinBr, a corpus that has 1,000 clinical notes, labeled with 65,117 entities and 11,263 relations, and can support a variety of clinical NLP tasks and boost the EHR's secondary use for the Portuguese language

arXiv.org e-Print Archive

Overview of ImageCLEF 2018: Challenges, Datasets and Evaluation

Author: Andrearczyk Vincent
Dang-Nguyen Duc-Tien
Dicente Cid Yashin
Eickhoff Carsten
Farri Oladimeji
Garcia Seco De Herrera Alba
Gurrin Cathal
Hasan Sadid A
Ionescu Bogdan
Kovalev Vassili
Liauchuk Vitali
Ling Yuan
Liu Joey
Lungren Matthew
Lux Mathias
Müller Henning
Piras Luca
Riegler Michael
Villegas Mauricio
Zhou Liting
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

This paper presents an overview of the ImageCLEF 2018 evaluation campaign, an event that was organized as part of the CLEF (Conference and Labs of the Evaluation Forum) Labs 2018. ImageCLEF is an ongoing initiative (it started in 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval with the aim of providing information access to collections of images in various usage scenarios and domains. In 2018, the 16th edition of ImageCLEF ran three main tasks and a pilot task: (1) a caption prediction task that aims at predicting the caption of a figure from the biomedical literature based only on the figure image; (2) a tuberculosis task that aims at detecting the tuberculosis type, severity and drug resistance from CT (Computed Tomography) volumes of the lung; (3) a LifeLog task (videos, images and other sources) about daily activities understanding and moment retrieval, and (4) a pilot task on visual question answering where systems are tasked with answering medical questions. The strong participation, with over 100 research groups registering and 31 submitting results for the tasks, shows an increasing interest in this benchmarking campaign

University of Essex Research Repository

Crossref

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

Archivio istituzionale della ricerca - Università di Cagliari

DCU Online Research Access Service

Creative Contextual Dialog Adaptation in an Open World RPG

Author: Colton Simon
Creutz Mathias
Georgios
Hasan Sadid A
Heafield Kenneth
Heafield Kenneth
Hämäläinen Mika
Kerr Christopher
Klein Guillaume
Ryan James
Smedt Tom De
Tiedemann Jörg
Togelius Noor
Ventura Dan
Publication venue: ACM
Publication date: 01/01/2019
Field of study

Peer reviewe

Crossref

Helsingin yliopiston digitaalinen arkisto

ImageCLEF 2019: Multimedia Retrieval in Lifelogging, Medical, Nature, and Security Applications

Author: Ben Abacha Asma
Chamberlain Jon
Cid Yashin Dicente
Clark Adrian
Dang-Nguyen Duc-Tien
Datla Vivek
del Blanco Carlos Roberto
Demner-Fushman Dina
Friedrich Christoph M
Garcia Seco De Herrera Alba
Garcia Narciso
Gurrin Cathal
Hasan Sadid A
Ionescu Bogdan
Karampidis Konstantinos
Kavallieratou Ergina
Kovalev Vassili
Liauchuk Vitali
Liu Joey
Lux Mathias
Müller Henning
Pelka Obioma
Piras Luca
Péteri Renaud
Riegler Michael
Rodríguez Carlos Cuevas
Tran Minh-Triet
Vasillopoulos Nikos
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

This paper presents an overview of the foreseen ImageCLEF 2019 lab that will be organized as part of the Conference and Labs of the Evaluation Forum - CLEF Labs 2019. ImageCLEF is an ongoing evaluation initiative (started in 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval of visual data with the aim of providing information access to large collections of images in various usage scenarios and domains. In 2019, the 17th edition of ImageCLEF will run four main tasks: (i) a Lifelog task (videos, images and other sources) about daily activities understanding, retrieval and summarization, (ii) a Medical task that groups three previous tasks (caption analysis, tuberculosis prediction, and medical visual question answering) with newer data, (iii) a new Coral task about segmenting and labeling collections of coral images for 3D modeling, and (iv) a new Security task addressing the problems of automatically identifying forged content and retrieve hidden information. The strong participation, with over 100 research groups registering and 31 submitting results for the tasks in 2018 shows an important interest in this benchmarking campaign and we expect the new tasks to attract at least as many researchers for 2019

University of Essex Research Repository

University of Bergen

Crossref

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

NORA - Norwegian Open Research Archives

Archivo Digital UPM (Univ. Politécnica de Madrid)

ImageCLEF 2020: Multimedia Retrieval in Lifelogging, Medical, Nature, and Security Applications

Author: Abacha Asma Ben
Berari Raul
Brie Paul
Campello Antonio
Chamberlain Jon
Cid Yashin Dicente
Clark Adrian
Constantin Mihai Gabriel
Dang-Nguyen Duc-Tien
Datla Vivek
Demner-Fushman Dina
Dogariu Mihai
Fichou Dimitri
Friedrich Christoph M
Garcia Seco De Herrera Alba
Gurrin Cathal
Halvorsen På L
Hasan Sadid A
Ionescu Bogdan
Kovalev Vassili
Kozlovski Serge
Le Tu-Khiem
Liauchuk Vitali
Lux Mathias
Müller Henning
Ninh Van-Tu
Pelka Obioma
Piras Luca
Péteri Renaud
Riegler Michael
Stefan Liviu Daniel
Tran Minh-Triet
Zhou Liting
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

This paper presents an overview of the 2020 ImageCLEF lab that will be organized as part of the Conference and Labs of the Evaluation Forum - CLEF Labs 2020 in Thessaloniki, Greece. ImageCLEF is an ongoing evaluation initiative (run since 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval of visual data with the aim of providing information access to large collections of images in various usage scenarios and domains. In 2020, the 18th edition of ImageCLEF will organize four main tasks: (i) a Lifelog task (videos, images and other sources) about daily activity understanding, retrieval and summarization, (ii) a Medical task that groups three previous tasks (caption analysis, tuberculosis prediction, and medical visual question answering) with new data and adapted tasks, (iii) a Coral task about segmenting and labeling collections of coral images for 3D modeling, and a new (iv) Web user interface task addressing the problems of detecting and recognizing hand drawn website UIs (User Interfaces) for generating automatic code. The strong participation, with over 235 research groups registering and 63 submitting over 359 runs for the tasks in 2019 shows an important interest in this benchmarking campaign. We expect the new tasks to attract at least as many researchers for 2020

University of Essex Research Repository

Crossref

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

Irish Universities

DCU Online Research Access Service

Overview of the ImageCLEF 2021: Multimedia Retrieval in Medical, Nature, Internet and Social Media Applications

Author: Abacha Asma Ben
Berari Raul
Brie Paul
Campello Antonio
Chamberlain Jon
Cid Yashin Dicente
Clark Adrian
Constantin Mihai Gabriel
Demner-Fushman Dina
Deshayes-Chossart Jérôme
Dogariu Mihai
Fichou Dimitri
Friedrich Christoph M
Garcia Seco De Herrera Alba
Hasan Sadid A
Ionescu Bogdan
Jacutprakart Janadhip
Kovalev Vassili
Kozlovski Serge
Liauchuk Vitali
Moustahfid Hassan
Müller Henning
Oliver Thomas A
Pelka Obioma
Popescu Adrian
Péteri Renaud
Sarrouti Mourad
Tauteanu Andrei
Ştefan Liviu Daniel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

This paper presents an overview of the ImageCLEF 2021 lab that was organized as part of the Conference and Labs of the Evaluation Forum – CLEF Labs 2021. ImageCLEF is an ongoing evaluation initiative (first run in 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval of visual data with the aim of providing information access to large collections of images in various usage scenarios and domains. In 2021, the 19th edition of ImageCLEF runs four main tasks: (i) a medical task that groups three previous tasks, i.e., caption analysis, tuberculosis prediction, and medical visual question answering and question generation, (ii) a nature coral task about segmenting and labeling collections of coral reef images, (iii) an Internet task addressing the problems of identifying hand-drawn and digital user interface components, and (iv) a new social media aware task on estimating potential real-life effects of online image sharing. Despite the current pandemic situation, the benchmark campaign received a strong participation with over 38 groups submitting more than 250 runs

University of Essex Research Repository

HAL-CEA