
2 research outputs found

EventEpi-A natural language processing framework for event-based surveillance.

Authors: Abbood Auss; Busche Rüdiger; Ghozzi Stéphane; Ullrich Alexander
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/11/2020

According to the World Health Organization (WHO), around 60% of all outbreaks are detected using informal sources. In many public health institutes, including the WHO and the Robert Koch Institute (RKI), dedicated groups of public health agents sift through numerous articles and newsletters to detect relevant events. This media screening is one important part of event-based surveillance (EBS). Reading the articles, discussing their relevance, and putting key information into a database is a time-consuming process. To support EBS, but also to gain insights into what makes an article and the event it describes relevant, we developed a natural language processing framework for automated information extraction and relevance scoring. First, we scraped relevant sources for EBS as done at the RKI (WHO Disease Outbreak News and ProMED) and automatically extracted the articles' key data: disease, country, date, and confirmed-case count. For this, we performed named entity recognition in two steps. First, EpiTator, an open-source epidemiological annotation tool, suggested many different candidates for each of these items. Second, we selected the key entity among the candidates: we extracted the key country and disease using a heuristic, with good results, and we trained a naive Bayes classifier to find the key date and confirmed-case count, using the RKI's EBS database as labels; this classifier performed modestly. Then, for relevance scoring, we defined two classes to which any article might belong: the article is relevant if it is in the EBS database and irrelevant otherwise. We compared the performance of different classifiers, using bag-of-words, document embeddings, and word embeddings. The best classifier, a logistic regression, achieved a sensitivity of 0.82 and an index balanced accuracy of 0.61. Finally, we integrated these functionalities into a web application called EventEpi, where relevant sources are automatically analyzed and put into a database. The user can also provide any URL or text, which will be analyzed in the same way and added to the database.
Each of these steps could be improved, in particular with larger labeled datasets and fine-tuning of the learning algorithms. The overall framework, however, already works well and can be used in production, promising improvements in EBS. The source code and data are publicly available under open licenses.
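
The two scores reported for the relevance classifier can be illustrated with a minimal sketch. It assumes binary labels (1 = relevant, 0 = irrelevant) and the commonly used definition of index balanced accuracy, IBA = (1 + alpha * (sensitivity - specificity)) * sensitivity * specificity. The weighting alpha = 0.1, the function names, and the toy labels are assumptions for illustration; the abstract does not state how the authors computed the score.

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 = relevant)."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1
        elif truth == 0 and pred == 1:
            fp += 1
        elif truth == 0 and pred == 0:
            tn += 1
        else:  # truth == 1 and pred == 0
            fn += 1
    return tp, fp, tn, fn


def sensitivity(tp, fn):
    """Share of truly relevant articles that were flagged as relevant."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def specificity(tn, fp):
    """Share of truly irrelevant articles that were flagged as irrelevant."""
    return tn / (tn + fp) if (tn + fp) else 0.0


def index_balanced_accuracy(sens, spec, alpha=0.1):
    """IBA with an assumed weighting alpha; rewards balance between both rates."""
    dominance = sens - spec
    return (1 + alpha * dominance) * sens * spec


# Toy labels, invented for illustration only (not data from the paper).
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 1]

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
iba = index_balanced_accuracy(sens, spec)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} IBA={iba:.2f}")
```

Because relevant articles are rare compared with irrelevant ones, plain accuracy would be dominated by the majority class; sensitivity and index balanced accuracy both stay informative under that imbalance.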

Helmholtz Zentrum für Infektionsforschung Repository

Directory of Open Access Journals

Machine Learning for Health: Algorithm Auditing & Quality Control.

Authors: Abbood Auss; Akogo Darlington; Alsalamah Shada; Arentz Matthew; Baird Pat; Balachandran Pradeep; Bielik Pavol; Cabitza Federico; Calderon-Ramirez Saul; Choudhary Shruti; Fehr Jana; Geierhofer Regina; Goldschmidt Peter G; Haufe Stefan; Johner Christian; Kazim Emre; Kherif Ferath; Koshiyama Adriano; Krois Joachim; Langer Nicolas; Leite Alixandro Werneck; Liu Xiaoxuan; Macpherson Sheena; Matek Christian; Meyer Martin; Murchison Andrew G; Nakasi Rose; Oala Luis; Piechottka Sven; Prabhu Carolin; Pujari Sameer; Samek Wojciech; Sanguinetti Bruno; Schörverth Elora D M; Shadforth Ian; Vogler Steffen; Weicken Eva; Wenzel Markus; Wiegand Thomas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021

Developers proposing new machine learning for health (ML4H) tools often pledge to match or even surpass the performance of existing tools, yet the reality is usually more complicated. Reliable deployment of ML4H to the real world is challenging, as examples from diabetic retinopathy or Covid-19 screening show. We envision an integrated framework of algorithm auditing and quality control that provides a path towards the effective and reliable application of ML systems in healthcare. In this editorial, we give a summary of ongoing work towards that vision and announce a call for participation in the special issue "Machine Learning for Health: Algorithm Auditing & Quality Control" in this journal to advance the practice of ML4H auditing.

Repository for Publications and Research Data

Serveur académique lausannois

UCL Discovery

Heart of England: HEFT Repository

ZORA

PuSH

Publikationsserver des Robert Koch-Instituts

Hochschulbibliothekszentrum des Landes Nordrhein-Westfalen (hbz)

HEFT Repository