2 research outputs found
Multilingual enrichment of disease biomedical ontologies
Translating biomedical ontologies is an important challenge, but doing it
manually requires much time and money. We study the possibility to use
open-source knowledge bases to translate biomedical ontologies. We focus on two
aspects: coverage and quality. We look at the coverage of two biomedical
ontologies focusing on diseases with respect to Wikidata for 9 European
languages (Czech, Dutch, English, French, German, Italian, Polish, Portuguese
and Spanish) for both ontologies, plus Arabic, Chinese and Russian for the
second one. We first use direct links between Wikidata and the studied
ontologies and then use second-order links by going through other intermediate
ontologies. We then compare the quality of the translations obtained thanks to
Wikidata with a commercial machine translation tool, here Google Cloud
Translation
Towards Unbiased and Accurate Deferral to Multiple Experts
Machine learning models are often implemented in cohort with humans in the
pipeline, with the model having an option to defer to a domain expert in cases
where it has low confidence in its inference. Our goal is to design mechanisms
for ensuring accuracy and fairness in such prediction systems that combine
machine learning model inferences and domain expert predictions. Prior work on
"deferral systems" in classification settings has focused on the setting of a
pipeline with a single expert and aimed to accommodate the inaccuracies and
biases of this expert to simultaneously learn an inference model and a deferral
system. Our work extends this framework to settings where multiple experts are
available, with each expert having their own domain of expertise and biases. We
propose a framework that simultaneously learns a classifier and a deferral
system, with the deferral system choosing to defer to one or more human experts
in cases of input where the classifier has low confidence. We test our
framework on a synthetic dataset and a content moderation dataset with biased
synthetic experts, and show that it significantly improves the accuracy and
fairness of the final predictions, compared to the baselines. We also collect
crowdsourced labels for the content moderation task to construct a real-world
dataset for the evaluation of hybrid machine-human frameworks and show that our
proposed learning framework outperforms baselines on this real-world dataset as
well.Comment: This paper has been accepted for publication at the AAAI/ACM
Conference on Artificial Intelligence, Ethics, and Society (AIES 2021