Closing the Loop with Concept Regularization

Abstract

Convolutional Neural Networks (CNNs) are widely adopted in industrial settings, but are prone to biases and lack transparency. Explainable Artificial Intelligence (XAI), particularly through concept extraction (CE), enables global explanations and bias detection, yet offers no corrective measures for the biases it identifies. To bridge this gap, we introduce Concept Regularization (CoRe), which uses CE capabilities alongside human feedback to embed a regularization term during retraining. CoRe adjusts the model's sensitivity to identified biased concepts, aligning its prediction process with expert human assessments. Our evaluations on a modified metal casting dataset demonstrate CoRe's efficacy in bias mitigation, highlighting its potential to refine models in practical applications.
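
The abstract does not give the exact form of the regularization term. As a rough illustration of the general idea only, a minimal sketch, assuming a TCAV-style concept activation vector flagged as biased by a human reviewer and a training loop that exposes the intermediate activations the vector lives in, could look like the following (the class name, the squared-sensitivity penalty, and all parameters are illustrative assumptions, not the paper's implementation):

import torch
import torch.nn as nn

class ConceptRegularizedLoss(nn.Module):
    """Task loss plus a penalty on sensitivity to a human-flagged concept.

    The concept is represented as a unit direction (e.g. a CAV) in the space
    of an intermediate layer's activations; the penalty discourages the
    predicted-class logit from changing along that direction.
    """

    def __init__(self, base_loss: nn.Module, lam: float = 0.1):
        super().__init__()
        self.base_loss = base_loss
        self.lam = lam  # strength of the concept penalty

    def forward(self, logits, targets, activations, concept_direction):
        task_loss = self.base_loss(logits, targets)

        # Directional derivative of the predicted-class logit w.r.t. the
        # intermediate activations, projected onto the concept direction.
        pred_logit = logits.gather(1, logits.argmax(dim=1, keepdim=True)).sum()
        grads = torch.autograd.grad(pred_logit, activations, create_graph=True)[0]
        sensitivity = (grads.flatten(1) * concept_direction).sum(dim=1)

        # Penalise any remaining sensitivity to the biased concept.
        return task_loss + self.lam * sensitivity.pow(2).mean()

In such a setup, the forward pass during retraining would need to return both the logits and the chosen layer's activations (for example via a forward hook), with concept_direction being the normalized concept vector produced by the CE step and confirmed as a bias through human feedback.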


This paper was published in BieColl - Bielefeld eCollections.


Licence: https://creativecommons.org/licenses/by/4.0