4 research outputs found

    PLENARY: Explaining black-box models in natural language through fuzzy linguistic summaries

    No full text
    We introduce an approach called PLENARY (exPlaining bLack-box modEls in Natural lAnguage thRough fuzzY linguistic summaries), which is an explainable classifier based on a data-driven predictive model. Neural learning is exploited to derive a predictive model based on two levels of labels associated with the data. Then, model explanations are derived through the popular SHapley Additive exPlanations (SHAP) tool and conveyed in a linguistic form via fuzzy linguistic summaries. The linguistic summarization allows translating the explanations of the model outputs provided by SHAP into statements expressed in natural language. PLENARY accounts for the imprecision related to model outputs by summarizing them into simple linguistic statements and for the imprecision related to the data labeling process by including additional domain knowledge in the form of middle-layer labels. PLENARY is validated on preprocessed speech signals collected from smartphones from patients with bipolar disorder and on publicly available mental health survey data. The experiments confirm that fuzzy linguistic summarization is an effective technique to support meta-analyses of the outputs of AI models. Also, PLENARY improves explainability by aggregating low-level attributes into high-level information granules, and by incorporating vague domain knowledge into a multi-task sequential and compositional multilayer perceptron. SHAP explanations translated into fuzzy linguistic summaries significantly improve understanding of the predictive modelling process and its outputs

    Dynamic Incremental Semi-supervised Fuzzy Clustering for Bipolar Disorder Episode Prediction

    No full text
    Bipolar Disorder (BD) is a chronic mental illness characterized by changing episodes from euthymia (healthy state) through depression and mania to the mixed states. In this context, data collected through the interaction of patients with smartphones enable the creation of predictive models to support the early prediction of a starting episode. Previous research on predicting a new BD episode use mostly supervised learning methods that require labeled data and hence force a filtering of the available data to retain only those data that have valid labels (from the psychiatric assessment). To avoid limitations of supervised learning, in this paper we investigate the use of a semi-supervised learning approach that combines both labeled and unlabeled data to derive a model for BD episode prediction. Specifically we apply the DISSFCM (Dynamic Incremental Semi-Supervised Fuzzy C-Means) algorithm which offers the possibility to process in an incremental fashion the data stream of the voice signal captured by the smartphone, thus exploiting the evolving time structure of data which is ignored by static learning methods. DISSFCM processes data in form of chunks and creates a dynamic collection of clusters thanks to a splitting mechanism that generates new clusters to better capture the hidden geometrical structure of data. This gives DISSFCM the ability to detect changes in data and dynamically adapt the model to them, thus improving the prediction accuracy. Preliminary results on real-world data collected at the Department of Affective Disorders, Institute of Psychiatry and Neurology in Warsaw (Poland) show that DISSFCM is able to predict some of healthy episodes (euthymia) and disease episodes even when only 25% of labeled data are available. Moreover DISSFM performs better than its previous version without split (ISSFCM) and it also overcomes the batch algorithm (SSFCM) that uses the whole dataset to create the model
    corecore