27 research outputs found

    A novel hybrid approach for interestingness analysis of classification rules

    Get PDF
    Data mining is the efficient discovery of patterns in large databases, and classification rules are perhaps the most important type of patterns in data mining applications. However, the number of such classification rules is generally very big that selection of interesting ones among all discovered rules becomes an important task. In this paper, factors related to the interestingness of a rule are investigated and some new factors are proposed. Following this, an interactive rule interestingness-learning algorithm (IRIL) is developed to automatically label the classification rules either as "interesting" or "uninteresting" with limited user participation. In our study, VFP (Voting Feature Projections), a feature projection based incremental classification learning algorithm, is also developed in the framework of IRIL. The concept description learned by the VFP algorithm constitutes a novel hybrid approach for interestingness analysis of classification rules. © Springer-Verlag Berlin Heidelberg 2007

    Explainable 2D Vision Models for 3D Medical Data

    Full text link
    Training Artificial Intelligence (AI) models on three-dimensional image data presents unique challenges compared to the two-dimensional case: Firstly, the computational resources are significantly higher, and secondly, the availability of large pretraining datasets is often limited, impeding training success. In this study, we propose a simple approach of adapting 2D networks with an intermediate feature representation for processing 3D volumes. Our method involves sequentially applying these networks to slices of a 3D volume from all orientations. Subsequently, a feature reduction module combines the extracted slice features into a single representation, which is then used for classification. We evaluate our approach on medical classification benchmarks and a real-world clinical dataset, demonstrating comparable results to existing methods. Furthermore, by employing attention pooling as a feature reduction module we obtain weighted importance values for each slice during the forward pass. We show that slices deemed important by our approach allow the inspection of the basis of a model's prediction

    Data mining experiments on the Angiotensin II-Antagonist in Paroxysmal Atrial Fibrillation (ANTIPAF-AFNET 2) trial: ‘Exposing the invisible’

    Get PDF
    Aims: The aims of this study include (i) pursuing data-mining experiments on the Angiotensin II-Antagonist in Paroxysmal Atrial Fibrillation (ANTIPAF-AFNET 2) trial dataset containing atrial fibrillation (AF) burden scores of patients with many clinical parameters and (ii) revealing possible correlations between the estimated risk factors of AF and other clinical findings or measurements provided in the dataset. Methods: Ranking Instances by Maximizing the Area under a Receiver Operating Characteristics (ROC) Curve (RIMARC) is used to determine the predictive weights (Pw) of baseline variables on the primary endpoint. Chi-square automatic interaction detector algorithm is performed for comparing the results of RIMARC. The primary endpoint of the ANTIPAF-AFNET 2 trial was the percentage of days with documented episodes of paroxysmal AF or with suspected persistent AF. Results: By means of the RIMARC analysis algorithm, baseline SF-12 mental component score (Pw = 0.3597), age (Pw = 0.2865), blood urea nitrogen (BUN) (Pw = 0.2719), systolic blood pressure (Pw = 0.2240), and creatinine level (Pw = 0.1570) of the patients were found to be predictors of AF burden. Atrial fibrillation burden increases as baseline SF-12 mental component score gets lower; systolic blood pressure, BUN and creatinine levels become higher; and the patient gets older. The AF burden increased significantly at age >76. Conclusions: With the ANTIPAF-AFNET 2 dataset, the present data-mining analyses suggest that a baseline SF-12 mental component score, age, systolic blood pressure, BUN, and creatinine level of the patients are predictors of AF burden. Additional studies are necessary to understand the distinct kidney-specific pathophysiological pathways that contribute to AF burden. Published on behalf of the European Society of Cardiology. All rights reserved. © The Author 2016

    Application of the RIMARC algorithm to a large data set of action potentials and clinical parameters for risk prediction of atrial fibrillation

    Get PDF
    Ex vivo recorded action potentials (APs) in human right atrial tissue from patients in sinus rhythm (SR) or atrial fibrillation (AF) display a characteristic spike-and-dome or triangular shape, respectively, but variability is huge within each rhythm group. The aim of our study was to apply the machine-learning algorithm ranking instances by maximizing the area under the ROC curve (RIMARC) to a large data set of 480 APs combined with retrospectively collected general clinical parameters and to test whether the rules learned by the RIMARC algorithm can be used for accurately classifying the preoperative rhythm status. APs were included from 221 SR and 158 AF patients. During a learning phase, the RIMARC algorithm established a ranking order of 62 features by predictive value for SR or AF. The model was then challenged with an additional test set of features from 28 patients in whom rhythm status was blinded. The accuracy of the risk prediction for AF by the model was very good (0.93) when all features were used. Without the seven AP features, accuracy still reached 0.71. In conclusion, we have shown that training the machine-learning algorithm RIMARC with an experimental and clinical data set allows predicting a classification in a test data set with high accuracy. In a clinical setting, this approach may prove useful for finding hypothesis-generating associations between different parameters. © 2014, International Federation for Medical and Biological Engineering

    A systematic review of mental health outcome measures for young people aged 12 to 25 years

    Full text link

    Lipoid proteinosis: Urbach-Wiethe disease

    No full text
    PubMedID: 8241657Lipoid proteinosis (Urbach-Wicthe disease) is a rare autosomal recessive disorder in which hyalinized material is deposited in the skin, mucous membranes and brain. Laryngeal changes resulting in hoarseness may be present at birth or in early infancy, often being the first sign of the discase. A typical case is presented. Copyright © 1993, Wiley Blackwell. All rights reserve

    Supervised Classification with Associative SOM

    No full text

    A prescription fraud detection model

    No full text
    Prescription fraud is a main problem that causes substantial monetary loss in health care systems. We aimed to develop a model for detecting cases of prescription fraud and test it on real world data from a large multi-center medical prescription database. Conventionally, prescription fraud detection is conducted on random samples by human experts. However, the samples might be misleading and manual detection is costly. We propose a novel distance based on data-mining approach for assessing the fraudulent risk of prescriptions regarding cross-features. Final tests have been conducted on adult cardiac surgery database. The results obtained from experiments reveal that the proposed model works considerably well with a true positive rate of 77.4% and a false positive rate of 6% for the fraudulent medical prescriptions. The proposed model has the potential advantages including on-line risk prediction for prescription fraud, off-line analysis of high-risk prescriptions by human experts, and self-learning ability by regular updates of the integrative data sets. We conclude that incorporating such a system in health authorities, social security agencies and insurance companies would improve efficiency of internal review to ensure compliance with the law, and radically decrease human-expert auditing costs. © 2011 Elsevier Ireland Ltd
    corecore