221,863 research outputs found

    An Overview of the Use of Neural Networks for Data Mining Tasks

    In recent years, data mining has seen considerable demand for technologies that extract knowledge from large and complex data sources. Substantial commercial interest and research effort aim to develop new and improved approaches for extracting information, relationships, and patterns from datasets. Artificial Neural Networks (NN) are popular, biologically inspired intelligent methodologies whose classification, prediction and pattern recognition capabilities have been used successfully in many areas, including science, engineering, medicine, business, banking and telecommunication. This paper highlights, from a data mining perspective, the implementation of NN using supervised and unsupervised learning for pattern recognition, classification, prediction and cluster analysis, and focuses the discussion on their use in bioinformatics and financial data analysis tasks.

    k-Nearest Neighbor Classification over Semantically Secure Encrypted Relational Data

    Data mining has wide applications in areas such as banking, medicine, scientific research and government agencies. Classification is one of the most commonly used tasks in data mining applications. Over the past decade, due to the rise of various privacy issues, many theoretical and practical solutions to the classification problem have been proposed under different security models. However, with the recent popularity of cloud computing, users now have the opportunity to outsource their data, in encrypted form, as well as the data mining tasks to the cloud. Since the data on the cloud is in encrypted form, existing privacy-preserving classification techniques are not applicable. In this paper, we focus on solving the classification problem over encrypted data. In particular, we propose a secure k-NN classifier over encrypted data in the cloud. The proposed k-NN protocol protects the confidentiality of the data, the user's input query, and data access patterns. To the best of our knowledge, our work is the first to develop a secure k-NN classifier over encrypted data under the semi-honest model. We also empirically analyze the efficiency of our solution through various experiments. Comment: 29 pages, 2 figures, 3 tables. arXiv admin note: substantial text overlap with arXiv:1307.482
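
For orientation, the plaintext k-NN classification that the abstract's protocol secures can be sketched as below. This is a minimal illustration on unencrypted data only, not the paper's secure protocol; the function name `knn_classify` and the toy records are assumptions made for this example.

```python
import math
from collections import Counter


def knn_classify(train, query, k=3):
    """Classify `query` by majority label among its k nearest training records.

    `train` is a list of (feature_tuple, label) pairs. Plaintext only:
    nothing here is encrypted or privacy preserving.
    """
    if not 1 <= k <= len(train):
        raise ValueError("k must be between 1 and the number of training records")
    # Sort records by Euclidean distance to the query and keep the k closest.
    neighbors = sorted(train, key=lambda rec: math.dist(rec[0], query))[:k]
    # Majority vote over the labels of those neighbors.
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two well-separated groups.
records = [
    ((1.0, 1.0), "a"),
    ((1.2, 0.9), "a"),
    ((0.9, 1.1), "a"),
    ((5.0, 5.0), "b"),
    ((5.1, 4.8), "b"),
]
print(knn_classify(records, (1.1, 1.0), k=3))
```

A secure variant would have to perform the distance computation and the vote without revealing the records, the query, or which records were accessed, which is the part the abstract claims as its contribution.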

    Adversarial Attacks on Deep Neural Networks for Time Series Classification

    Time Series Classification (TSC) problems are encountered in many real-life data mining tasks, ranging from medicine and security to human activity recognition and food safety. With the recent success of deep neural networks in domains such as computer vision and natural language processing, researchers have started adopting these techniques for solving time series data mining problems. However, to the best of our knowledge, no previous work has considered the vulnerability of deep learning models to adversarial time series examples, which could make them unreliable in situations where the decision taken by the classifier is crucial, such as in medicine and security. For computer vision problems, such attacks have been shown to be very easy to perform by adding an imperceptible amount of noise to an image to trick the network into misclassifying it. Following this line of work, we propose to leverage existing adversarial attack mechanisms to add a special noise to the input time series in order to decrease the network's confidence when classifying instances at test time. Our results reveal that current state-of-the-art deep learning time series classifiers are vulnerable to adversarial attacks, which can have major consequences in multiple domains such as food safety and quality assurance. Comment: Accepted at IJCNN 201
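
The idea of adding a small, targeted noise to a time series to lower a classifier's confidence can be sketched as below. The abstract does not name the attack it uses; this example assumes the fast gradient sign method (FGSM) as one well-known mechanism, and swaps the deep network for a tiny logistic model so the gradient can be written by hand. All names, weights and values are hypothetical.

```python
import math


def predict_proba(weights, bias, series):
    """Probability of class 1 under a logistic model (stand-in for a deep network)."""
    z = sum(w * x for w, x in zip(weights, series)) + bias
    return 1.0 / (1.0 + math.exp(-z))


def fgsm_perturb(weights, bias, series, label, epsilon):
    """Return series + epsilon * sign(dLoss/dx) for the cross-entropy loss.

    For a logistic model the input gradient is (p - label) * w, so each time
    step moves by exactly epsilon in the direction that increases the loss.
    """
    p = predict_proba(weights, bias, series)
    grad = [(p - label) * w for w in weights]
    return [x + epsilon * ((g > 0) - (g < 0)) for x, g in zip(series, grad)]


# Hypothetical model and input series whose true class is 1.
weights, bias = [0.5, -0.25, 1.0, 0.75], 0.0
series = [1.0, 0.5, 1.5, 1.0]
adversarial = fgsm_perturb(weights, bias, series, label=1, epsilon=0.1)

print(predict_proba(weights, bias, series))       # confidence on the clean series
print(predict_proba(weights, bias, adversarial))  # lower confidence after the attack
```

With a real network the gradient would come from automatic differentiation rather than a closed form, but the perturbation step is the same.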

    Mining Medical Data: Bridging the Knowledge Divide

    Due to the significant amount of data generated by modern medicine, there is a growing reliance on tools such as data mining and knowledge discovery to help make sense of and comprehend such data. The success of this process requires collaboration and interaction between such methods and medical professionals. An important question therefore is: how can we strengthen the relationship between two traditionally separate fields (technology and medicine) so that they work together towards enhancing knowledge in modern medicine? To address this question, this study examines the application of data mining techniques to a large asthma medical dataset. A discussion is proposed that introduces various methods for a smooth approach, moving away from the `jack of all trades, master of none' towards a modular, cooperative approach for a successful outcome. The results of this study support the use of data mining as a useful tool and highlight the advantages, on a global scale, of closer relations between the two distinct fields. Exploration of the CRISP methodology suggests that a `one methodology fits all' approach is not appropriate; rather, methodologies should be combined into a hybrid, holistic approach to data mining.

    Grid data mining for outcome prediction in intensive care medicine

    This paper introduces a distributed data mining approach suited to grid computing environments, based on a supervised learning classifier system. Specific Classifier and Majority Voting methods for Distributed Data Mining (DDM) are explored and compared with the Centralized Data Mining (CDM) approach. Experimental tests were conducted on a real-world data set from intensive care medicine in order to predict patient outcomes. The results demonstrate that the performance of the DDM methods is better than that of the CDM method. Fundação para a Ciência e a Tecnologia (FCT
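
A majority-voting combiner of the kind named in this abstract can be sketched as below. This is a minimal illustration, assuming each grid site exposes a locally trained model as a callable; the three threshold rules, field names and labels are hypothetical stand-ins and are not taken from the paper.

```python
from collections import Counter


def majority_vote(local_models, record):
    """Combine the predictions of per-site models by simple majority.

    Each model is a callable mapping a record to a class label. Ties are
    resolved in favour of the label that was voted for first.
    """
    votes = Counter(model(record) for model in local_models)
    return votes.most_common(1)[0][0]


# Hypothetical local models, one per site, each learned from that site's data.
site_models = [
    lambda r: "high_risk" if r["age"] > 70 else "low_risk",
    lambda r: "high_risk" if r["heart_rate"] > 110 else "low_risk",
    lambda r: "high_risk" if r["age"] > 60 and r["heart_rate"] > 100 else "low_risk",
]

patient = {"age": 75, "heart_rate": 105}
print(majority_vote(site_models, patient))
```

Only the predicted labels travel between sites in this scheme, so the raw records can stay where they were collected.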

    Data Mining

    The availability of big data due to computerization and automation has generated an urgent need for new techniques to analyze big data and convert it into useful information and knowledge. Data mining is a promising, leading-edge technology for mining large volumes of data, looking for hidden information, and aiding knowledge discovery. It can be used for characterization, classification, discrimination, anomaly detection, association, clustering, trend or evolution prediction, and much more, in fields such as science, medicine, economics, engineering, computers, and even business analytics. This book presents basic concepts, ideas, and research in data mining.

    A Hybrid Mining Approach to Facilitate Health Insurance Decision: Case Study of Non-Traditional Data Mining Applications in Taiwan NHI Databases

    This study examines time-sensitive applications of data mining methods to facilitate claims review processing and provide policy information for insurance decision-making vis-à-vis the Taiwan National Health Insurance databases. In order to obtain the best payment management, a hybrid mining approach is proposed, grounded in existing knowledge of data mining projects and of the health insurance domain. By integrating data warehousing, online analytical processing, data mining techniques and traditional data analysis in the healthcare field, an easy-to-use decision support platform is built to facilitate the health insurance decision-making process. Drawing on lessons learned in the case study, the results show that the hybrid mining approach is not only a reliable, powerful, and user-friendly platform for diversified payment decision support, but also highly relevant to the practice and acceptance of evidence-based medicine. Researchers should, in the future, develop the hybrid mining approach in combination with their own application systems.