3 research outputs found

    A Novel Approach for Mining Big Data Using Multi-Model Fusion Mechanism (MMFM)

    Get PDF
    Big data processing and analytics require sophisticated systems and cutting-edge methodologies to extract useful data from the available data. Extracted data visualization is challenging because of the processing models' dependence on semantics and classification. To categorize and improve information-based semantics that have accumulated over time, this paper introduces the Multi-model fusion mechanism for data mining (MMFM) approach. Information dependencies are organized based on the links between the data model based on attribute values. This method divides the attributes under consideration based on processing time to handle complicated data in controlled amount of time. The proposed MMFM’s performance is assessed with real-time weather prediction dataset where the data is acquired from sensor (observed) and image data. MMFM is used to conduct semantic analytics and similarity-based classification on this collection. The processing time based on records and samples are investigated for the various data sizes, instances, and entries. It is found that the proposed MMFM gets 70 seconds of processing time for 2GB data and 0.99 seconds while handling 5000 records for various classification instances

    Early Detection of Disease using Electronic Health Records and Fisher\u27s Wishart Discriminant Analysis

    Get PDF
    Linear Discriminant Analysis (LDA) is a simple and effective technique for pattern classification, while it is also widely-used for early detection of diseases using Electronic Health Records (EHR) data. However, the performance of LDA for EHR data classification is frequently affected by two main factors: ill-posed estimation of LDA parameters (e.g., covariance matrix), and linear inseparability of the EHR data for classification. To handle these two issues, in this paper, we propose a novel classifier FWDA -- Fisher\u27s Wishart Discriminant Analysis, which is developed as a faster and robust nonlinear classifier. Specifically, FWDA first surrogates the distribution of potential inverse covariance matrix estimates using a Wishart distribution estimated from the training data. Then, FWDA samples a group of inverse covariance matrices from the Wishart distribution, predicts using LDA classifiers based on the sampled inverse covariance matrices, and weighted-averages the prediction results via Bayesian Voting scheme. The weights for voting are optimally updated to adapt each new input data, so as to enable the nonlinear classification
    corecore