8 research outputs found

    Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparison of Classifier Performance

    Get PDF
    Public health care systems routinely collect health-related data from the population. This data can be analyzed using data mining techniques to find novel, interesting patterns, which could help formulate effective public health policies and interventions. The occurrence of chronic illness is rare in the population and the effect of this class imbalance, on the performance of various classifiers was studied. The objective of this work is to identify the best classifiers for class imbalanced health datasets through a cost-based comparison of classifier performance. The popular, open-source data mining tool WEKA, was used to build a variety of core classifiers as well as classifier ensembles, to evaluate the classifiers’ performance. The unequal misclassification costs were represented in a cost matrix, and cost-benefit analysis was also performed.  In another experiment, various sampling methods such as under-sampling, over-sampling, and SMOTE was performed to balance the class distribution in the dataset, and the costs were compared. The Bayesian classifiers performed well with a high recall, low number of false negatives and were not affected by the class imbalance. Results confirm that total cost of Bayesian classifiers can be further reduced using cost-sensitive learning methods. Classifiers built using the random under-sampled dataset showed a dramatic drop in costs and high classification accuracy

    Quality of Service-Based Cross-Layer Protocol for Wireless Sensor Networks

    No full text
    Sensor nodes in wireless sensor networks (WSN) are used for perceiving, monitoring, and controlling a wide range of applications. Owing to the small size of sensor nodes and limited power sources, energy saving is critical for ensuring network longevity. Protocols in different layers consume energy for their function. It is possible to significantly reduce energy usage by including energy-efficiency measures in the protocol design. Most protocols in the literature focus on the energy efficiency in individual layers. Recent studies have shown that cross-layer designs are more energy efficient than individual layer designs. Therefore, this study presents a cross-layer protocol design that combines network and data link layers to minimize energy consumption. This article proposes a novel "Quality of Service Based Cross-layer (QSCL) Protocol" by combining the IEEE 802.15.4-based MAC protocol and the LEACH-based routing protocol. The dynamic duty cycle of the IEEE 802.15.4 protocol was modified based on the amount of data present in the node, which minimized the energy consumption of the data-transfer mechanism. The cluster head (CH) selection of the LEACH-based protocol was modified to consider the average residual energy (RE) of the nodes and their distance from the sink. This helps preserve the energy in the CH, thereby extending the network lifetime. Simulation studies demonstrated that the proposed QSCL outperformed the existing protocols by prolonging the network lifetime
    corecore