1,897 research outputs found

    Cost-Sensitive Learning-based Methods for Imbalanced Classification Problems with Applications

    Get PDF
    Analysis and predictive modeling of massive datasets is an extremely significant problem that arises in many practical applications. The task of predictive modeling becomes even more challenging when data are imperfect or uncertain. The real data are frequently affected by outliers, uncertain labels, and uneven distribution of classes (imbalanced data). Such uncertainties create bias and make predictive modeling an even more difficult task. In the present work, we introduce a cost-sensitive learning method (CSL) to deal with the classification of imperfect data. Typically, most traditional approaches for classification demonstrate poor performance in an environment with imperfect data. We propose the use of CSL with Support Vector Machine, which is a well-known data mining algorithm. The results reveal that the proposed algorithm produces more accurate classifiers and is more robust with respect to imperfect data. Furthermore, we explore the best performance measures to tackle imperfect data along with addressing real problems in quality control and business analytics

    Assessing and augmenting SCADA cyber security: a survey of techniques

    Get PDF
    SCADA systems monitor and control critical infrastructures of national importance such as power generation and distribution, water supply, transportation networks, and manufacturing facilities. The pervasiveness, miniaturisations and declining costs of internet connectivity have transformed these systems from strictly isolated to highly interconnected networks. The connectivity provides immense benefits such as reliability, scalability and remote connectivity, but at the same time exposes an otherwise isolated and secure system, to global cyber security threats. This inevitable transformation to highly connected systems thus necessitates effective security safeguards to be in place as any compromise or downtime of SCADA systems can have severe economic, safety and security ramifications. One way to ensure vital asset protection is to adopt a viewpoint similar to an attacker to determine weaknesses and loopholes in defences. Such mind sets help to identify and fix potential breaches before their exploitation. This paper surveys tools and techniques to uncover SCADA system vulnerabilities. A comprehensive review of the selected approaches is provided along with their applicability

    Deep learning predicts function of live retinal pigment epithelium from quantitative microscopy.

    Get PDF
    Increases in the number of cell therapies in the preclinical and clinical phases have prompted the need for reliable and non-invasive assays to validate transplant function in clinical biomanufacturing. We developed a robust characterization methodology composed of quantitative bright-field absorbance microscopy (QBAM) and deep neural networks (DNNs) to non-invasively predict tissue function and cellular donor identity. The methodology was validated using clinical-grade induced pluripotent stem cell derived retinal pigment epithelial cells (iPSC-RPE). QBAM images of iPSC-RPE were used to train DNNs that predicted iPSC-RPE monolayer transepithelial resistance, predicted polarized vascular endothelial growth factor (VEGF) secretion, and matched iPSC-RPE monolayers to the stem cell donors. DNN predictions were supplemented with traditional machine learning algorithms that identified shape and texture features of single cells that were used to predict tissue function and iPSC donor identity. These results demonstrate non-invasive cell therapy characterization can be achieved with QBAM and machine learning

    The determination of ground granulated concrete compressive strength based machine learning models

    Get PDF
    The advancement of machine learning (ML) models has received remarkable attention by several science and engineering applications. Within the material engineering, ML models are usually utilized for building an expert system for supporting material design and attaining an optimal formulation material sustainability and maintenance. The current study is conducted on the based of the utilization of ML models for modeling compressive strength (Cs) of ground granulated blast furnace slag concrete (GGBFSC). Random Forest (RF) model is developed for this purpose. The predictive model is constructed based on multiple correlated properties for the concrete material including coarse aggregate (CA), curing temperature (T), GGBFSC to total binder ratio (GGBFSC/B), water to binder ratio (w/b), water content (W), fine aggregate (FA), superplasticizer (SP). A total of 268 experimental dataset are gather form the open-source previous published researches, are used to build the predictive model. For the verification purpose, a predominant ML model called support vector machine (SVM) is developed. The efficiency of the proposed predictive and the benchmark models is evaluated using statistical formulations and graphical presentation. Based on the attained prediction accuracy, RF model demonstrated an excellent performance for predicting the Cs using limited input parameters. Overall, the proposed methodology showed an exceptional predictive model that can be utilized for modeling compressive strength of GGBFSC

    Light-Weight Structural Optimization Through Biomimicry, Machine Learning, and Inverse Design

    Get PDF
    In load-bearing lightweight architectures, cellular materials were frequently utilized. While octahedron, tetrahedron, and octet truss lattice truss were built for lightweight architectures with stretching and flexural dominance, it can be believed that new cells could easily be designed that might perform much better than the present ones in terms of mechanical and architectural characteristics. Machine learning-based structure scouting and design improvisation for better mechanical performance is a growing field of study. Additionally, biomimicry—the science of imitating nature’s elements—offers people a wealth of resources from which to draw motivation as they work to create a better quality of life. Here, utilizing machine learning approaches, novel lattice truss unit cellular architectures with enhanced architectural characteristics were designed. An inverse design methodology employing generative adversarial networks is suggested to investigate and improvise the lightweight lattice truss unit cellular architectures. The proposed framework was utilized to identify various lattice truss unit cellular architectures with load carrying capacities 40–120% greater than those of octet unit cells. A further 130–160% raise in buckling load bearing capacity was made possible by substituting porous biomimicry columns for the solid trusses in the light-weight lattice truss unit cellular architectures. This dissertation\u27s main goal is to investigate various improvisation strategies for creating lightweight architectures, particularly when using data analysis and machine learning methods. Lightweight cellular architectures with thin-walls and lattice truss unit cellular architectures with improved shape memory capabilities were created using the knowledge gleaned from numerous of the research projects mentioned in the preceding paragraphs load-bearing architectures and devices, lightweight architecture with shape memory and with better strength, better stretchability, and better elastic stress recovery are widely desired. As compared to the bulk shape memory polymeric cylinders, the cellular architectures with thin walls show 200% betterer elastic stress recovery that is normalized with respect to base designs. The architectural improvisation of many other additional designs and practical implementation can be accomplished using the inverse design framework

    Classification of customer call details records using Support Vector Machine (SVMs) and Decision Tree (DTs)

    Get PDF
    On a daily basis, telecom businesses create a massive amount of data. Decision-makers underlined that acquiring new customers is more difficult than maintaining current ones. Further, existing churn customers' data may be used to identify churn consumers as well as their behavior patterns. This study provides a churn prediction model for the telecom industry that employs SVMs and DTs to detect churn customers. The suggested model uses classification techniques to churn customers' data, with the Support Vector Machine (SVMs) method performing well 98.36 % properly categorized instances) and the Decision Tree (DTs) approach performing poorly 33.04 % and the decision tree algorithm deliver outstanding results
    • …
    corecore