645 research outputs found

    CUBOS: An Internal Cluster Validity Index for Categorical Data

    Get PDF
    Internal cluster validity index is a powerful tool for evaluating clustering performance. The study on internal cluster validity indices for categorical data has been a challenging task due to the difficulty in measuring distance between categorical attribute values. While some efforts have been made, they ignore the relationship between different categorical attribute values and the detailed distribution information between data objects. To solve these problems, we propose a novel index called Categorical data cluster Utility Based On Silhouette (CUBOS). Specifically, we first make clear the superiority of the paradigm of Silhouette index in exploring the details of clustering results. Then, we raise the Improved Distance metric for Categorical data (IDC) inspired by Category Distance to measure distance between categorical data exactly. Finally, the paradigm of Silhouette index and IDC are combined to construct the CUBOS, which can overcome the aforementioned shortcomings and produce more accurate evaluation results than other baselines, as shown by the experimental results on several UCI datasets

    Auto Insurance Business Analytics Approach for Customer Segmentation Using Multiple Mixed-Type Data Clustering Algorithms

    Get PDF
    Customer segmentation is critical for auto insurance companies to gain competitive advantage by mining useful customer related information. While some efforts have been made for customer segmentation to support auto insurance decision making, their customer segmentation results tend to be affected by the characteristics of the algorithm used and lack multiple validation from multiple algorithms. To this end, we propose an auto insurance business analytics approach that segments customers by using three mixed-type data clustering algorithms including k-prototypes, improved k-prototypes and similarity-based agglomerative clustering. The customer segmentation results of these algorithms can complement and reinforce each other and demonstrate as much information as possible to support decision-making. To confirm its practical value, the proposed approach extracts seven rules for an auto insurance company that may support the company to make customer related decisions and develop insurance products

    A finite-difference method for the one-dimensional time-dependent schrƶdinger equation on unbounded domain

    Get PDF
    AbstractA finite-difference scheme is proposed for the one-dimensional time-dependent Schrƶdinger equation. We introduce an artificial boundary condition to reduce the original problem into an initial-boundary value problem in a finite-computational domain, and then construct a finite-difference scheme by the method of reduction of order to solve this reduced problem. This scheme has been proved to be uniquely solvable, unconditionally stable, and convergent. Some numerical examples are given to show the effectiveness of the scheme

    Design and Development of Variable Pitch Quadcopter for Long Endurance Flight

    Get PDF
    The variable pitch quadrotor is not a new concept but has been largely ignored in small unmanned aircraft, unlike the fixed pitch quadcopter which is controlled only by changing the RPM of the motors and only has about 30 minutes of total flight time. The variable pitch quadrotor can be controlled either by the change of the motor RPM or rotor blade pitch angle or by the combination of both. This gives the variable pitch quadrotor potential advantages in payload, maneuverability and long endurance flight. This research is focused on the design methodology for a variable pitch quadrotor using a single motor with potential applications for a long endurance flight. This variable pitch quadcopter uses a single power plant to power all four rotors through a power transmission system. All four rotors have the same rpm but vary the blade pitch angle to control its attitude in the air. A proof of concept variable pitch quadcopter is developed for testing the drivetrain mechanism on the vehicle and evaluating performance of the vehicle through numbers of testing.Mechanical and Aerospace Engineerin

    Understanding the Evaluation Abilities of External Cluster Validity Indices to Internal Ones

    Get PDF
    Evaluating internal Cluster Validity Index (CVI) is a critical task in clustering research. Existing studies mainly employ the number of clusters (NC-based method) or external CVIs (external CVIs-based method) to evaluate internal CVIs, which are not always reasonable in all scenarios. Additionally, there is no guideline of choosing appropriate methods to evaluate internal CVIs in different cases. In this paper, we focus on the evaluation abilities of external CVIs to internal CVIs, and propose a novel approach, named external CVI\u27s evaluation Ability MEasurement approach through Ranking consistency (CAMER), to measure the evaluation abilities of external CVIs quantitatively, for assisting in selecting appropriate external CVIs to evaluate internal CVIs. Specifically, we formulate the evaluation ability measurement problem as a ranking consistency task, by measuring the consistency between the evaluation results of external CVIs to internal CVIs and the ground truth performance of internal CVIs. Then, the superiority of CAMER is validated through a real-world case. Moreover, the evaluation abilities of seven popular external CVIs to internal CVIs in six different scenarios are explored by CAMER. Finally, these explored evaluation abilities are validated on four real-world datasets, demonstrating the effectiveness of CAMER

    Clustering Algorithm Based on Sparse Feature Vector without Specifying Parameter

    Get PDF
    Parameter setting is an essential factor affecting algorithm performance in data mining techniques. CABOSFV is an efficient clustering algorithm which can cluster binary data with sparse features, but it is challenging to specify the threshold parameter. To solve the difficulty of parameter decision, a clustering algorithm based on sparse feature vector without specifying parameter (CASP) is proposed in this paper. The calculation method of an upper limit of threshold is firstly defined to determine the range of threshold. Furthermore, we use the sparseness index to sort the data and conduct the clustering process based on the adjusted sparse feature vector after data sorting. An interval search strategy is adopted to find a suitable threshold within the defined threshold range, and the clustering result with the selected suitable parameter is the outcome. Experiments on 7 UCI datasets demonstrate that the clustering results of the CASP algorithm are superior to other baselines in terms of both effectiveness and efficiency. CASP not only simplifies the parameter decision process, but also obtains desirable clustering results quickly and stably, which shows the practicability of the algorithm

    The effect of PFAS exposure on glucolipid metabolism in children and adolescents: a meta-analysis

    Get PDF
    BackgroundPrevious studies showed that per- and polyfluoroalkyl substances (PFAS), which are widely found in the environment, can disrupt endocrine homeostasis when they enter the human body. This meta-analysis aimed to evaluate current human epidemiological evidence on the relationship between PFAS exposure and glucolipid metabolism in childhood and adolescence.MethodsWe searched PubMed, Web of Science, Embase, and Cochrane Library databases, and identified population-based epidemiological studies related to PFAS and glucolipid metabolism indexes that were published before 30 December 2022. The heterogeneity of the included literature was assessed using the I-square (I2) test and statistics Q. Random-effects and fixed-effects models were used to combine the effect size. Subgroup analysis based on age and sex of the study participants was performed. A sensitivity analysis was used to evaluate the robustness and reliability of the combined results. Eggerā€™s and Beggā€™s tests were used to analyze publication bias.ResultsA total of 12 studies were included in this analysis. There was a positive association between PFAS and TC (Ī² = 1.110, 95% CI: 0.601, 1.610) and LDL (Ī² = 1.900, 95% CI: 1.030, 2.770), and a negative association between PFAS and HOMA-IR in children and adolescents (Ī² = āˆ’0.130, 95% CI: āˆ’0. 200, āˆ’0.059). PFOS was significant positive associated with TC (Ī² = 8.22, 95% CI: 3.93, 12.51), LDL (Ī² = (12.04, 95% CI: 5.08, 18.99), and HOMA-IR (Ī² = āˆ’0.165, 95% CI: āˆ’0.292, āˆ’0.038). Subgroup analysis showed that exposure to PFAS in the adolescent group was positively associated with TC and LDL levels, and the relationship was stronger in females.ConclusionPFAS exposure is associated with glucolipid metabolism in children and adolescents. Among them, PFOS may play an important role. Recognition of environmental PFAS exposure is critical for stabilizing the glycolipid metabolism relationship during the growth and development of children and adolescents
    • ā€¦
    corecore