2,250 research outputs found

    Using Conservative Estimation for Conditional Probability instead of Ignoring Infrequent Case

    Full text link
    There are several estimators of conditional probability from observed frequencies of features. In this paper, we propose using the lower limit of confidence interval on posterior distribution determined by the observed frequencies to ascertain conditional probability. In our experiments, this method outperformed other popular estimators.Comment: The 2016 International Conference on Advanced Informatics: Concepts, Theory and Application (ICAICTA2016

    A survey on utilization of data mining approaches for dermatological (skin) diseases prediction

    Get PDF
    Due to recent technology advances, large volumes of medical data is obtained. These data contain valuable information. Therefore data mining techniques can be used to extract useful patterns. This paper is intended to introduce data mining and its various techniques and a survey of the available literature on medical data mining. We emphasize mainly on the application of data mining on skin diseases. A categorization has been provided based on the different data mining techniques. The utility of the various data mining methodologies is highlighted. Generally association mining is suitable for extracting rules. It has been used especially in cancer diagnosis. Classification is a robust method in medical mining. In this paper, we have summarized the different uses of classification in dermatology. It is one of the most important methods for diagnosis of erythemato-squamous diseases. There are different methods like Neural Networks, Genetic Algorithms and fuzzy classifiaction in this topic. Clustering is a useful method in medical images mining. The purpose of clustering techniques is to find a structure for the given data by finding similarities between data according to data characteristics. Clustering has some applications in dermatology. Besides introducing different mining methods, we have investigated some challenges which exist in mining skin data

    Internet Medical Privacy Disclosure Mining and Prediction Model Construction Based on Association Rules

    Get PDF
    In recent years, China\u27s Internet medical industry has developed rapidly and the market scale has been expanding. Medical privacy is an important research point in the Internet medical field. If the patient cannot fully communicate with the doctor on the other end of the Internet, then it is obvious that the patient will not be well treated. Then it becomes very worthwhile to mine the factors affecting patients\u27 privacy disclosure and predict patients\u27 disclosure behavior. This paper uses the classical and improved multidimensional Apriori (MD-Apriori) to mine patient privacy disclosure factors, which proves that the improved MD-Apriori algorithm is more applicable in this study. In order to prove the validity and authority of the mining results, this paper uses SPSS to analyze 331 valid questionnaires. The results show that the privacy disclosure factors obtained by the two methods are almost the same. Finally, based on the above factors, we establish the Internet medical privacy disclosure intention prediction model, in order to guide the construction and improvement of internet medical

    Frequent pattern growth algorithm for maximizing display items

    Get PDF
    Products are goods that are available and provided in stores for sale. Products provided in stores must be arranged properly to order to attract the attention of consumers to buy. Products arranged in a store will depend on the type of store. The product arrangement at a retail store will be different from the product arrangement at a clothing store. Store display will reflect a picture that is in the store so consumers know the types of products sold by product arrangement. An attractive arrangement will stimulate the desire of consumers to buy. In data mining there are several types of methods by use including prediction, association, classification and estimation. In the prediction method there are several techniques including the frequent pattern growth (FP-growth) method. FP-growth algorithm is the development of the apriori algorithm. So, the shortcomings of the apriori algorithm are corrected by the FP-growth algorithm. FP-growth is one alternative algorithm that can be used to determine the set of data that most often appears (frequent itemset) in a data set. Results of research on the application of the FP-growth algorithm to maximizing the display of goods. It is hoped that this research can be used to adjust the product layout according to the level of frequency the product is sought by the customer so that the customer has no difficulty finding the product they want

    Introduction of Empirical Topology in Construction of Relationship Networks of Informative Objects

    Full text link
    Understanding the structure of relationships between objects in a given database is one of the most important problems in the field of data mining. The structure can be defined for a set of single objects (clustering) or a set of groups of objects (network mapping). We propose a method for discovering relationships between individuals (single or groups) that is based on what we call the empirical topology, a system-theoretic measure of functional proximity. To illustrate the suitability and efficiency of the method, we apply it to an astronomical data base
    corecore