2,250 research outputs found
Using Conservative Estimation for Conditional Probability instead of Ignoring Infrequent Case
There are several estimators of conditional probability from observed
frequencies of features. In this paper, we propose using the lower limit of
confidence interval on posterior distribution determined by the observed
frequencies to ascertain conditional probability. In our experiments, this
method outperformed other popular estimators.Comment: The 2016 International Conference on Advanced Informatics: Concepts,
Theory and Application (ICAICTA2016
A survey on utilization of data mining approaches for dermatological (skin) diseases prediction
Due to recent technology advances, large volumes of medical data is obtained. These data contain valuable information. Therefore data mining techniques can be used to extract useful patterns. This paper is intended to introduce data mining and its various techniques and a survey of the available literature on medical data mining. We emphasize mainly on the application of data mining on skin diseases. A categorization has been provided based on the different data mining techniques. The utility of the various data mining methodologies is highlighted. Generally association mining is suitable for extracting rules. It has been used especially in cancer diagnosis. Classification is a robust method in medical mining. In this paper, we have summarized the different uses of classification in dermatology. It is one of the most important methods for diagnosis of erythemato-squamous diseases. There are different methods like Neural Networks, Genetic Algorithms and fuzzy classifiaction in this topic. Clustering is a useful method in medical images mining. The purpose of clustering techniques is to find a structure for the given data by finding similarities between data according to data characteristics. Clustering has some applications in dermatology. Besides introducing different mining methods, we have investigated some challenges which exist in mining skin data
Internet Medical Privacy Disclosure Mining and Prediction Model Construction Based on Association Rules
In recent years, China\u27s Internet medical industry has developed rapidly and the market scale has been expanding. Medical privacy is an important research point in the Internet medical field. If the patient cannot fully communicate with the doctor on the other end of the Internet, then it is obvious that the patient will not be well treated. Then it becomes very worthwhile to mine the factors affecting patients\u27 privacy disclosure and predict patients\u27 disclosure behavior. This paper uses the classical and improved multidimensional Apriori (MD-Apriori) to mine patient privacy disclosure factors, which proves that the improved MD-Apriori algorithm is more applicable in this study. In order to prove the validity and authority of the mining results, this paper uses SPSS to analyze 331 valid questionnaires. The results show that the privacy disclosure factors obtained by the two methods are almost the same. Finally, based on the above factors, we establish the Internet medical privacy disclosure intention prediction model, in order to guide the construction and improvement of internet medical
Frequent pattern growth algorithm for maximizing display items
Products are goods that are available and provided in stores for sale. Products provided in stores must be arranged properly to order to attract the attention of consumers to buy. Products arranged in a store will depend on the type of store. The product arrangement at a retail store will be different from the product arrangement at a clothing store. Store display will reflect a picture that is in the store so consumers know the types of products sold by product arrangement. An attractive arrangement will stimulate the desire of consumers to buy. In data mining there are several types of methods by use including prediction, association, classification and estimation. In the prediction method there are several techniques including the frequent pattern growth (FP-growth) method. FP-growth algorithm is the development of the apriori algorithm. So, the shortcomings of the apriori algorithm are corrected by the FP-growth algorithm. FP-growth is one alternative algorithm that can be used to determine the set of data that most often appears (frequent itemset) in a data set. Results of research on the application of the FP-growth algorithm to maximizing the display of goods. It is hoped that this research can be used to adjust the product layout according to the level of frequency the product is sought by the customer so that the customer has no difficulty finding the product they want
Introduction of Empirical Topology in Construction of Relationship Networks of Informative Objects
Understanding the structure of relationships between objects in a given
database is one of the most important problems in the field of data mining. The
structure can be defined for a set of single objects (clustering) or a set of
groups of objects (network mapping). We propose a method for discovering
relationships between individuals (single or groups) that is based on what we
call the empirical topology, a system-theoretic measure of functional
proximity. To illustrate the suitability and efficiency of the method, we apply
it to an astronomical data base
- …