Data Mining Decision Trees in Economy

Authors: Badulescu Laviniu-Aurelian; Nicula Adrian

Data mining is the extraction of previously unknown and potentially useful information from data. Using decision-tree data mining techniques, our investigation illustrates how to extract meaningful socio-economic knowledge from large data sets. Our tests find 5 attribute selection measures that perform more accurately than the best of the 17 algorithms presented in the literature.

Keywords: Data Mining, Decision Trees, classification error rate

Research Papers in Economics

Region Based Data Mining on Agriculture Data

Author: Battu Babitha
Publication venue: North Dakota State University
Publication date: 01/01/2015

Spatial data mining is the process of discovering interesting, previously unknown, and potentially useful patterns in large spatial databases. Most relationships in spatial datasets are regional, so there is a great need for regional regression methods that reflect the different spatial characteristics of different regions. A central challenge in spatial data mining is the efficiency of its algorithms, given the often huge amount of spatial data and the complexity of spatial data types and spatial access methods. This paper proposes a regional regression technique for regions that are defined by a categorical attribute, in particular soil type. The result is a series of regions grouped hierarchically according to their similarity.

NDSU Libraries Institutional Repository

Intelligent data analysis - support for development of SMEs sector

Authors: Bosnjak Sasa; Bosnjak Zita; Grljevic Olivera

The paper studies the possibilities of applying intelligent data analysis to discover knowledge hidden in the data of small and medium-sized enterprises (SMEs) in the province of Vojvodina. The knowledge revealed by intelligent analysis, and not accessible by any other means, could be a valuable starting point for working out proactive and preventive actions for the development of the SME sector.

Keywords: Intelligent data analysis, CRISP-DM, clustering, small and medium enterprises, Research and Development/Tech Change/Emerging Technologies, C8, L2

Research Papers in Economics

Data Mining Applications in Big Data

Authors: Wang Guanghui; Wang Lidong
Publication venue: Faculty of Computer Science, Sriwijaya University
Publication date: 20/10/2015

Data mining is the process of extracting hidden, unknown, but potentially useful information from massive data. Big Data has a great impact on scientific discovery and value creation. This paper introduces methods in data mining and technologies in Big Data. It discusses the challenges of data mining and of data mining with Big Data, and presents some recent technological progress in both areas.

ComEngApp-Journal

Directory of Open Access Journals

Computer Engineering and Applications Journal (ComEngApp, Universitas Sriwijaya)

Data mining in Cloud Computing

Author: Ruxandra-Ştefania PETRE
Publication venue: Bucharest Academy of Economic Studies Publishing House
Publication date: 01/10/2012

This paper describes how data mining is used in cloud computing. Data mining is used for extracting potentially useful information from raw data. The integration of data mining techniques into normal day-to-day activities has become commonplace. Every day people are confronted with targeted advertising, and data mining techniques help businesses become more efficient by reducing costs. Data mining techniques and applications are very much needed in the cloud computing paradigm. Implementing data mining techniques through cloud computing will allow users to retrieve meaningful information from a virtually integrated data warehouse, which reduces the costs of infrastructure and storage.

Directory of Open Access Journals

New probabilistic interest measures for association rules

Authors: Hahsler Michael; Hornik Kurt
Publication date: 07/02/2008

Mining association rules is an important technique for discovering meaningful patterns in transaction databases. Many different measures of interestingness have been proposed for association rules. However, these measures fail to take the probabilistic properties of the mined data into account. In this paper, we start by presenting a simple probabilistic framework for transaction data which can be used to simulate transaction data when no associations are present. We use such data and a real-world database from a grocery outlet to explore the behavior of confidence and lift, two popular interest measures used for rule mining. The results show that confidence is systematically influenced by the frequency of the items in the left-hand side of rules and that lift performs poorly at filtering random noise in transaction data. Based on the probabilistic framework we develop two new interest measures, hyper-lift and hyper-confidence, which can be used to filter or order mined association rules. The new measures show significantly better performance than lift for applications where spurious rules are problematic.
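For readers unfamiliar with the two baseline measures named in this abstract, the following is a minimal sketch of the standard textbook definitions of confidence and lift on a toy transaction list. The function names and the sample data are illustrative assumptions, not taken from the paper, and the paper's own hyper-lift and hyper-confidence measures are not reproduced here.

```python
from typing import Iterable, List, Set


def support(transactions: List[Set[str]], itemset: Iterable[str]) -> float:
    """Fraction of transactions that contain every item in `itemset`."""
    items = frozenset(itemset)
    hits = sum(1 for t in transactions if items <= t)
    return hits / len(transactions)


def confidence(transactions: List[Set[str]], lhs: Iterable[str], rhs: Iterable[str]) -> float:
    """Standard confidence of the rule lhs -> rhs: supp(lhs and rhs) / supp(lhs)."""
    lhs, rhs = set(lhs), set(rhs)
    return support(transactions, lhs | rhs) / support(transactions, lhs)


def lift(transactions: List[Set[str]], lhs: Iterable[str], rhs: Iterable[str]) -> float:
    """Standard lift of the rule lhs -> rhs: confidence / supp(rhs)."""
    return confidence(transactions, lhs, rhs) / support(transactions, rhs)


# Hypothetical toy data, chosen only to make the arithmetic easy to follow.
toy_transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

rule_confidence = confidence(toy_transactions, {"bread"}, {"milk"})
rule_lift = lift(toy_transactions, {"bread"}, {"milk"})
```

In this toy data, bread appears in 3 of 4 transactions and bread with milk in 2 of 4, so the confidence of bread -> milk is 2/3; dividing by the support of milk (3/4) gives a lift of 8/9, slightly below the value of 1 expected under independence.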

arXiv.org e-Print Archive

CiteSeerX

Discovery of Frequent Itemsets: Frequent Item Tree-Based Approach

Authors: D. Wahidabanu R. S.; Senthil Kumar A. V.
Publication venue: LPPM ITBis Lembah Dempo
Publication date: 01/09/2013

Mining frequent patterns in large transactional databases is a highly researched area in the field of data mining. Existing frequent pattern discovery algorithms suffer from several problems when mining large amounts of data: high memory dependency and high computational and I/O cost. Additionally, the recursive process used to mine these structures is too voracious in memory resources. In this paper, we describe a more efficient algorithm for mining complete frequent itemsets from transactional databases. The suggested algorithm is partially based on the FP-tree hypothesis and extracts the frequent itemsets directly from the tree. Its memory requirement, which is independent of the number of processed transactions, is another benefit of the new method. We present performance comparisons of our algorithm against the Apriori algorithm and FP-growth.
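To make the task concrete, here is a small sketch of level-wise (Apriori-style) frequent itemset mining, the baseline the abstract compares against. It is not the tree-based algorithm proposed in the paper, and the function name, the `min_count` threshold, and the sample data are assumptions made for illustration.

```python
from itertools import combinations
from typing import Dict, FrozenSet, List, Set


def frequent_itemsets(transactions: List[Set[str]], min_count: int) -> Dict[FrozenSet[str], int]:
    """Return every itemset contained in at least `min_count` transactions."""
    # Level 1: count single items.
    counts: Dict[FrozenSet[str], int] = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_count}
    result = dict(current)

    k = 2
    while current:
        # Build size-k candidates by joining frequent (k-1)-itemsets, and keep
        # only those whose (k-1)-subsets are all frequent (Apriori pruning).
        candidates = set()
        for a, b in combinations(list(current), 2):
            union = a | b
            if len(union) == k and all(
                frozenset(sub) in current for sub in combinations(union, k - 1)
            ):
                candidates.add(union)
        # Count each surviving candidate with one pass over the transactions.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        current = {s: c for s, c in counts.items() if c >= min_count}
        result.update(current)
        k += 1
    return result


# Hypothetical toy data for illustration only.
toy_db = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
found = frequent_itemsets(toy_db, min_count=3)
```

Each level here rescans the whole transaction list, which is the kind of repeated I/O cost that tree-based approaches such as FP-growth are designed to avoid.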

Journal of ICT Research and Applications

Directory of Open Access Journals

ITB Journal

Discovery of Frequent Itemsets: Frequent Item Tree-Based Approach

Authors: D. Wahidabanu R. S.; Senthil Kumar A. V.
Publication venue: The Institute for Research and Community Services (LPPM) ITB
Publication date: 13/09/2013

Journal of ICT Research and Applications