ADVANCES IN KNOWLEDGE DISCOVERY IN DATABASES
The field of Knowledge Discovery in Databases and Data Mining develops methods and techniques for extracting useful meaning from data stored in databases. It draws on research from many fields, including machine learning, pattern recognition, databases, statistics, artificial intelligence, knowledge acquisition for expert systems, data visualization and grids. While Data Mining denotes a set of specific algorithms for finding useful meaning in stored data, Knowledge Discovery in Databases denotes the overall process of finding knowledge and includes Data Mining as one step among others, such as selection, pre-processing, transformation and interpretation of mined data. This paper aims to point out the most important steps taken in the Knowledge Discovery in Databases field and to show how the overall discovery process can be improved in the future.
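The steps named in this abstract (selection, pre-processing, transformation, data mining, interpretation) can be illustrated with a toy pipeline. This is only a sketch under assumptions: the function names, the `title` field and the frequent-token "mining" step are hypothetical choices made for illustration and are not taken from the paper.

```python
from collections import Counter


def select(records, field):
    """Selection: keep only the field of interest from records that have it."""
    return [r[field] for r in records if field in r]


def preprocess(values):
    """Pre-processing: normalise whitespace and case, drop empty values."""
    cleaned = (v.strip().lower() for v in values)
    return [v for v in cleaned if v]


def transform(values):
    """Transformation: turn each value into a set of word tokens."""
    return [set(v.split()) for v in values]


def mine(token_sets, min_support):
    """Data mining: find tokens that occur in at least min_support records."""
    counts = Counter(tok for tokens in token_sets for tok in tokens)
    return {tok: n for tok, n in counts.items() if n >= min_support}


def interpret(patterns):
    """Interpretation: rank the mined patterns for a human reader."""
    return sorted(patterns.items(), key=lambda kv: (-kv[1], kv[0]))


def kdd(records, field, min_support):
    """Run the whole process; data mining is just one step among five."""
    values = preprocess(select(records, field))
    return interpret(mine(transform(values), min_support))
```

Calling `kdd(records, "title", 2)` on a list of dictionaries returns the tokens that appear in at least two titles, most frequent first. The point of the sketch is the shape of the process, with mining as a single stage, rather than the particular mining technique.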
Knowledge Discovery in Databases
Knowledge discovery in databases (KDD) revolves around the investigation and creation of knowledge, processes, algorithms, and mechanisms for retrieving potential knowledge from data collections. Related issues include data collection, database design, the description of database entries using the most appropriate representation, and data quality. This article is an introductory overview of knowledge discovery in databases. The rationale and environment of its development and applications are discussed. Issues related to database design and collection are reviewed.
DATA MINING TECHNOLOGIES
Knowledge discovery and data mining (KDD) has emerged as a rapidly growing interdisciplinary field that merges databases, statistics and closely related industries, driven by the desire to extract as much valuable information and knowledge as possible. There is a difference between "knowledge discovery" and "data mining": knowledge discovery in databases is the process of identifying patterns in data that are valid, novel, useful and, ultimately, understandable.
Keywords: data mining, knowledge discovery, data warehouse, data mining tools, data mining applications
Knowledge discovery and modeling in genomic databases
This dissertation research is targeted toward developing effective and accurate methods for identifying gene structures in the genomes of higher eukaryotes, such as vertebrate organisms. Several effective hidden Markov models (HMMs) are developed to represent the consensus and degeneracy features of the functional sites, including protein-translation start sites and mRNA splicing junction donor and acceptor sites in vertebrate genes. The HMM system based on the developed models is fully trained using an expectation maximization (EM) algorithm, and the system performance is evaluated using a 10-way cross-validation method. Experimental results show that the proposed HMM system achieves high sensitivity and specificity in detecting the functional sites.
This HMM system is then incorporated into a new gene detection system, called GeneScout. The main hypothesis is that, given a vertebrate genomic DNA sequence S, it is always possible to construct a directed acyclic graph G such that the path for the actual coding region of S is in the set of all paths on G. Thus, the gene detection problem is reduced to the analysis of paths in the graph G. A dynamic programming algorithm is employed by GeneScout to find the optimal path in G. Experimental results on the standard test dataset collected by Burset and Guigo indicate that GeneScout is comparable to existing gene discovery tools and complements the widely used GenScan system.
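The idea of finding an optimal path in a directed acyclic graph by dynamic programming can be sketched as follows. This is a generic illustration, not GeneScout's actual algorithm: the abstract does not say how G is built or scored, so the edge weights, the node names and the "highest total score" objective are all assumptions made here.

```python
from collections import defaultdict


def best_path(edges, source, sink):
    """Return (score, path) for the highest-scoring source-to-sink path.

    edges is a list of (u, v, weight) tuples describing a graph that is
    assumed to be acyclic. Returns None if sink is unreachable from source.
    """
    graph = defaultdict(list)
    indegree = defaultdict(int)
    nodes = set()
    for u, v, w in edges:
        graph[u].append((v, w))
        indegree[v] += 1
        nodes.update((u, v))

    # Topological order (Kahn's algorithm), so every node is processed
    # only after all of its predecessors.
    order = []
    ready = [n for n in nodes if indegree[n] == 0]
    while ready:
        u = ready.pop()
        order.append(u)
        for v, _ in graph[u]:
            indegree[v] -= 1
            if indegree[v] == 0:
                ready.append(v)

    # Dynamic programming: best[v] is the best score of any path
    # from source to v found so far; prev[v] remembers how we got there.
    best = {source: 0.0}
    prev = {}
    for u in order:
        if u not in best:
            continue  # u is not reachable from source
        for v, w in graph[u]:
            if v not in best or best[u] + w > best[v]:
                best[v] = best[u] + w
                prev[v] = u

    if sink not in best:
        return None

    # Walk the back-pointers from sink to source to recover the path.
    path = [sink]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return best[sink], path[::-1]


# Hypothetical toy graph: nodes stand in for candidate sequence segments.
EDGES = [
    ("start", "exon1", 2.0),
    ("start", "exon2", 1.0),
    ("exon1", "exon3", 1.5),
    ("exon2", "exon3", 3.0),
    ("exon3", "end", 0.5),
    ("exon1", "end", 1.0),
]
```

With these toy weights, `best_path(EDGES, "start", "end")` returns `(4.5, ["start", "exon2", "exon3", "end"])`. Because the graph is acyclic, each edge is relaxed exactly once, so the search is linear in the size of G.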
Knowledge discovery in spatial databases: the PADRÃO’s qualitative approach
Knowledge discovery in databases is a complex process concerned with the discovery of relationships and other descriptions from data. Knowledge discovery in spatial databases represents a particular case of discovery, allowing the discovery of relationships that exist between spatial and non-spatial data, and other data characteristics that aren’t explicitly stored in spatial databases.
This paper describes the conception and implementation of PADRÃO, a system for knowledge discovery in spatial databases. PADRÃO presents a new approach to this process, which is based on qualitative spatial reasoning. The spatial semantic knowledge and the principles of qualitative spatial reasoning needed for the spatial reasoning process are available in PADRÃO’s geographic database and PADRÃO’s spatial knowledge base, allowing the integration of the geo-spatial component, associated with the analysed non-geographic data, in the process of knowledge discovery.
Combining expert knowledge and databases for risk management
Correctness, transparency and effectiveness are the principal attributes of knowledge derived from databases. In current data mining research there is a focus on efficiency improvement of algorithms for knowledge discovery. However, important limitations of data mining can only be dissolved by the integration of knowledge of experts in the field, encoded in some accessible way, with knowledge derived from patterns in the database. In this paper we will in particular discuss methods for combining expert knowledge and knowledge derived from transaction databases. The framework proposed is applicable to a wide variety of risk management problems. We will illustrate the method in a case study on fraud discovery in an insurance company.
Keywords: risk management; data mining; knowledge discovery; knowledge based systems