Search CORE

6,848 research outputs found

Data mining and fusion

Author: Addis M. J.
Choi F.
Taylor S. J.
Upstill C.
Watkins E. R.
Publication venue: s.n.
Publication date: 01/04/2006
Field of study

Southampton (e-Prints Soton)

Service-Oriented Data Mining

Author: Derya Birant
Publication venue: 'IntechOpen'
Publication date: 21/01/2011
Field of study

IntechOpen

Data Mining - A Review and Description

Author: Nancy, Jasdeep Kaur, Ramneet Kaur, Nishu
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 31/07/2013
Field of study

Data mining is a powerful and new technique with great potential. It converts the raw data into the useful informati on. Data Mining is the process of extracting knowledge fr om data warehouses. To store databases, enterprises make data warehouses and data marts. Data warehouses and data marts contain large amounts of data. Due to extracting knowledge from large data warehou ses or depositories, data mining plays great role in various fields of machine learning, advancements in static, database system, pattern matching, and artificial intelligence. Various algorithms and programs are used for data mining approach

International Journal on Recent and Innovation Trends in Computing and Communication

A versatile programming model for dynamic task scheduling on cluster computers

Author: Jin Dejiang
Publication venue: Digital Commons @ NJIT
Publication date: 31/05/2005
Field of study

This dissertation studies the development of application programs for parallel and distributed computer systems, especially PC clusters. A methodology is proposed to increase the efficiency of code development, the productivity of programmers and enhance performance of executing the developed programs on PC clusters while facilitating improvement of scalability and code portability of these programs. A new programming model, named the Super-Programming Model (SPM), is created. Programs are developed assuming an instruction set architecture comprised of SuperInstructions (SIs). SPM models the target system as a large Virtual Machine (VM); VM contains functional units which are underlain with sub-computer systems and SIs are implemented with codes. When these functional units execute SIs, their codes will run on member computers to perform the corresponding operations. This approach resembles the process of designing instruction sets for microprocessors but the VM employs much coarser instructions and data structures. SIs use Super-Data Blocks (SDBs) as their operands. Each SI is assigned to a single member computer and is indivisible (i.e., its implementation is not interrupted for I/O). SIs have predictable execution times because SDB sizes are limited by predefined thresholds. These qualities of SIs help dynamic load balancing. Employing software to implement instructions makes this approach more flexible. The developed programs fit to architectures of cluster systems better. SPM provides mechanisms, such as dynamic load balancing, to assure the efficient execution of programs. The vast majority of current programming models lack such mechanisms for distributed environments that suffer from long communication latencies. Since SPM employs coarse-grain tasks, the overall management overhead is small. SDB access can often overlap the execution of other SIs; a cache system further decreases average memory latencies. Since all SDBs are virtual entities, with the runtime system support, they can be accessed in parallel and efficiently minimizes additional constraints to parallelism from underlying computer systems. In this research, a reference implementation of VM has been developed. A performance estimation model is developed that takes these features into account. Finally, the definition of scalability for parallel/distributed processing is refined to represent a multi-dimensional entity. Sample cases are analyzed

Digital Commons @ New Jersey Institute of Technology (NJIT)

A service oriented architecture to provide data mining services for non-expert data miners

Author: García Saiz Diego
Zorrilla Pantaleón Marta E.
Publication venue: 'Elsevier BV'
Publication date: 01/04/2013
Field of study

In today's competitive market, companies need to use discovery knowledge techniques to make better, more informed decisions. But these techniques are out of the reach of most users as the knowledge discovery process requires an incredible amount of expertise. Additionally, business intelligence vendors are moving their systems to the cloud in order to provide services which offer companies cost-savings, better performance and faster access to new applications. This work joins both facets. It describes a data mining service addressed to non-expert data miners which can be delivered as Software-as-a-Service. Its main advantage is that by simply indicating where the data file is, the service itself is able to perform all the process. © 2012 Elsevier B.V. All rights reserved

UCrea