Search CORE

200,411 research outputs found

A review on Data Mining for Indian Online Retail Industry

Author: Ms. Pradnya Muley, Dr. Aniruddha Joshi
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 31/05/2016
Field of study

Data mining, the technique of extracting hidden information from large databases, is a powerful new technology with great potential to help companies focus on the most important information in their data warehouses. Data mining tools predict future trends and behaviors, allowing businesses to make proactive, knowledge-driven decisions. The Indian retail industry has emerged as one of the most dynamic and fast-paced industries due to the entry of several new players in the Indian Retail Sector. It increases the need of different knowledge extraction tools which can extract usable information form large data sets. In this paper we have discussed the Overview of an Indian retail sector, the previous study related to the retailing industry. Also the study presents the gaps identified in the previous work

International Journal on Recent and Innovation Trends in Computing and Communication

Conceptual biology, hypothesis discovery, and text mining: Swanson's legacy

Author: Bekhuis T
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/04/2006
Field of study

Innovative biomedical librarians and information specialists who want to expand their roles as expert searchers need to know about profound changes in biology and parallel trends in text mining. In recent years, conceptual biology has emerged as a complement to empirical biology. This is partly in response to the availability of massive digital resources such as the network of databases for molecular biologists at the National Center for Biotechnology Information. Developments in text mining and hypothesis discovery systems based on the early work of Swanson, a mathematician and information scientist, are coincident with the emergence of conceptual biology. Very little has been written to introduce biomedical digital librarians to these new trends. In this paper, background for data and text mining, as well as for knowledge discovery in databases (KDD) and in text (KDT) is presented, then a brief review of Swanson's ideas, followed by a discussion of recent approaches to hypothesis discovery and testing. 'Testing' in the context of text mining involves partially automated methods for finding evidence in the literature to support hypothetical relationships. Concluding remarks follow regarding (a) the limits of current strategies for evaluation of hypothesis discovery systems and (b) the role of literature-based discovery in concert with empirical research. Report of an informatics-driven literature review for biomarkers of systemic lupus erythematosus is mentioned. Swanson's vision of the hidden value in the literature of science and, by extension, in biomedical digital databases, is still remarkably generative for information scientists, biologists, and physicians. © 2006Bekhuis; licensee BioMed Central Ltd

Springer - Publisher Connector

PubMed Central

D-Scholarship@Pitt

Machine learning and statistical approaches to classification – a case study

Author: Eyoh Imo
John Robert
Publication venue
Publication date: 07/09/2017
Field of study

The advent of information technology has led to the proliferation of data in disparate databases. Organisations have become data rich but knowledge poor. Users need efficient analysis tools to help them understand their data, predict future trends and relationships and generalise to new situations in order to make proactive knowledge-driven decisions in a competitive business world. Thus, there is an urgent need for techniques and tools that intelligently and automatically transform these data into useful information and knowledge for effective decision making. Data mining is considered to be the most appropriate technology for addressing this need. Datamining is the process of extracting or “mining” knowledge from large amounts of data. Regression analysis and classification are two datamining tasks used to predict future trends. In this study, we investigate the behaviour of a statistical model and three machine learning models (artificial neural network, decision tree and support vector machine) on a large electricity dataset. We evaluate their predictive abilities based on this dataset. Results show that machine learning models, for this real world dataset, outperform statistical regression while artificial neural network outperforms support vector machine and decision tree in the classification task. In terms of comprehensibility, decision tree is the best choice. Although not definitive this research indicates that certainly these machine learning methods are an alternative to regression with certain datasets

Nottingham ePrints

Nottingham eTheses

Image mining: trends and developments

Author: Hsu Wynne
Lee Mong Li
Zhang Ji
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2002
Field of study

[Abstract]: Advances in image acquisition and storage technology have led to tremendous growth in very large and detailed image databases. These images, if analyzed, can reveal useful information to the human users. Image mining deals with the extraction of implicit knowledge, image data relationship, or other patterns not explicitly stored in the images. Image mining is more than just an extension of data mining to image domain. It is an interdisciplinary endeavor that draws upon expertise in computer vision, image processing, image retrieval, data mining, machine learning, database, and artificial intelligence. In this paper, we will examine the research issues in image mining, current developments in image mining, particularly, image mining frameworks, state-of-the-art techniques and systems. We will also identify some future research directions for image mining

University of Southern Queensland ePrints

Using an Adaptive Fuzzy Logic System to Optimise Knowledge Discovery in Proteomics

Author: Bowerman Chris
Malone J
McGarry Kenneth
Publication venue: Nottingham Trent University
Publication date: 01/12/2004
Field of study

Sunderland University Institutional Repository

A conceptual analytics model for an outcome-driven quality management framework as part of professional healthcare education

Author: Barman L
Hervatis V
Loe A
O'Donoghue J
Zary N
Publication venue: 'JMIR Publications Inc.'
Publication date: 15/08/2015
Field of study

BACKGROUND: Preparing the future health care professional workforce in a changing world is a significant undertaking. Educators and other decision makers look to evidence-based knowledge to improve quality of education. Analytics, the use of data to generate insights and support decisions, have been applied successfully across numerous application domains. Health care professional education is one area where great potential is yet to be realized. Previous research of Academic and Learning analytics has mainly focused on technical issues. The focus of this study relates to its practical implementation in the setting of health care education. OBJECTIVE: The aim of this study is to create a conceptual model for a deeper understanding of the synthesizing process, and transforming data into information to support educators’ decision making. METHODS: A deductive case study approach was applied to develop the conceptual model. RESULTS: The analytics loop works both in theory and in practice. The conceptual model encompasses the underlying data, the quality indicators, and decision support for educators. CONCLUSIONS: The model illustrates how a theory can be applied to a traditional data-driven analytics approach, and alongside the context- or need-driven analytics approach

PubMed Central

Spiral - Imperial College Digital Repository

Spatial and temporal epidemiological analysis in the Big Data era

Author: Alvarado-Serrano
Anderson
Andrienko
Anon
Anon
Anon
Anon
Baker
Bell
Breiman
Brownstein
Brownstein
Brownstein
Brunker
Butler
Butler
Carneiro
Carrel
Carroll
Chan
Chew
Chunara
Clements
Collier
Collins
Correa
Costa
Cowen
de Glanville
Dhar
Dirk U. Pfeiffer
Dodge
Eastman
Elith
Elith
Eysenbach
Faghmous
Faria
Feizizadeh
Fernández
Firestone
Firestone
França
Freifeld
Gandomi
Gartner
Gibney
Giebultowicz
Gilbert
Ginsberg
Goodchild
Goodchild
Grein
Haklay
Hartley
Hay
Hay
Heipke
Heymann
Hirzel
Hirzel
Hongoh
Istepanian
Jankowski
Jones
Kambatla
Kamel Boulos
Kamel Boulos
Keller
Kim B. Stevens
Kuhn
Lawson
Lazer
Lee
Leetaru
Li
Liang
Ligmann-Zielinska
Malczewski
Malczewski
Martin
Mayer-Schönberger
Milinovich
Milinovich
Mortari
Mullins
Murray
Mykhalovskiy
Okabe
Oliver
Olsen
O’Driscoll
Peters
Pfeiffer
Pfeiffer
Pigliucci
Pigott
Porter
Prates
Pybus
Rutten
Sanchez-Matamoros
Sarojinie Fernando
Schadt
Scholkopf
Schutt
See
Signorini
Solanas
Sorensen
St Louis
Stevens
Stevens
Tatem
Tatem
Tolentino
Tran
van Zyl
van Zyl
Vatsavai
Wesolowski
Wesolowski
Wilson
Wilson
Wilson
Wing
Yemshanov
You
Zeldenrust
Ziegler
Publication venue: 'Elsevier BV'
Publication date: 01/11/2015
Field of study

Crossref