Search CORE

64,485 research outputs found

Data Mining

Author: Alkadi Ihssan
Publication venue: 'Clute Institute'
Publication date: 01/01/2008
Field of study

Recently data mining has become more popular in the information industry. It is due to the availability of huge amounts of data. Industry needs turning such data into useful information and knowledge. This information and knowledge can be used in many applications ranging from business management, production control, and market analysis, to engineering design and science exploration. Database and information technology have been evolving systematically from primitive file processing systems to sophisticated and powerful databases systems. The research and development in database systems has led to the development of relational database systems, data modeling tools, and indexing and data organization techniques. In relational database systems data are stored in relational tables. In addition, users can get convenient and flexible access to data through query languages, optimized query processing, user interfaces and transaction management and optimized methods for On-Line Transaction Processing (OLTP). The abundant data, which needs powerful data analysis tools, has been described as a data rich but information poor situation. The fast-growing, tremendous amount of data, collected and stored in large and numerous databases. Humans can not analyze these large amounts of data. So we need powerful tools to analyze this large amount of data. As a result, data collected in large databases become data tombs. These are data archives that are seldom visited. So, important decisions are often not made based on the information-rich data stored in databases rather based on a decision maker's intuition. This is because the decision maker does not have the tools to extract the valuable knowledge embedded in the vast amounts of data. Data mining tools which perform data analysis may uncover important data patterns, contributing greatly to business strategies, knowledge bases, and scientific and medical research. So data mining tools will turn data tombs into golden nuggets of knowledge

Clute Institute: Journals

The Hidden Web, XML and Semantic Web: A Scientific Data Management Perspective

Author: Nayak Richi
Senellart Pierre
Suchanek Fabian
Varde Aparna
Publication venue
Publication date: 01/01/2011
Field of study

The World Wide Web no longer consists just of HTML pages. Our work sheds light on a number of trends on the Internet that go beyond simple Web pages. The hidden Web provides a wealth of data in semi-structured form, accessible through Web forms and Web services. These services, as well as numerous other applications on the Web, commonly use XML, the eXtensible Markup Language. XML has become the lingua franca of the Internet that allows customized markups to be defined for specific domains. On top of XML, the Semantic Web grows as a common structured data source. In this work, we first explain each of these developments in detail. Using real-world examples from scientific domains of great interest today, we then demonstrate how these new developments can assist the managing, harvesting, and organization of data on the Web. On the way, we also illustrate the current research avenues in these domains. We believe that this effort would help bridge multiple database tracks, thereby attracting researchers with a view to extend database technology.Comment: EDBT - Tutorial (2011

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

Montclair State University Digital Commons

INRIA a CCSD electronic archive server

Queensland University of Technology ePrints Archive

Hal-Diderot

HAL-Rennes 1

Nanoinformatics: developing new computing applications for nanomedicine

Author: Alberto Anguita
Alejandro Pazos
Antoine Geissbuhler
B Smith
BY Kim
C Kulikowski
C Rosse
CA Kulikowski
Casimir Kulikowski
Cristian Munteanu
D Dela Iglesia
David Perez-Rey
DG Thomas
Diana De la Iglesia
ED Green
F Martin-Sanchez
Fernando Gonzalez-Nilo
Fernando Martin-Sanchez
Ferran Sanz
George Potamias
Guillermo De la Calle
Guillermo Lopez-Campos
H Berman
IS Kohane
Isabel Hermosilla
Jose Crespo
Jose Maria Barreiro
Josipa Kern
Joyce A. Mitchell
Julio C. Facelli
K Jain
Luciano Milanesi
M Gerstein
M Viceconti
Martin Fritts
Miguel Garcia-Remesal
N Gordon
NA Baker
Nathan Baker
Norbert Graf
P Kiberstis
Paula Otero
Peter Ghazal
Pierre Grangeat
Rada Hussein
Raul E. Cachau
RB Altman
S Bewick
Sabine Koch
SI O’Donoghue
Sonia E. Benitez
V Maojo
V Maojo
V Maojo
V Maojo
Vassilis Moustakis
Victor Maojo
Victoria Lopez-Alonso
Yannick Legre
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Nanoinformatics has recently emerged to address the need of computing applications at the nano level. In this regard, the authors have participated in various initiatives to identify its concepts, foundations and challenges. While nanomaterials open up the possibility for developing new devices in many industrial and scientific areas, they also offer breakthrough perspectives for the prevention, diagnosis and treatment of diseases. In this paper, we analyze the different aspects of nanoinformatics and suggest five research topics to help catalyze new research and development in the area, particularly focused on nanomedicine. We also encompass the use of informatics to further the biological and clinical applications of basic research in nanoscience and nanotechnology, and the related concept of an extended ?nanotype? to coalesce information related to nanoparticles. We suggest how nanoinformatics could accelerate developments in nanomedicine, similarly to what happened with the Human Genome and other -omics projects, on issues like exchanging modeling and simulation methods and tools, linking toxicity information to clinical and personal databases or developing new approaches for scientific ontologies, among many others

Repositorio da Universidade da Coruña

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Online Research @ Cardiff

Springer - Publisher Connector

DSpace Universidad de Talca

PubMed Central

Edinburgh Research Explorer

Archivo Digital UPM

Archive ouverte UNIGE

A Molecular Biology Database Digest

Author: Bry François
Kröger Peer
Publication venue
Publication date: 01/01/2000
Field of study

Computational Biology or Bioinformatics has been defined as the application of mathematical and Computer Science methods to solving problems in Molecular Biology that require large scale data, computation, and analysis [18]. As expected, Molecular Biology databases play an essential role in Computational Biology research and development. This paper introduces into current Molecular Biology databases, stressing data modeling, data acquisition, data retrieval, and the integration of Molecular Biology data from different sources. This paper is primarily intended for an audience of computer scientists with a limited background in Biology

CiteSeerX

Open Access LMU

The LSST Data Mining Research Agenda

Author: A. Szalay
Coryn A.L. Bailer-Jones
I. Davidson
J. A. Tyson
J. Becla
K. Borne
Publication venue: 'AIP Publishing'
Publication date: 01/01/2008
Field of study

We describe features of the LSST science database that are amenable to scientific data mining, object classification, outlier identification, anomaly detection, image quality assurance, and survey science validation. The data mining research agenda includes: scalability (at petabytes scales) of existing machine learning and data mining algorithms; development of grid-enabled parallel data mining algorithms; designing a robust system for brokering classifications from the LSST event pipeline (which may produce 10,000 or more event alerts per night); multi-resolution methods for exploration of petascale databases; indexing of multi-attribute multi-dimensional astronomical databases (beyond spatial indexing) for rapid querying of petabyte databases; and more.Comment: 5 pages, Presented at the "Classification and Discovery in Large Astronomical Surveys" meeting, Ringberg Castle, 14-17 October, 200

arXiv.org e-Print Archive

Crossref

1st INCF Workshop on NeuroImaging Database Integration

Author: Lars Forsberg
Per Roland
Publication venue
Publication date: 08/04/2008
Field of study

The goal of this meeting was to map existing neuroimaging databases, particularly databases containing primary data, and to identify mechanisms that could facilitate integrated use of such databases, including possible fusion of databases. The report provides an overview of existing neuroimaging databases that were discussed during the workshop and examines the feasibility of database federations. The report includes several recommendations for future developments

Crossref

Nature Precedings