Search CORE

948 research outputs found

Mining and Filtering Multi-level Spatial Association Rules with ARES

Author: A. Appice
D. Malerba
F.A. Lisi
H. Mannila
K. Koperski
L. Dehaspe
M.C. Ludl
S. Džeroski
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

In spatial data mining, a common task is the discovery of spatial association rules from spatial databases. We propose a distributed system, named ARES that takes advantage of the use of a multi-relational approach to mine spatial association rules. It supports spatial database coupling and discovery of multi-level spatial association rules as a means for spatial data exploration. We also present some criteria to bias the search and to filter the discovered rules according to user's expectations. Finally, we show the applicability of our proposal to two different real world domains, namely, document image processing and geo-referenced analysis of census data

Crossref

Archivio istituzionale della ricerca - Università di Bari

SEMI SUPERVISED BASED FRAMEWORK FOR GENE DATA ANALYSIS USING SVM CLASSIFICATION AND RANDOM FOREST APPROACH

Author: R. JOTHI
Publication venue: JConsort
Publication date: 27/11/2019
Field of study

Microarray development is one of the basic biotechnological suggests that permit recording the enunciation levels of thousands of characteristics simultaneously inside different dissimilar models. A microarray quality enunciation educational list can be speak to by an appearance table, where each line looks at to one picky quality, each segment to a model, and each section of the grid is the conscious verbalization level of a particular quality in a model, correspondingly. A huge sales of microarray quality verbalization data in helpful genomics is to arrange tests according to their quality enunciation profiles. Close by the enormous measure of characteristics accessible in quality verbalization data, simply a more modest than typical segment of them is productive for playing out a convinced logical test. Regardless, for most quality verbalization data, the amount of planning tests is still little diverged from the colossal number of characteristics related with the assessments. Right when the amount of characteristics is essentially more imperative than the amount of tests, it is possible to find naturally relevant associations of value lead with the model characterizations or response factors. In this way, one of the mainly huge endeavors with the quality enunciation data is to recognize social occasions of co-coordinated characteristics whose supportive verbalization is unequivocally associated with the depiction classes or response factors. So execute feature subset assurance approach to manage decrease dimensionality, killing unnecessary data and augmentation end precision and presents learning strategy which can accumulate characteristics subject to their relationship to mine significant models from the quality verbalization data using Spatial EM estimation. It will in general be used to figure spatial mean and rank based scatter cross section to eliminate huge models and further execute KNN (K-nearest neighbor request) approach to manage investigation the diseases. A crucial finding is that the all-inclusive semi directed batching estimation is introduced to be valuable for perceiving naturally tremendous quality gatherings with excellent perceptive limit. An ideal sporadic forest area based figuring is proposed for the examinatio

Scholar

Web Data Extraction, Applications and Techniques: A Survey

Author: Abel
Amalfitano
Balduzzi
Baumgartner
Baumgartner
Baumgartner
Baumgartner
Baumgartner
Baumgartner
Berger
Berthold
Bettencourt
Califf
Catanese
Chang
Chen
Chen
Chen
Collins
Conover
Crandall
Crescenzi
Crescenzi
Dalvi
Dalvi
De Meo
De Meo
Doan
Emilio Ferrara
Ferrara
Ferrara
Ferrara
Ferrara
Ferrara
Flesca
Freitag
Furche
Gatterbauer
Gatterbauer
Giacomo Fiumara
Gjoka
Gkotsis
Gottlob
Gottlob
Hammersley
Han
Hecht
Hsu
Irmak
Khare
Kim
Kinsella
Kleinberg
Kleinberg
Kohlschütter
Kokkoras
Kokkoras
Kokkoras
Krüpl
Kushmerick
Kwak
Laender
Liu
Manning
Masanès
Mathes
Meng
Mislove
Monge
Muslea
Oro
Pan
Pasquale De Meo
Perito
Phan
Plake
Rahm
Rahm
Reis
Robert Baumgartner
Sahuguet
Sarawagi
Schifanella
Selkow
Shi
Soderland
Szomszor
Turmo
Vosecky
Wang
Wang
Weikum
Wilson
Winograd
Yang
Ye
Zafarani
Zanasi
Zhai
Zhang
Zhang
Publication venue: 'Elsevier BV'
Publication date: 09/06/2014
Field of study

Web Data Extraction is an important problem that has been studied by means of different scientific tools and in a broad range of applications. Many approaches to extracting data from the Web have been designed to solve specific problems and operate in ad-hoc domains. Other approaches, instead, heavily reuse techniques and algorithms developed in the field of Information Extraction. This survey aims at providing a structured and comprehensive overview of the literature in the field of Web Data Extraction. We provided a simple classification framework in which existing Web Data Extraction applications are grouped into two main classes, namely applications at the Enterprise level and at the Social Web level. At the Enterprise level, Web Data Extraction techniques emerge as a key tool to perform data analysis in Business and Competitive Intelligence systems as well as for business process re-engineering. At the Social Web level, Web Data Extraction techniques allow to gather a large amount of structured data continuously generated and disseminated by Web 2.0, Social Media and Online Social Network users and this offers unprecedented opportunities to analyze human behavior at a very large scale. We discuss also the potential of cross-fertilization, i.e., on the possibility of re-using Web Data Extraction techniques originally designed to work in a given domain, in other domains.Comment: Knowledge-based System

arXiv.org e-Print Archive

Crossref

Spatial associative classification: propositional vs structural approach

Author: A. Appice
A. Appice
Annalisa Appice
D. Malerba
E. Baralis
F. A. Lisi
G. Dong
H. Mannila
J. A. Robinson
J. Fitzpatrick
J. Knobbe
J. R. Quinlan
K. Koperski
L. Dehaspe
L. Dehaspe
M. C. Ludl
M. Ceci
M. Krogel
M. Modrzejewski
M. Pazzani
Michelangelo Ceci
N. Lavrač
P. Domingos
P. Flach
R. E. Bellman
S. Džeroski
S. Kramer
S. Muggleton
S. Shekhar
T. Mitchell
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A COMPREHENSIVE GEOSPATIAL KNOWLEDGE DISCOVERY FRAMEWORK FOR SPATIAL ASSOCIATION RULE MINING

Author: Dao Thi Hong Diep
NC DOCKS at The University of North Carolina at Charlotte
Publication venue
Publication date: 01/01/2013
Field of study

Continuous advances in modern data collection techniques help spatial scientists gain access to massive and high-resolution spatial and spatio-temporal data. Thus there is an urgent need to develop effective and efficient methods seeking to find unknown and useful information embedded in big-data datasets of unprecedentedly large size (e.g., millions of observations), high dimensionality (e.g., hundreds of variables), and complexity (e.g., heterogeneous data sources, space–time dynamics, multivariate connections, explicit and implicit spatial relations and interactions). Responding to this line of development, this research focuses on the utilization of the association rule (AR) mining technique for a geospatial knowledge discovery process. Prior attempts have sidestepped the complexity of the spatial dependence structure embedded in the studied phenomenon. Thus, adopting association rule mining in spatial analysis is rather problematic. Interestingly, a very similar predicament afflicts spatial regression analysis with a spatial weight matrix that would be assigned a priori, without validation on the specific domain of application. Besides, a dependable geospatial knowledge discovery process necessitates algorithms supporting automatic and robust but accurate procedures for the evaluation of mined results. Surprisingly, this has received little attention in the context of spatial association rule mining. To remedy the existing deficiencies mentioned above, the foremost goal for this research is to construct a comprehensive geospatial knowledge discovery framework using spatial association rule mining for the detection of spatial patterns embedded in geospatial databases and to demonstrate its application within the domain of crime analysis. It is the first attempt at delivering a complete geo-spatial knowledge discovery framework using spatial association rule mining

The University of North Carolina at Greensboro

A Two step optimized spatial Association rule Mining Algorithm by hybrid evolutionary algorithm and cluster segmentation.

Author: J.Arunadevi Dr.V.Rajamani, Dr.
Publication venue: Global Journals Inc. (US)
Publication date: 25/05/2011
Field of study

A novel two step approach by adopting hybrid evolutionary algorithm with cluster segmentation for Spatial Association Rule mining (SAR) is presented in this paper.Here first step concentrates on the optimization of SAR using the hybrid evolutionary algorithm which uses genetic algorithm and ant colony optimization (ACO). Multi objective genetic algorithm is used to provide the diversity of associations. ACO is performed to come out of local optima. In the second step, cluster the generated association rules used for the target group segmentation. Preferential based segmentation of the women of various groups belongs to the Madurai city, Tamilnadu, India. Here, number of rules generated by the first step of our SAR is minimized, also time generation for the rules are also minimized. Lift ratio increased for the generated rules

Global Journal of Computer Science and Technology (GJCST)

Link Prediction in Complex Networks: A Survey

Author: Adamic
Airoldi
Albert
Alon
Amaral
Arenas
Baiesi
Barabási
Barahona
Bayes
Bianconi
Blondel
Boccaletti
Breiman
Brin
Buntine
Burke
Butts
Caldarelli
Carmi
Casella
Casella
Chebotarev
Chu
Clauset
Colizza
Cui
da
Dasgupta
Dawah
Dorelan
Dorogovtsev
Fouss
Fouss
Gallagher
Gastner
Geisser
Getoor
Girvan
Granger
Guha
Guimerà
Guimerà
Guimerà
Guimerà
Hanely
Heckerman
Heckerman
Herlocker
Holland
Holme
Holme
Huang
Huang
Huss
Jaccard
Jeh
Jung
Kaluza
Katz
Kim
Klein
Kohavi
Kossinets
Krebs
Kunegis
Lambiotte
Leicht
Leroy
Leskovec
Liben-Nowell
Lin
Linyuan Lü
Liu
Liu
Liu
Liu
Liu
Lusseau
Lü
Lü
Mann
Manning
Mantrach
Marvel
Metropolis
Molloy
Moore
Mossel
Murata
Neal
Neville
Newman
Newman
Newman
Newman
Newman
Newman
Newman
Newman
Ou
O’Madadhain
Pan
Pastor-Satorras
Pastor-Satorras
Penrose
Perotti
Polikar
Ravasz
Redner
Reed
Reichardt
Sales-Pardo
Salton
Salton
Schafer
Schafer
Shang
Shang
Shawe-Taylor
Spiegelhalter
Spring
Stumpf
Su
Sun
Szell
Sørensen
Tao Zhou
Taskar
Tong
Traag
Tylenda
Valverde
von Mering
Vázquez
Wang
Watts
White
White
White
Wilcoxon
Xiao
Xie
Yan
Yin
Yin
Yu
Yu
Yu
Zachary
Zeng
Zhang
Zhang
Zhang
Zhang
Zhang
Zheleva
Zhou
Zhou
Zhou
Zhou
Zhou
Zhou
Zhou
Publication venue: 'Elsevier BV'
Publication date: 04/10/2010
Field of study

Link prediction in complex networks has attracted increasing attention from both physical and computer science communities. The algorithms can be used to extract missing information, identify spurious interactions, evaluate network evolving mechanisms, and so on. This article summaries recent progress about link prediction algorithms, emphasizing on the contributions from physical perspectives and approaches, such as the random-walk-based methods and the maximum likelihood methods. We also introduce three typical applications: reconstruction of networks, evaluation of network evolving mechanism and classification of partially labelled networks. Finally, we introduce some applications and outline future challenges of link prediction algorithms.Comment: 44 pages, 5 figure

arXiv.org e-Print Archive

Crossref