
    Variation In Greedy Approach To Set Covering Problem

    The weighted set covering problem is to choose a collection of subsets that covers all the elements of a universal set at the lowest total cost. It is a well-studied classical problem with applications in fields such as machine learning, planning, information retrieval, and facility allocation. Deep web crawling refers to the process of gathering documents that have been structured into a data source and can only be retrieved through a search interface. Its query selection step calls for an efficient solution to the set covering problem.
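
    The classic greedy heuristic this abstract alludes to repeatedly picks the subset with the best cost-per-newly-covered-element ratio. Below is a minimal sketch, assuming subsets are given as (cost, element-set) pairs; the function name and data layout are illustrative, not taken from the paper.

```python
def greedy_set_cover(universe, subsets):
    """Greedily pick subsets until `universe` is covered, each time
    choosing the subset with the lowest cost per newly covered element."""
    uncovered = set(universe)
    chosen, total_cost = [], 0.0
    while uncovered:
        # Keep only subsets that still cover something new.
        candidates = [s for s in subsets if s[1] & uncovered]
        if not candidates:  # some elements cannot be covered at all
            break
        cost, members = min(
            candidates, key=lambda s: s[0] / len(s[1] & uncovered)
        )
        chosen.append((cost, members))
        total_cost += cost
        uncovered -= members
    return chosen, total_cost

# Example: cover {1..5} from three candidate subsets.
subsets = [(1.0, frozenset({1, 2})),
           (2.0, frozenset({2, 3, 4})),
           (1.5, frozenset({4, 5}))]
print(greedy_set_cover({1, 2, 3, 4, 5}, subsets))
```

    This ratio rule is the standard greedy heuristic for weighted set cover and carries the well-known H(n) (roughly ln n) approximation guarantee.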

    Performance Evaluation of Weighted Greedy Algorithm in Resource Management

    Set covering is a well-studied classical problem with many applications across different fields. More recent work on this problem has taken into account parallel computing architectures, datasets at scale, the properties of the datasets, and so on. Within the context of web crawling, where the data follow a lognormal distribution, a weighted greedy algorithm has been proposed in the literature and demonstrated to outperform the traditional one. In the present work, we evaluate the performance of the weighted greedy algorithm on an open-source dataset in the context of resource management. The data are sampled from a given road map with 1.9 million nodes. Our study includes three different cost definitions, i.e., location cost, driving cost, and infrastructure cost. We also vary the coverage radius to model the possible parameters of the application. Our experimental results show that the weighted greedy algorithm outperforms the greedy algorithm by 8% on average across all three cost definitions.
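
    The abstract does not spell out the weighting scheme of the cited weighted greedy algorithm, so the sketch below shows one plausible variant under stated assumptions: each element carries a weight (here, the inverse of its frequency across candidate subsets, an illustrative choice that favors rarely covered elements), and a subset's score becomes its cost divided by the total weight of the elements it newly covers.

```python
from collections import Counter

def weighted_greedy_set_cover(universe, subsets):
    """Weighted greedy variant (illustrative): rank subsets by cost per
    unit of covered *weight* instead of cost per covered *element*."""
    # Element frequency across all candidate subsets.
    freq = Counter(e for _, members in subsets for e in members)
    # Assumed weighting: rarer elements count for more.
    weight = {e: 1.0 / freq[e] for e in universe if freq[e] > 0}
    uncovered = set(weight)  # coverable elements only
    chosen, total_cost = [], 0.0
    while uncovered:
        cost, members = min(
            (s for s in subsets if s[1] & uncovered),
            key=lambda s: s[0] / sum(weight[e] for e in s[1] & uncovered),
        )
        chosen.append((cost, members))
        total_cost += cost
        uncovered -= members
    return chosen, total_cost
```

    In a facility-style setting like the road-map experiment described above, the subsets would be the nodes reachable within a given coverage radius of each candidate location, and the cost would be the location, driving, or infrastructure cost of that candidate.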