2,789 research outputs found
A Density-Based Approach to the Retrieval of Top-K Spatial Textual Clusters
Keyword-based web queries with local intent retrieve web content that is
relevant to supplied keywords and that represent points of interest that are
near the query location. Two broad categories of such queries exist. The first
encompasses queries that retrieve single spatial web objects that each satisfy
the query arguments. Most proposals belong to this category. The second
category, to which this paper's proposal belongs, encompasses queries that
support exploratory user behavior and retrieve sets of objects that represent
regions of space that may be of interest to the user. Specifically, the paper
proposes a new type of query, namely the top-k spatial textual clusters (k-STC)
query that returns the top-k clusters that (i) are located the closest to a
given query location, (ii) contain the most relevant objects with regard to
given query keywords, and (iii) have an object density that exceeds a given
threshold. To compute this query, we propose a basic algorithm that relies on
on-line density-based clustering and exploits an early stop condition. To
improve the response time, we design an advanced approach that includes three
techniques: (i) an object skipping rule, (ii) spatially gridded posting lists,
and (iii) a fast range query algorithm. An empirical study on real data
demonstrates that the paper's proposals offer scalability and are capable of
excellent performance
Keyword-aware Optimal Route Search
Identifying a preferable route is an important problem that finds
applications in map services. When a user plans a trip within a city, the user
may want to find "a most popular route such that it passes by shopping mall,
restaurant, and pub, and the travel time to and from his hotel is within 4
hours." However, none of the algorithms in the existing work on route planning
can be used to answer such queries. Motivated by this, we define the problem of
keyword-aware optimal route query, denoted by KOR, which is to find an optimal
route such that it covers a set of user-specified keywords, a specified budget
constraint is satisfied, and an objective score of the route is optimal. The
problem of answering KOR queries is NP-hard. We devise an approximation
algorithm OSScaling with provable approximation bounds. Based on this
algorithm, another more efficient approximation algorithm BucketBound is
proposed. We also design a greedy approximation algorithm. Results of empirical
studies show that all the proposed algorithms are capable of answering KOR
queries efficiently, while the BucketBound and Greedy algorithms run faster.
The empirical studies also offer insight into the accuracy of the proposed
algorithms.Comment: VLDB201
Top-k Route Search through Submodularity Modeling of Recurrent POI Features
We consider a practical top-k route search problem: given a collection of
points of interest (POIs) with rated features and traveling costs between POIs,
a user wants to find k routes from a source to a destination and limited in a
cost budget, that maximally match her needs on feature preferences. One
challenge is dealing with the personalized diversity requirement where users
have various trade-off between quantity (the number of POIs with a specified
feature) and variety (the coverage of specified features). Another challenge is
the large scale of the POI map and the great many alternative routes to search.
We model the personalized diversity requirement by the whole class of
submodular functions, and present an optimal solution to the top-k route search
problem through indices for retrieving relevant POIs in both feature and route
spaces and various strategies for pruning the search space using user
preferences and constraints. We also present promising heuristic solutions and
evaluate all the solutions on real life data.Comment: 11 pages, 7 figures, 2 table
Personalized content retrieval in context using ontological knowledge
Personalized content retrieval aims at improving the retrieval process by taking into account the particular interests of individual users. However, not all user preferences are relevant in all situations. It is well known that human preferences are complex, multiple, heterogeneous, changing, even contradictory, and should be understood in context with the user goals and tasks at hand. In this paper, we propose a method to build a dynamic representation of the semantic context of ongoing retrieval tasks, which is used to activate different subsets of user interests at runtime, in a way that out-of-context preferences are discarded. Our approach is based on an ontology-driven representation of the domain of discourse, providing enriched descriptions of the semantics involved in retrieval actions and preferences, and enabling the definition of effective means to relate preferences and context
Report on requirement analysis and design of PAUSANIAS
Nowadays an increasing amount of web-accessible information on spatial objects becomes available to the public every day. Apart from the spatial location of an object (e.g., a point of interest), additional descriptive information typically includes textual description as well as various ratings, often user generated. Modern applications employ spatio-textual queries, which take into account both the spatial location of an object and its textual similarity to retrieve the most relevant objects. However, existing applications provide a limited functionality to the users. For example, several meaningful queries cannot be expressed by existing approaches and motivate our novel prototype system. The goal of our research is to introduce a novel framework, called Pausanias, for supporting ranked spatial-keyword search over web-accessible geotagged data. In this deliverable, we provide an analysis of requirements that should be fulfilled by the proposed prototype and a concise description of the system design of Pausanias
- …