Search CORE

54,158 research outputs found

Knowledge Engineering in Search Engines

Author: Lin Yun-Chieh
Publication venue: SJSU ScholarWorks
Publication date: 01/04/2012
Field of study

With large amounts of information being exchanged on the Internet, search engines have become the most popular tools for helping users to search and filter this information. However, keyword-based search engines sometimes obtain information, which does not meet user’ needs. Some of them are even irrelevant to what the user queries. When the users get query results, they have to read and organize them by themselves. It is not easy for users to handle information when a search engine returns several million results. This project uses a granular computing approach to find knowledge structures of a search engine. The project focuses on knowledge engineering components of a search engine. Based on the earlier work of Dr. Lin and his former student [1], it represents concepts in the Web by simplicial complexes. We found that to represent simplicial complexes adequately, we only need the maximal simplexes. Therefore, this project focuses on building maximal simplexes. Since it is too costly to analyze all Web pages or documents, the project uses the sampling method to get sampling documents. The project constructs simplexes of documents and uses the simplexes to find maximal simplexes. These maximal simplexes are regarded as primitive concepts that can represent Web pages or documents. The maximal simplexes can be used to build an index of a search engine in the future

SJSU ScholarWorks

Using correlation matrix memories for inferencing in expert systems

Author: Austin J
Filer R
Publication venue: 'Royal College of Obstetricians & Gynaecologists (RCOG)'
Publication date: 01/01/1996
Field of study

Outline of The Chapter… Section 16.2 describes CMM and the Dynamic Variable Binding Problem. Section 16.3 deals with how CMM is used as part of an inferencing engine. Section 16.4 details the important performance characteristics of CMM

White Rose Research Online

The NASA Astrophysics Data System: Architecture

Author: Accomazzi A.
Eichhorn G.
Grant C. S.
Kurtz M. J.
Murray S. S.
Publication venue: 'EDP Sciences'
Publication date: 04/02/2000
Field of study

The powerful discovery capabilities available in the ADS bibliographic services are possible thanks to the design of a flexible search and retrieval system based on a relational database model. Bibliographic records are stored as a corpus of structured documents containing fielded data and metadata, while discipline-specific knowledge is segregated in a set of files independent of the bibliographic data itself. The creation and management of links to both internal and external resources associated with each bibliography in the database is made possible by representing them as a set of document properties and their attributes. To improve global access to the ADS data holdings, a number of mirror sites have been created by cloning the database contents and software on a variety of hardware and software platforms. The procedures used to create and manage the database and its mirrors have been written as a set of scripts that can be run in either an interactive or unsupervised fashion. The ADS can be accessed at http://adswww.harvard.eduComment: 25 pages, 8 figures, 3 table

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

A survey on the use of relevance feedback for information access systems

Author: Lalmas Mounia
Ruthven Ian
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/06/2003
Field of study

Users of online search engines often find it difficult to express their need for information in the form of a query. However, if the user can identify examples of the kind of documents they require then they can employ a technique known as relevance feedback. Relevance feedback covers a range of techniques intended to improve a user's query and facilitate retrieval of information relevant to a user's information need. In this paper we survey relevance feedback techniques. We study both automatic techniques, in which the system modifies the user's query, and interactive techniques, in which the user has control over query modification. We also consider specific interfaces to relevance feedback systems and characteristics of searchers that can affect the use and success of relevance feedback systems

Crossref

University of Strathclyde Institutional Repository

Automated Retrieval of Non-Engineering Domain Solutions to Engineering Problems

Author: Goeke M. S.
McAdams D. A.
Stone R. B.
Stroble J. K.
Watkins S. E.
Publication venue: Cranfield University Press
Publication date: 31/03/2009
Field of study

Organised by: Cranfield UniversityBiological inspiration for engineering design has occurred through a variety of techniques such as creation and use of databases, keyword searches of biological information in natural-language format, prior knowledge of biology, and chance observations of nature. This research focuses on utilizing the reconciled Functional Basis function and flow terms to identify suitable biological inspiration for function based design. The organized search provides two levels of results: (1) associated with verb function only and (2) narrowed results associated with verb-noun (function-flow). A set of heuristics has been complied to promote efficient searching using this technique. An example for creating smart flooring is also presented and discussed.Mori Seiki – The Machine Tool Compan

Cranfield CERES

An analysis of the use of graphics for information retrieval

Author: Dunlop Mark D.
Ruthven Ian G.
Publication venue
Publication date: 18/09/1995
Field of study

Several research groups have addressed the problem of retrieving vector graphics. This work has, however, focused either on domain-dependent areas or was based on very simple graphics languages. Here we take a fresh look at the issue of graphics retrieval in general and in particular at the tasks which retrieval systems must support. The paper presents a series of case studies which explored the needs of professionals in the hope that these needs can help direct future graphics IR research. Suggested modelling techniques for some of the graphic collections are also presented

Crossref

University of Strathclyde Institutional Repository

Towards memory supporting personal information management tools

Author: Adar
Barreau
Bederson
Boardman
Bower
Brown
Bruce
Bruce
Byström
Capra
Capra
Carroll
Case
Clark
Cohen
Crovitz
Czerwinski
Czerwinski
Dey
Dourish
Ducheneaut
Dumais
Ebbinghaus
Eldridge
Elsweiler
Eysenck
Freeman
Gifford
Gwizdka
Hayes
Heesch
Herrmann
Herrmann
Hertzum
Hightower
Jones
Jones
Jones
Jones
Krishnan
Kwasnik
Kwasnik
Lansdale
Loftus
Loftus
Malone
Marshall
Mills
Neisser
Palen
Platt
Reason
Rekimoto
Renaud
Rieman
Rodden
Rodden
Rubin
Rubin
Rubinstein
Sachs
Spink
Sunderland
Teevan
Terry
Whittaker
Yang
Yee
Publication venue: 'Wiley'
Publication date: 01/01/2007
Field of study

In this article we discuss re-retrieving personal information objects and relate the task to recovering from lapse(s) in memory. We propose that fundamentally it is lapses in memory that impede users from successfully re-finding the information they need. Our hypothesis is that by learning more about memory lapses in non-computing contexts and how people cope and recover from these lapses, we can better inform the design of PIM tools and improve the user's ability to re-access and re-use objects. We describe a diary study that investigates the everyday memory problems of 25 people from a wide range of backgrounds. Based on the findings, we present a series of principles that we hypothesize will improve the design of personal information management tools. This hypothesis is validated by an evaluation of a tool for managing personal photographs, which was designed with respect to our findings. The evaluation suggests that users' performance when re-finding objects can be improved by building personal information management tools to support characteristics of human memory

University of Regensburg Publication Server

Crossref

University of Strathclyde Institutional Repository

Efficient Spatial Keyword Search in Trajectory Databases

Author: Cong Gao
Lu Hua
Ooi Beng Chin
Zhang Dongxiang
Zhang Meihui
Publication venue
Publication date: 01/01/2012
Field of study

An increasing amount of trajectory data is being annotated with text descriptions to better capture the semantics associated with locations. The fusion of spatial locations and text descriptions in trajectories engenders a new type of top-

k

queries that take into account both aspects. Each trajectory in consideration consists of a sequence of geo-spatial locations associated with text descriptions. Given a user location

\lambda

and a keyword set

\psi

, a top-

k

query returns

k

trajectories whose text descriptions cover the keywords

\psi

and that have the shortest match distance. To the best of our knowledge, previous research on querying trajectory databases has focused on trajectory data without any text description, and no existing work has studied such kind of top-

k

queries on trajectories. This paper proposes one novel method for efficiently computing top-

k

trajectories. The method is developed based on a new hybrid index, cell-keyword conscious B

^+

-tree, denoted by \cellbtree, which enables us to exploit both text relevance and location proximity to facilitate efficient and effective query processing. The results of our extensive empirical studies with an implementation of the proposed algorithms on BerkeleyDB demonstrate that our proposed methods are capable of achieving excellent performance and good scalability.Comment: 12 page

arXiv.org e-Print Archive

Roskilde Universitet

VBN

The study of probability model for compound similarity searching

Author: Abd. Wahid Mohd. Taib
Alwee Razana
Dollah @ Md. Zain Rozilawati
Salim Naomie
Publication venue: Faculty of Computer Science and Information System
Publication date: 30/09/2006
Field of study

Information Retrieval or IR system main task is to retrieve relevant documents according to the users query. One of IR most popular retrieval model is the Vector Space Model. This model assumes relevance based on similarity, which is defined as the distance between query and document in the concept space. All currently existing chemical compound database systems have adapt the vector space model to calculate the similarity of a database entry to a query compound. However, it assumes that fragments represented by the bits are independent of one another, which is not necessarily true. Hence, the possibility of applying another IR model is explored, which is the Probabilistic Model, for chemical compound searching. This model estimates the probabilities of a chemical structure to have the same bioactivity as a target compound. It is envisioned that by ranking chemical structures in decreasing order of their probability of relevance to the query structure, the effectiveness of a molecular similarity searching system can be increased. Both fragment dependencies and independencies assumption are taken into consideration in achieving improvement towards compound similarity searching system. After conducting a series of simulated similarity searching, it is concluded that PM approaches really did perform better than the existing similarity searching. It gave better result in all evaluation criteria to confirm this statement. In terms of which probability model performs better, the BD model shown improvement over the BIR model

Universiti Teknologi Malaysia Institutional Repository