Search CORE

146 research outputs found

WAQS : a web-based approximate query system

Author: Chang George Jyh-Shian
Publication venue: Digital Commons @ NJIT
Publication date: 31/05/2001

The Web is often viewed as a gigantic database that holds vast stores of information and provides ubiquitous accessibility to end-users. Since its inception, the Internet has experienced explosive growth in both the number of users and the amount of content available on it. However, searching for information on the Web has become increasingly difficult. Although query languages have long been part of database management systems, the standard query language, Structured Query Language (SQL), is not suitable for Web content retrieval. In this dissertation, a new technique for document retrieval on the Web is presented. This technique is designed to allow detailed retrieval and hence reduce the number of matches returned by typical search engines. The main objective of this technique is to allow the query to be based not just on keywords but also on the location of the keywords within the logical structure of a document. In addition, the technique provides approximate search capabilities based on the notion of Distance and Variable Length Don't Cares. The proposed techniques have been implemented in a system called Web-Based Approximate Query System, which contains an SQL-like query language called Web-Based Approximate Query Language. Web-Based Approximate Query Language has also been integrated with EnviroDaemon, an environmental domain-specific search engine. It provides EnviroDaemon with more detailed searching capabilities than just keyword-based search. Implementation details, technical results and future work are presented in this dissertation.
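The combination of a distance threshold with variable length don't cares mentioned in the abstract can be illustrated on plain strings. The following is only a minimal sketch, not the dissertation's query language or its matching semantics: it assumes a hypothetical `*` wildcard that absorbs any run of characters at zero cost, and it uses ordinary Levenshtein edit distance for everything else. The function names `vldc_distance` and `matches` are made up for this illustration.

```python
def vldc_distance(pattern: str, text: str, wildcard: str = "*") -> int:
    """Edit distance between pattern and text, where the wildcard
    (an assumed notation) matches any substring, including the empty
    one, at zero cost."""
    rows, cols = len(pattern) + 1, len(text) + 1
    dist = [[0] * cols for _ in range(rows)]
    # An empty pattern can only reach text[:j] by inserting j characters.
    for j in range(1, cols):
        dist[0][j] = j
    for i in range(1, rows):
        is_wild = pattern[i - 1] == wildcard
        # Against empty text, a wildcard costs nothing; a literal is deleted.
        dist[i][0] = dist[i - 1][0] + (0 if is_wild else 1)
        for j in range(1, cols):
            if is_wild:
                # Wildcard matches nothing, or absorbs one more text character.
                dist[i][j] = min(dist[i - 1][j], dist[i][j - 1])
            else:
                substitution = dist[i - 1][j - 1] + (pattern[i - 1] != text[j - 1])
                dist[i][j] = min(dist[i - 1][j] + 1,  # delete pattern character
                                 dist[i][j - 1] + 1,  # insert text character
                                 substitution)
    return dist[-1][-1]


def matches(pattern: str, text: str, max_distance: int) -> bool:
    """True when text is within max_distance edits of the pattern."""
    return vldc_distance(pattern, text) <= max_distance
```

For example, `matches("envir*ment", "environment", 0)` succeeds because the wildcard absorbs "on", while a misspelled keyword only matches once `max_distance` is raised.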

Digital Commons @ New Jersey Institute of Technology (NJIT)

Mining XML documents with association rule algorithms

Author: Gürel Görkem
Publication venue: Izmir Institute of Technology
Publication date: 01/01/2008

Thesis (Master)--Izmir Institute of Technology, Computer Engineering, Izmir, 2008. Includes bibliographical references (leaves: 59-63). Text in English; Abstract: Turkish and English. x, 63 leaves. Following the increasing use of XML technology for data storage and data exchange between applications, mining XML documents has become an increasingly important research topic. In this study, we consider the problem of mining association rules between items in XML documents. The principal purpose of this study is to apply association rule algorithms directly to XML documents using XQuery, a functional expression language that can be used to query or process XML data. We used three different algorithms: Apriori, AprioriTid and High Efficient AprioriTid. We compare the mining times of these three Apriori-like algorithms on XML documents using different support levels, different datasets and different dataset sizes.
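As a rough illustration of the first algorithm named in the abstract, the sketch below finds frequent itemsets with the classic Apriori level-wise search. It is written in Python over in-memory sets rather than in XQuery over XML documents, so it is not the thesis's implementation; the function name `apriori` and the sample shopping-basket data are assumptions made for this example.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset whose support
    (fraction of transactions containing it) is at least min_support."""
    transactions = [frozenset(t) for t in transactions]
    total = len(transactions)

    def support(itemset):
        return sum(1 for t in transactions if itemset <= t) / total

    # Level 1: frequent single items.
    singles = {frozenset([item]) for t in transactions for item in t}
    current = {s for s in singles if support(s) >= min_support}
    frequent = {}
    k = 1
    while current:
        for itemset in current:
            frequent[itemset] = support(itemset)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in current for sub in combinations(c, k - 1))
        }
        current = {c for c in candidates if support(c) >= min_support}
    return frequent


# Hypothetical sample data, not taken from the thesis.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
result = apriori(baskets, min_support=0.5)
```

Association rules would then be derived from these frequent itemsets by comparing the support of an itemset with the support of its subsets; that step is omitted here.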

CRIS-IR 2006

Author
Publication venue
Publication date: 01/11/2006

The recognition of entities and their relationships in document collections is an important step towards the discovery of latent knowledge as well as towards supporting knowledge management applications. The challenge lies in how to extract and correlate entities in order to answer key knowledge management questions, such as: who works with whom, on which projects, with which customers and in what research areas. The present work proposes a knowledge mining approach supported by information retrieval and text mining tasks, whose core is based on the correlation of textual elements through the LRD (Latent Relation Discovery) method. Our experiments show that LRD outperforms other correlation methods. We also present an application to demonstrate the approach in knowledge management scenarios. Fundação para a Ciência e a Tecnologia (FCT) Denmark's Electronic Research Librar

Universidade do Minho: RepositoriUM

Personalizing Interactions with Information Systems

Author: Abiteboul
Allen
Anderson
André
Ashish
Ball
Belkin
Belkin
Belkin
Belkin
Belkin
Billsus
Bodner
Borgman
Brusilovsky
Brusilovsky
Brusilovsky
Bush
Card
Card
Card
Carroll
Chaudhuri
Chawathe
Chiaramella
Cingil
Croft
Croft
Cutting
De Bra
Deutsch
Fernández
Fernández
Florescu
Fuhr
Garcia-Molina
Garofalakis
Goh
Goldman
Haller
Hammer
Hearst
Hellerstein
Hiemstra
Joachims
John
Jones
Joshi
Kautz
Kirk
Knoblock
Kramer
Kushmerick
Lacroix
Lieberman
Madsen
Maglio
Manber
Marchetti
Marchionni
Meuss
Miller
Miller
Mintzer
Mobashier
Mostafa
Mukhopadhay
Mulvenna
Munroe
Munroe
Nestorov
Nestorov
O'Leary
Pancake
Pazzani
Pednault
Perkowitz
Ramakrishnan
Ramakrishnan
Resnick
Riecken
Riecken
Robertson
Robertson
Rocchio
Rosson
Rucker
Rus
Sacco
Sahuguet
Schwartz
Shneiderman
Shneiderman
Singh
Smith
Spiliopoulou
Srinivasan
Suchman
Terveen
Thomas
Thomas
Wexelblat
Widom
Williams
Wilson
Xie
Zadrozny
Publication venue: eCommons
Publication date: 01/01/2003

Personalization constitutes the mechanisms and technologies necessary to customize information access to the end-user. It can be defined as the automatic adjustment of information content, structure, and presentation tailored to the individual. In this chapter, we study personalization from the viewpoint of personalizing interaction. The survey covers mechanisms for information-finding on the web, advanced information retrieval systems, dialog-based applications, and mobile access paradigms. Specific emphasis is placed on studying how users interact with an information system and how the system can encourage and foster interaction. This helps bring out the role of the personalization system as a facilitator which reconciles the user’s mental model with the underlying information system’s organization. Three tiers of personalization systems are presented, paying careful attention to interaction considerations. These tiers show how progressive levels of sophistication in interaction can be achieved. The chapter also surveys systems support technologies and niche application domains.

Crossref

University of Dayton

Autonomous Consolidation of Heterogeneous Record-Structured HTML Data in Chameleon

Author: Chouvarine Philippe
Publication venue: Scholars Junction
Publication date: 30/08/2004

While progress has been made in querying digital information contained in XML and HTML documents, success in retrieving information from the so-called hidden Web (data behind Web forms) has been modest. There has been a nascent trend of developing autonomous tools for extracting information from the hidden Web. Automatic tools for ontology generation, wrapper generation, Web form querying, response gathering, etc., have been reported in recent research. This thesis presents a system called Chameleon for automatic querying of and response gathering from the hidden Web. The approach to response gathering is based on automatic table structure identification, since most information repositories of the hidden Web are structured databases, and so the information returned in response to a query will have regularities. Information extraction from the identified record structures is performed based on domain knowledge corresponding to the domain specified in a query. So-called domain plug-ins are used to make the dynamically generated wrappers domain-specific rather than document-specific, as is conventional.

Mississippi State University Libraries ETD database

Scholars Junction - Mississippi State University Institutional Repository

Autonomous Consolidation of Heterogeneous Record-Structured HTML Data in Chameleon

Author: Chouvarine Philippe
Publication venue: Scholars Junction
Publication date: 07/05/2005

Scholars Junction - Mississippi State University Institutional Repository

Proceedings of the 9th International Workshop on Information Retrieval on Current Research Information Systems

Author: Alroe Bo
Fugl Liv
Santos Leonel
Stempfhuber Maximilian
Tenreiro De Magalhaes Sérgio
Publication venue: Universidade do Minho, Gávea
Publication date: 01/01/2006

Repositório Institucional da Universidade Católica Portuguesa

Information Systems and Healthcare XXXIV: Clinical Knowledge Management Systems—Literature Review and Research Issues for Information Systems

Author: Deokar Amit V.
El-Gayar Omar F.
Sarnikar Surendra
Wills Matthew J.
Publication venue: AIS Electronic Library (AISeL)
Publication date: 01/01/2010

Knowledge Management (KM) has emerged as a possible solution to many of the challenges facing U.S. and international healthcare systems. These challenges include concerns regarding the safety and quality of patient care, critical inefficiency, disparate technologies and information standards, rapidly rising costs and clinical information overload. In this paper, we focus on clinical knowledge management systems (CKMS) research. The objectives of the paper are to evaluate the current state of knowledge management systems diffusion in the clinical setting, assess the present status and focus of CKMS research efforts, and identify research gaps and opportunities for future work across the medical informatics and information systems disciplines. The study analyzes the literature along two dimensions: (1) the knowledge management processes of creation, capture, transfer, and application, and (2) the clinical processes of diagnosis, treatment, monitoring and prognosis. The study reveals that the vast majority of CKMS research has been conducted by the medical and health informatics communities. Information systems (IS) researchers have played a limited role in past CKMS research. Overall, the results indicate that there is considerable potential for IS researchers to contribute their expertise to the improvement of clinical processes through technology-based KM approaches.

Beadle Scholar at Dakota State University

AIS Electronic Library (AISeL)

The Family of MapReduce and Large Scale Data Processing Systems

Author: Anna Liu
Ayman G. Fayoumi
Sherif Sakr
Publication venue
Publication date: 12/02/2013

In the last two decades, the continuous increase of computational power has produced an overwhelming flow of data, which has called for a paradigm shift in computing architecture and large scale data processing mechanisms. MapReduce is a simple and powerful programming model that enables easy development of scalable parallel applications to process vast amounts of data on large clusters of commodity machines. It isolates the application from the details of running a distributed program, such as data distribution, scheduling and fault tolerance. However, the original implementation of the MapReduce framework had some limitations that have been tackled by many research efforts in follow-up works after its introduction. This article provides a comprehensive survey of a family of approaches and mechanisms for large scale data processing that have been implemented based on the original idea of the MapReduce framework and are currently gaining a lot of momentum in both research and industrial communities. We also cover a set of systems that have been implemented to provide declarative programming interfaces on top of the MapReduce framework. In addition, we review several large scale data processing systems that resemble some of the ideas of the MapReduce framework for different purposes and application scenarios. Finally, we discuss some of the future research directions for implementing the next generation of MapReduce-like solutions. Comment: arXiv admin note: text overlap with arXiv:1105.4252 by other author
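The programming model described in the abstract can be sketched in a few lines. This is a single-process toy written for illustration, with none of the data distribution, scheduling or fault tolerance that a real framework provides; the names `run_mapreduce`, `word_mapper`, `sum_reducer` and `word_count` are assumptions of this example, not the API of any particular system covered by the survey.

```python
from collections import defaultdict


def run_mapreduce(records, mapper, reducer):
    """Apply mapper to each record, group emitted values by key
    (the shuffle step), then apply reducer to each group."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    return {key: reducer(key, values) for key, values in groups.items()}


def word_mapper(line):
    """Emit a (word, 1) pair for every whitespace-separated word."""
    for word in line.split():
        yield word, 1


def sum_reducer(key, values):
    """Add up all counts emitted for one word."""
    return sum(values)


def word_count(lines):
    return run_mapreduce(lines, word_mapper, sum_reducer)
```

The user supplies only the mapper and reducer; in a distributed implementation the grouping loop inside `run_mapreduce` is what the framework spreads across machines.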

arXiv.org e-Print Archive

CiteSeerX