Search CORE

4,990 research outputs found

Metadata harvesting for content-based distributed information retrieval

Author: Anan
Bailey
Bowman
Callan
Callan
Callan
Callan
Callan
Callan
Carmel
Chou
Craswell
Crow
DCMI
de Sompel
de Sompel
Dijk
French
Gatenby
Gravano
Joint Information Systems Committee
Lagoze
Lagoze
Lagoze
Larson
Liu
Lu
Lu
Lu
Lynch
Nelson
Nottelmann
Paepcke
Sanderson
Simeoni
Simon
Simons
Suleman
van der Kuil
Warner
Witten
Yang
Z39.50 Maintenance Agency
Publication venue
Publication date: 01/01/2007
Field of study

We propose an approach to content-based Distributed Information Retrieval based on the periodic and incremental centralisation of full content indices of widely dispersed and autonomously managed document sources. Inspired by the success of the Open Archive Initiative’s protocol for metadata harvesting, the approach occupies middle ground between content crawling and distributed retrieval. As in crawling, some data moves towards the retrieval process, but it is statistics about the content rather than content itself; this grants more efficient use of network resources and wider scope of application. As in distributed retrieval, some processing is distributed along with the data, but it is indexing rather than retrieval; this reduces the costs of content provision whilst promoting the simplicity, effectiveness, and responsiveness of retrieval. Overall, we argue that the approach retains the good properties of centralised retrieval without renouncing to cost-effective, large-scale resource pooling. We discuss the requirements associated with the approach and identify two strategies to deploy it on top of the OAI infrastructure. In particular, we define a minimal extension of the OAI protocol which supports the coordinated harvesting of full-content indices and descriptive metadata for content resources. Finally, we report on the implementation of a proof-of-concept prototype service for multi-model content-based retrieval of distributed file collections

Crossref

RERO DOC Digital Library

Generic XML-based Framework for Metadata Portals

Author: Diepenbroek Michael
Schindler Uwe
Publication venue: 'Elsevier BV'
Publication date: 01/01/2008
Field of study

Electronic Publication Information Center

HaIRST: Harvesting Institutional Resources in Scotland Testbed. Final Project Report

Author: Dunsire Gordon
Publication venue: University of Strathclyde
Publication date: 01/01/2005
Field of study

The HaIRST project conducted research into the design, implementation and deployment of a pilot service for UK-wide access of autonomously created institutional resources in Scotland, the aim being to investigate and advise on some of the technical, cultural, and organisational requirements associated with the deposit, disclosure, and discovery of institutional resources in the JISC Information Environment. The project involved a consortium of Scottish higher and further education institutions, with significant assistance from the Scottish Library and Information Council. The project investigated the use of technologies based on the Open Archives Initiative (OAI), including the implementation of OAI-compatible repositories for metadata which describe and link to institutional digital resources, the use of the OAI protocol for metadata harvesting (OAI-PMH) to automatically copy the metadata from multiple repositories to a central repository, and the creation of a service to search and identify resources described in the central repository. An important aim of the project was to identify issues of metadata interoperability arising from the requirements of individual institutional repositories and their impact on services based on the aggregation of metadata through harvesting. The project also sought to investigate issues in using these technologies for a wide range of resources including learning, teaching and administrative materials as well as the research and scholarly communication materials considered by many of the other projects in the JISC Focus on Access to Institutional Resources (FAIR) Programme, of which HaIRST was a part. The project tested and implemented a number of open source software packages supporting OAI, and was successful in creating a pilot service which provides effective information retrieval of a range of resources created by the project consortium institutions. The pilot service has been extended to cover research and scholarly communication materials produced by other Scottish universities, and administrative materials produced by a non-educational institution in Scotland. It is an effective testbed for further research and development in these areas. The project has worked extensively with a new OAI standard for 'static repositories' which offers a low-barrier, low-cost mechanism for participation in OAI-based consortia by smaller institutions with a low volume of resources. The project identified and successfully tested tools for transforming pre-existing metadata into a format compliant with OAI standards. The project identified and assessed OAI-related documentation in English from around the world, and has produced metadata for retrieving and accessing it. The project created a Web-based advisory service for institutions and consortia. The OAI Scotland Information Service (OAISIS) provides links to related standards, guidance and documentation, and discusses the findings of HaIRST relating to interoperability and the pilot harvesting service. The project found that open source packages relating to OAI can be installed and made to interoperate to create a viable method of sharing institutional resources within a consortium. HaIRST identified issues affecting the interoperability of shared metadata and suggested ways of resolving them to improve the effectiveness and efficiency of shared information retrieval environments based on OAI. The project demonstrated that application of OAI technologies to administrative materials is an effective way for institutions to meet obligations under Freedom of Information legislation

E-LIS

University of Strathclyde Institutional Repository

Servicing the federation : the case for metadata harvesting

Author: Agosti Maristella
Ferro Nicola
Frommholz I.
Thiel U.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004
Field of study

The paper presents a comparative analysis of data harvesting and distributed computing as complementary models of service delivery within large-scale federated digital libraries. Informed by requirements of flexibility and scalability of federated services, the analysis focuses on the identification and assessment of model invariants. In particular, it abstracts over application domains, services, and protocol implementations. The analytical evidence produced shows that the harvesting model offers stronger guarantees of satisfying the identified requirements. In addition, it suggests a first characterisation of services based on their suitability to either model and thus indicates how they could be integrated in the context of a single federated digital library

CiteSeerX

OPUS

Crossref

University of Strathclyde Institutional Repository

CWI's Institutional Repository

Fraunhofer-ePrints

DR-NTU (Digital Repository of NTU)

Archivio istituzionale della ricerca - Università di Padova

University of Queensland eSpace

Digital library research : current developments and trends

Author: Shiri Ali
Publication venue: MCB University Press
Publication date: 01/01/2003
Field of study

This column gives an overview of current trends in digital library research under the following headings: digital library architecture, systems, tools and technologies; digital content and collections; metadata; interoperability; standards; knowledge organisation systems; users and usability; legal, organisational, economic, and social issues in digital libraries

E-LIS

E-Learning and microformats: a learning object harvesting model and a sample application

Author: Kuru Selahattin
Mödritscher Felix
Soylu Ahmet
Wild Fridolin
Publication venue
Publication date: 01/01/2008
Field of study

In order to support interoperability of learning tools and reusability of resources, this paper introduces a framework for harvesting learning objects from web-based content. Therefore, commonly-known web technologies are examined with respect to their suitability for harvesting embedded meta-data. Then, a lightweight application profile and a microformat for learning objects are proposed based on well-known learning object metadata standards. Additionally, we describe a web service which utilizes XSL transformation (GRDDL) to extract learning objects from different web pages, and provide a SQI target as a retrieval facility using a more complex query language called SPARQL. Finally, we outline the applicability of our framework on the basis of a search client employing the new SQI service for searching and retrieving learning objects

CiteSeerX

Open Research Online (The Open University)

Isik University Academic Open Access

Proposal for an IMLS Collection Registry and Metadata Repository

Author: Bennett Nuala A.
Cole Timothy W.
Mischo William H.
Palmer Carole L.
Twidale Michael B.
Publication venue
Publication date: 01/01/2002
Field of study

The University of Illinois at Urbana-Champaign proposes to design, implement, and research a collection-level registry and item-level metadata repository service that will aggregate information about digital collections and items of digital content created using funds from Institute of Museum and Library Services (IMLS) National Leadership Grants. This work will be a collaboration by the University Library and the Graduate School of Library and Information Science. All extant digital collections initiated or augmented under IMLS aegis from 1998 through September 30, 2005 will be included in the proposed collection registry. Item-level metadata will be harvested from collections making such content available using the Open Archives Initiative Protocol for Metadata Harvesting (OAI PMH). As part of this work, project personnel, in cooperation with IMLS staff and grantees, will define and document appropriate metadata schemas, help create and maintain collection-level metadata records, assist in implementing OAI compliant metadata provider services for dissemination of item-level metadata records, and research potential benefits and issues associated with these activities. The immediate outcomes of this work will be the practical demonstration of technologies that have the potential to enhance the visibility of IMLS funded online exhibits and digital library collections and improve discoverability of items contained in these resources. Experience gained and research conducted during this project will make clearer both the costs and the potential benefits associated with such services. Metadata provider and harvesting service implementations will be appropriately instrumented (e.g., customized anonymous transaction logs, online questionnaires for targeted user groups, performance monitors). At the conclusion of this project we will submit a final report that discusses tasks performed and lessons learned, presents business plans for sustaining registry and repository services, enumerates and summarizes potential benefits of these services, and makes recommendations regarding future implementations of these and related intermediary and end user interoperability services by IMLS projects.unpublishednot peer reviewe

Illinois Digital Environment for Access to Learning and Scholarship Repository

The aDORe federation architecture: digital repositories at scale

Author: CHUTE R
Hochstenbach Patrick
VAN DE SOMPEL H
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Ghent University Academic Bibliography

Study on the use of metadata for digital learning objects in university institutional repositories (MODERI)

Author: Bueno-de-la-Fuente Gema
Hernández Pérez Antonio
Martín-Galán Bonifacio
Méndez Rodríguez Eva María
Rodríguez-Mateos David
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2009
Field of study

Metadata is a core issue for the creation of repositories. Different institutional repositories have chosen and use different metadata models, elements and values for describing the range of digital objects they store. Thus, this paper analyzes the current use of metadata describing those Learning Objects that some open higher educational institutions' repositories include in their collections. The goal of this work is to identify and analyze the different metadata models being used to describe educational features of those specific digital educational objects (such as audience, type of educational object, learning objectives, etc.). Also discussed is the concept and typology of Learning Objects (LO) through their use in University Repositories. We will also examine the usefulness of specifically describing those learning objects, setting them apart from other kind of documents included in the repository, mainly scholarly publications and research results of the Higher Education institution.En prens

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo