Search CORE

16,123 research outputs found

Smart Search in Newspaper Archives Using Topic Maps

Author: B. Paepen
J. Engelen
S. Van Hemel
Publication venue
Publication date
Field of study

The OmniPaper project has implemented three information retrieval prototypes in the area of electronic news publishing. One prototype uses SOAP as communication protocol between the central system and a number of distributed news archives. The second prototype uses an RDF metadata database, enabling direct metadata queries to the central system. Finally the Topic Map prototype uses query expansion and semantic linking for smart metadata search. The Topic Map prototype enhances thesearch experience by implementing a knowledge layer that combines the semantic content of a lexical database, consisting of concepts and keywords, with a metadata-set of newspaper articles. The linking between both is currently implemented at the level of keywords but will be developed at the level of concepts in the final prototype. The knowledge layer has been designed from a Topic Map point of view, although the XTM syntax has not been used to avoid performance issues. The consortium’s adopted view on information publishing and retrieval considers querying and navigation as two very related actions that can both be captured under the name “search for relevant information”. Navigation forces the user to followpredefined paths whereas querying enables the user to look freely for a suitable starting point. The query and navigation functionality is provided through a web engine and is build on top of the information structure of the knowledge layer

Horizon Report 2009

Author: Johnson L.
Levine A.
Smith R.
Publication venue
Publication date: 01/01/2009
Field of study

El informe anual Horizon investiga, identifica y clasifica las tecnologías emergentes que los expertos que lo elaboran prevén tendrán un impacto en la enseñanza aprendizaje, la investigación y la producción creativa en el contexto educativo de la enseñanza superior. También estudia las tendencias clave que permiten prever el uso que se hará de las mismas y los retos que ellos suponen para las aulas. Cada edición identifica seis tecnologías o prácticas. Dos cuyo uso se prevé emergerá en un futuro inmediato (un año o menos) dos que emergerán a medio plazo (en dos o tres años) y dos previstas a más largo plazo (5 años)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Diposit Digital de Documents de la UAB

Design of metadata elements for digital news articles in the omnipaper project

Author: Baptista Ana Alice
Pereira T.
Yaginuma T.
Publication venue: Universidade do Minho
Publication date: 01/06/2003
Field of study

This paper examines and proposes a set of metadata elements for describing digital news articles for the benefit of distributed and heterogeneous news resource discovery. Existing digital news description standards such as NITF and NewsML are analysed and compared with Dublin Core Metadata Element Set (DCMES), which results in that the use of Dublin Core is encouraged for interoperability of the resources. The suggested metadata elements are carefully selected and defined considering the characteristics of news articles. Some elements are detailed with refinement qualifiers and recommended encoding scheme. This set of metadata has been developed as a part of the tasks in the IST (Information Society Technologies)-funded European project OmniPaper (Smart Access to European Newspapers, IST-2001-32174)

Universidade do Minho: RepositoriUM

DARIAH and the Benelux

Author: Backes Marianne
Chambers Sally
Hoogerwerf Maarten
Van der West Jan
Publication venue: Department of Applied Linguistics, Translators and Interpreters, University of Antwerp
Publication date: 01/01/2015
Field of study

Ghent University Academic Bibliography

Towards Affordable Disclosure of Spoken Word Archives

Author: Heeren W.F.L.
Hiemstra D.
Huijbregts M.A.H.
Jong F.M.G. de
Ordelman R.J.F.
Publication venue: ILPS, University of Amsterdam
Publication date: 01/01/2008
Field of study

This paper presents and discusses ongoing work aiming at affordable disclosure of real-world spoken word archives in general, and in particular of a collection of recorded interviews with Dutch survivors of World War II concentration camp Buchenwald. Given such collections, the least we want to be able to provide is search at different levels and a flexible way of presenting results. Strategies for automatic annotation based on speech recognition – supporting e.g., within-document search– are outlined and discussed with respect to the Buchenwald interview collection. In addition, usability aspects of the spoken word search are discussed on the basis of our experiences with the online Buchenwald web portal. It is concluded that, although user feedback is generally fairly positive, automatic annotation performance is still far from satisfactory, and requires additional research

CiteSeerX

Radboud Repository

University of Twente Research Information

Metadata elements for digital news resource description

Author: Baptista Ana Alice
Pereira T.
Yaginuma T.
Publication venue
Publication date: 01/01/2003
Field of study

Universidade do Minho: RepositoriUM

Academic Gateway, Fall 2009

Author: San Jose State University Library
Publication venue: SJSU ScholarWorks
Publication date: 01/10/2009
Field of study

SJSU ScholarWorks

Implementation of metadata for OmniPaper RDF prototype

Author: Ariza Ávila Cesar E.
Baptista Ana Alice
Pereira T.
Yaginuma T.
Publication venue
Publication date: 01/01/2004
Field of study

Information Society Technologies (IST) funded OmniPaper project investigates efficient ways for access to distributed and heterogeneous digital news archives using state-of-the-art technologies such as RDF, XTM and SOAP. An approach taken is to create small prototypes based on each of them. This paper presents the first stage of the prototype development, particularly of RDF approach, including analysis on existing news text format standards and metadata vocabularies, definition of metadata elements for OmniPaper, implementation of application profile and RDF schema and development of the RDF prototype in a web-based RDF specific application. The elaborated analysis shows that Dublin Core Metadata Element Set has to be a principal vocabulary to implement the OmniPaper application profile as it provides greater interoperability. The RDF prototype provides RDF “metadatabase” with searchable interface for simple and advance search on the defined metadata elements

Universidade do Minho: RepositoriUM