18,023 research outputs found
The information retrieval challenge of human digital memories
Today people are storing increasing amounts of personal information in digital format. While storage of such
information is becoming straight forward, retrieval from the vast personal archives that this is creating poses
significant challenges. Existing retrieval techniques are good at retrieving from non-personal spaces, such as the
World Wide Web. However they are not sufficient for retrieval of items from these new unstructured spaces
which contain items that are personal to the individual, and of which the user has personal memories and with
which has had previous interaction. We believe that there are new and exciting possibilities for retrieval from
personal archives. Memory cues act as triggers for individuals in the remembering process, a better
understanding of memory cues will enable us to design new and effective retrieval algorithms and systems for
personal archives. Context data, such as time and location, is already proving to play a key part in this special
retrieval domain, for example for searching personal photo archives, we believe there are many other rich
sources of context that can be exploited for retrieval from personal archives
Multiple Retrieval Models and Regression Models for Prior Art Search
This paper presents the system called PATATRAS (PATent and Article Tracking,
Retrieval and AnalysiS) realized for the IP track of CLEF 2009. Our approach
presents three main characteristics: 1. The usage of multiple retrieval models
(KL, Okapi) and term index definitions (lemma, phrase, concept) for the three
languages considered in the present track (English, French, German) producing
ten different sets of ranked results. 2. The merging of the different results
based on multiple regression models using an additional validation set created
from the patent collection. 3. The exploitation of patent metadata and of the
citation structures for creating restricted initial working sets of patents and
for producing a final re-ranking regression model. As we exploit specific
metadata of the patent documents and the citation relations only at the
creation of initial working sets and during the final post ranking step, our
architecture remains generic and easy to extend
Future information environments: deserts, jungles or parks?
This paper discusses the basic functions of a common information environment and how they are supported by metadata. Several distinct categories of information landscapes are described, characterised by the availability and quality of metadata at the item and collection level. The paper suggests elements of professional practice which can improve the functionality of landscapes, and presents an illustrative scenario of how a common information environment might be effective
Theory and Practice of Data Citation
Citations are the cornerstone of knowledge propagation and the primary means
of assessing the quality of research, as well as directing investments in
science. Science is increasingly becoming "data-intensive", where large volumes
of data are collected and analyzed to discover complex patterns through
simulations and experiments, and most scientific reference works have been
replaced by online curated datasets. Yet, given a dataset, there is no
quantitative, consistent and established way of knowing how it has been used
over time, who contributed to its curation, what results have been yielded or
what value it has.
The development of a theory and practice of data citation is fundamental for
considering data as first-class research objects with the same relevance and
centrality of traditional scientific products. Many works in recent years have
discussed data citation from different viewpoints: illustrating why data
citation is needed, defining the principles and outlining recommendations for
data citation systems, and providing computational methods for addressing
specific issues of data citation.
The current panorama is many-faceted and an overall view that brings together
diverse aspects of this topic is still missing. Therefore, this paper aims to
describe the lay of the land for data citation, both from the theoretical (the
why and what) and the practical (the how) angle.Comment: 24 pages, 2 tables, pre-print accepted in Journal of the Association
for Information Science and Technology (JASIST), 201
Evaluating Digital Libraries: A Longitudinal and Multifaceted View
published or submitted for publicatio
- âŠ