Search CORE


Classifying document types to enhance search and recommendations in digital libraries

Author: F Sebastiani
L Maaten van der
Y Aphinyanaphongs
Publication date: 13/07/2017

In this paper, we address the problem of classifying documents available from the global network of (open access) repositories according to their type. We show that the metadata provided by repositories that would enable us to distinguish research papers, theses and slides are missing in over 60% of cases. While these metadata describing document types are useful in a variety of scenarios, ranging from research analytics to improving search and recommender (SR) systems, this problem has not yet been sufficiently addressed in the context of the repository infrastructure. We have developed a new approach for classifying document types using supervised machine learning based exclusively on text-specific features. We achieve an F1-score of 0.96 using the random forest and AdaBoost classifiers, which are the best-performing models on our data. By analysing the SR system logs of the CORE [1] digital library aggregator, we show that users are an order of magnitude more likely to click on research papers and theses than on slides. This suggests that using document types as a feature for ranking/filtering SR results in digital libraries has the potential to improve user experience.

Comment: 12 pages, 21st International Conference on Theory and Practice of Digital Libraries (TPDL), 2017, Thessaloniki, Greece

arXiv.org e-Print Archive

Crossref

Rule-based User Characteristics Acquisition from Logs with Semantics for Personalized Web-Based Systems

Author: Barla Michal
Bieliková Mária
Tvarožek Michal
Publication venue: Institute of Informatics, Slovak Academy of Sciences
Publication date: 26/01/2012

Personalization of web-based information systems based on specialized user models has become more important for preserving the effectiveness of their use as the amount of available content increases. We describe a user modeling approach based on automated acquisition of user behaviour and its subsequent rule-based evaluation and transformation into an ontological user model. We stress reusability and flexibility by introducing a novel approach to logging, which preserves the semantics of logged events. The subsequent analysis is driven by specialized rules, which map usage patterns to knowledge about users, stored in an ontology-based user model. We evaluate our approach via a case study using an enhanced faceted browser, which provides personalized navigation support and recommendation.

Computing and Informatics (E-Journal - Institute of Informatics, SAS, Bratislava)

Automated user modeling for personalized digital libraries

Author: Aihara
Angiulli
Belkin
Bezdek
Blum
Costabile
Cristianini
E. Frias-Martinez
Fausett
Ford
Friedman
G. Magoulas
Hartigan
Haykin
Jain
Kobsa
Krishnapuram
Magoulas
Manber
Mitchell
Montaner
R. Macredie
Rabiner
Ramsey
Riecken
S. Chen
Sarukkai
Tsukada
Webb
Winter
Witten
Publication venue: Elsevier BV
Publication date: 01/01/2006

Digital libraries (DL) have become one of the most typical ways of accessing any kind of digitized information. Because of this key role, users welcome any improvement in the services they receive from digital libraries. One trend for improving digital services is personalization. Up to now, the most common approach to personalization in digital libraries has been user-driven. Nevertheless, the design of efficient personalized services has to be done, at least in part, automatically. In this context, machine learning techniques automate the process of constructing user models. This paper proposes a new approach to constructing digital libraries that satisfy users' information needs: Adaptive Digital Libraries, libraries that automatically learn user preferences and goals and personalize their interaction using this information.

CiteSeerX

Crossref

Birkbeck Institutional Research Online

Brunel University Research Archive

A unified view of data-intensive flows in business intelligence systems : a survey

Author: Abelló Gamazo Alberto
Jovanovic Petar
Romero Moral Óscar
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

Data-intensive flows are central processes in today’s business intelligence (BI) systems, deploying different technologies to deliver data, from a multitude of data sources, in user-preferred and analysis-ready formats. To meet the complex requirements of next-generation BI systems, we often need an effective combination of the traditionally batched extract-transform-load (ETL) processes that populate a data warehouse (DW) from integrated data sources, and more real-time and operational data flows that integrate source data at runtime. Both academia and industry thus must have a clear understanding of the foundations of data-intensive flows and the challenges of moving towards next-generation BI environments. In this paper we present a survey of today’s research on data-intensive flows and the related fundamental fields of database theory. The study is based on a proposed set of dimensions describing the important challenges of data-intensive flows in the next-generation BI setting. As a result of this survey, we envision an architecture of a system for managing the lifecycle of data-intensive flows. The results further provide a comprehensive understanding of data-intensive flows, recognizing challenges that are still to be addressed and how current solutions can be applied to address them.

Peer Reviewed. Postprint (author's final draft)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

The semantic web framework (SWF)

Author: Ahmad Yahya Mohammed Nada
أحمد يحيى محمد ندى
Publication venue: Al-Quds University (جامعة القدس)
Publication date: 05/06/2010

Al-Quds University Digital Repository