Search CORE

11 research outputs found

Ontology-Based Recommendation of Editorial Products

Author: Birukou Aliaksandr
Motta Enrico
Osborne Francesco
Thanapalasingam Thiviyan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/10/2018
Field of study

Major academic publishers need to be able to analyse their vast catalogue of products and select the best items to be marketed in scientific venues. This is a complex exercise that requires characterising with a high precision the topics of thousands of books and matching them with the interests of the relevant communities. In Springer Nature, this task has been traditionally handled manually by publishing editors. However, the rapid growth in the number of scientific publications and the dynamic nature of the Computer Science landscape has made this solution increasingly inefficient. We have addressed this issue by creating Smart Book Recommender (SBR), an ontology-based recommender system developed by The Open University (OU) in collaboration with Springer Nature, which supports their Computer Science editorial team in selecting the products to market at specific venues. SBR recommends books, journals, and conference proceedings relevant to a conference by taking advantage of a semantically enhanced representation of about 27K editorial products. This is based on the Computer Science Ontology, a very large-scale, automatically generated taxonomy of research areas. SBR also allows users to investigate why a certain publication was suggested by the system. It does so by means of an interactive graph view that displays the topic taxonomy of the recommended editorial product and compares it with the topic-centric characterization of the input conference. An evaluation carried out with seven Springer Nature editors and seven OU researchers has confirmed the effectiveness of the solution

arXiv.org e-Print Archive

Open Research Online (The Open University)

Recommended from our members

The Smart Book Recommender: An Ontology-Driven Application for Recommending Editorial Products

Author: Birukou Aliaksandr
Motta Enrico
Osborne Francesco
Thanapalasingam Thiviyan
Publication venue
Publication date: 08/10/2018
Field of study

Promoting books and journals to the relevant research communities is an important task for major academic publishers. Unfortunately, identifying which are the best editorial products to market at a certain academic venue is a time-consuming and error-prone process. Here we present the Smart Book Recommender (SBR), an ontology-based recommender that supports the Springer Nature editorial team in selecting the editorial products to market at specific venues. SBR provides an interactive visualisation for analysing the topics characterizing conference series and books. It builds on a dataset of 27K books, journals, and conference proceedings annotated with topics from the Computer Science Ontology, a large-scale ontology of research areas. A user study showed that SBR is able to produce useful recommendations for both editors and researchers

Open Research Online (The Open University)

Recommended from our members

Classifying Research Papers with the Computer Science Ontology

Author: Mannocci Andrea
Motta Enrico
Osborne Francesco
Salatino Angelo
Thanapalasingam Thiviyan
Publication venue
Publication date: 09/10/2018
Field of study

Ontologies of research areas are important tools for characterising, exploring and analysing the research landscape. We recently released the Computer Science Ontology (CSO), a large-scale, automatically generated ontology of research areas, which includes about 26K topics and 226K semantic relationships. CSO currently powers several tools adopted by the Springer Nature editorial team and has been used to enable a variety of solutions, such as classifying research publications, detecting research communities, and predicting research trends. As an effort to encourage the usage of CSO, we have developed the CSO Portal, a web application that enables users to download, explore, and provide granular feedbacks at different levels of the ontology. In this paper, we present the CSO Classifier, an application for automatically classifying academic papers according to the rich taxonomy of topics from CSO. The aim is to facilitate the adoption of CSO across the various communities engaged with scholarly data and to foster the development of new applications based on this knowledge base

Open Research Online (The Open University)

Pragmatic Ontology Evolution: Reconciling User Requirements and Application Performance

Author: A Groß
DN Xuan
F Osborne
F Osborne
F Osborne
F Osborne
F Zablith
F Zablith
H Kondylakis
I Guyon
L Ding
L Qin
M Hartung
M Sabou
NF Noy
P Cimiano
P Ristoski
R Kohavi
SE Middleton
Y Liang
Z Huang
Z Sellami
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Increasingly, organizations are adopting ontologies to describe their large catalogues of items. These ontologies need to evolve regularly in response to changes in the domain and the emergence of new requirements. An important step of this process is the selection of candidate concepts to include in the new version of the ontology. This operation needs to take into account a variety of factors and in particular reconcile user requirements and application performance. Current ontology evolution methods focus either on ranking concepts according to their relevance or on preserving compatibility with existing applications. However, they do not take in consideration the impact of the ontology evolution process on the performance of computational tasks – e.g., in this work we focus on instance tagging, similarity computation, generation of recommendations, and data clustering. In this paper, we propose the Pragmatic Ontology Evolution (POE) framework, a novel approach for selecting from a group of candidates a set of concepts able to produce a new version of a given ontology that i) is consistent with the a set of user requirements (e.g., max number of concepts in the ontology), ii) is parametrised with respect to a number of dimensions (e.g., topological considerations), and iii) effectively supports relevant computational tasks. Our approach also supports users in navigating the space of possible solutions by showing how certain choices, such as limiting the number of concepts or privileging trendy concepts rather than historical ones, would reflect on the application performance. An evaluation of POE on the real-world scenario of the evolving Springer Nature taxonomy for editorial classification yielded excellent results, demonstrating a significant improvement over alternative approaches

Crossref

Open Research Online (The Open University)

Implementing sharing platform based on ontology using a sequential recommender system

Author: Bennani Samir
Ragala Zaynabe
Retbi Asmaâ
Publication venue: Institute of Advanced Engineering and Science
Publication date: 01/12/2023
Field of study

While recommender systems have shown success in many fields, accurate recommendations in industrial settings remain challenging. In maintenance, existing techniques often struggle with the “cold start” problem and fail to consider differences in the target population's characteristics. To address this, additional user information can be incorporated into the recommendation process. This paper proposes a recommender system for recommending repair actions to technicians based on an ontology (knowledge base) and a sequential model. The approach utilizes two ontologies, one representing failure knowledge and the other representing asset attributes. The proposed method involves two steps: i) calculating score similarity based on ontology domain knowledge to make predictions for targeted failures and ii) generating Top-N repair actions through collaborative filtering recommendations for targeted failures. An additional module was implemented to evaluate the recommender system, and results showed improved performance

Institute of Advanced Engineering and Science

Recommended from our members

The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly Articles

Author: Motta Enrico
Osborne Francesco
Salatino Angelo
Thanapalasingam Thiviyan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Classifying research papers according to their research topics is an important task to improve their retrievability, assist the creation of smart analytics, and support a variety of approaches for analysing and making sense of the research environment. In this paper, we present the CSO Classifier, a new unsupervised approach for automatically classifying research papers according to the Computer Science Ontology (CSO), a comprehensive ontology of re-search areas in the field of Computer Science. The CSO Classifier takes as input the metadata associated with a research paper (title, abstract, keywords) and returns a selection of research concepts drawn from the ontology. The approach was evaluated on a gold standard of manually annotated articles yielding a significant improvement over alternative methods

Open Research Online (The Open University)

Improving Editorial Workflow and Metadata Quality at Springer Nature

Author: A Duvvuru
A Moro
AA Salatino
AG Nuzzolese
Angelo A. Salatino
B Sateli
DM Blei
F Osborne
F Osborne
F Osborne
F Osborne
F Osborne
M Herrera
R Usbeck
RL Ohniwa
S Peroni
T Thanapalasingam
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Identifying the research topics that best describe the scope of a scientific publication is a crucial task for editors, in particular because the quality of these annotations determine how effectively users are able to discover the right content in online libraries. For this reason, Springer Nature, the world's largest academic book publisher, has traditionally entrusted this task to their most expert editors. These editors manually analyse all new books, possibly including hundreds of chapters, and produce a list of the most relevant topics. Hence, this process has traditionally been very expensive, time-consuming, and confined to a few senior editors. For these reasons, back in 2016 we developed Smart Topic Miner (STM), an ontology-driven application that assists the Springer Nature editorial team in annotating the volumes of all books covering conference proceedings in Computer Science. Since then STM has been regularly used by editors in Germany, China, Brazil, India, and Japan, for a total of about 800 volumes per year. Over the past three years the initial prototype has iteratively evolved in response to feedback from the users and evolving requirements. In this paper we present the most recent version of the tool and describe the evolution of the system over the years, the key lessons learnt, and the impact on the Springer Nature workflow. In particular, our solution has drastically reduced the time needed to annotate proceedings and significantly improved their discoverability, resulting in 9.3 million additional downloads. We also present a user study involving 9 editors, which yielded excellent results in term of usability, and report an evaluation of the new topic classifier used by STM, which outperforms previous versions in recall and F-measure

arXiv.org e-Print Archive

Crossref

Open Research Online (The Open University)

The Computer Science Ontology: A Comprehensive Automatically-Generated Taxonomy of Research Areas

Author: Birukou Aliaksandr
Mannocci Andrea
Motta Enrico
Osborne Francesco
Salatino Angelo
Thanapalasingam Thiviyan
Publication venue: 'MIT Press - Journals'
Publication date: 24/07/2020
Field of study

Ontologies of research areas are important tools for characterising, exploring, and analysing the research landscape. Some fields of research are comprehensively described by large-scale taxonomies, e.g., MeSH in Biology and PhySH in Physics. Conversely, current Computer Science taxonomies are coarse-grained and tend to evolve slowly. For instance, the ACM classification scheme contains only about 2K research topics and the last version dates back to 2012. In this paper, we introduce the Computer Science Ontology (CSO), a large-scale, automatically generated ontology of research areas, which includes about 14K topics and 162K semantic relationships. It was created by applying the Klink-2 algorithm on a very large dataset of 16M scientific articles. CSO presents two main advantages over the alternatives: i) it includes a very large number of topics that do not appear in other classifications, and ii) it can be updated automatically by running Klink-2 on recent corpora of publications. CSO powers several tools adopted by the editorial team at Springer Nature and has been used to enable a variety of solutions, such as classifying research publications, detecting research communities, and predicting research trends. To facilitate the uptake of CSO, we have also released the CSO Classifier, a tool for automatically classifying research papers, and the CSO Portal, a web application that enables users to download, explore, and provide granular feedback on CSO. Users can use the portal to navigate and visualise sections of the ontology, rate topics and relationships, and suggest missing ones. The portal will support the publication of and access to regular new releases of CSO, with the aim of providing a comprehensive resource to the various research communities engaged with scholarly data

Crossref

Open Research Online (The Open University)

Recommended from our members

CSO Classifier 3.0: a scalable unsupervised method for classifying documents in terms of research topics

Author: Motta Enrico
Osborne Francesco
Salatino Angelo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Classifying scientific articles, patents, and other documents according to the relevant research topics is an important task, which enables a variety of functionalities, such as categorising documents in digital libraries, monitoring and predicting research trends, and recommending papers relevant to one or more topics. In this paper, we present the latest version of the CSO Classifier (v3.0), an unsupervised approach for automatically classifying research papers according to the Computer Science Ontology (CSO), a comprehensive taxonomy of research areas in the field of Computer Science. The CSO Classifier takes as input the textual components of a research paper (usually title, abstract, and keywords) and returns a set of research topics drawn from the ontology. This new version includes a new component for discarding outlier topics and offers improved scalability. We evaluated the CSO Classifier on a gold standard of manually annotated articles, demonstrating a significant improvement over alternative methods. We also present an overview of applications adopting the CSO Classifier and describe how it can be adapted to other fields

Open Research Online (The Open University)

Knowledge Graphs and Large Language Models for Intelligent Applications in the Tourism Domain

Author: SECCHI LUCA
Publication venue: Università degli Studi di Cagliari
Publication date: 20/02/2024
Field of study

In the current era of big data, the World Wide Web is transitioning from being merely a repository of content to a complex web of data. Two pivotal technologies underpinning this shift are Knowledge Graphs (KGs) and Data Lakes. Concurrently, Artificial Intelligence has emerged as a potent means to leverage data, creating knowledge and pioneering new tools across various sectors. Among these advancements, Large Language Models (LLM) stand out as transformative technologies in many domains. This thesis delves into an integrative exploration, juxtaposing the structured world of KGs and the raw data reservoirs of Data Lakes, together with a focus on harnessing LLM to derive meaningful insights in the domain of tourism. Starting with an exposition on the importance of KGs in the present digital milieu, the thesis delineates the creation and management of KGs that utilize entities and their relations to represent intricate data patterns within the tourism sector. In this context, we introduce a semi-automatic methodology for generating a Tourism Knowledge Graph (TKG) and a novel Tourism Analytics Ontology (TAO). Through integrating information from enterprise data lakes with public knowledge graphs, the thesis illustrates the creation of a comprehensive semantic layer built upon the raw data, demonstrating versatility and scalability. Subsequently, we present an in-depth investigation into transformer-based language models, emphasizing their potential and limitations. Addressing the exigency for domain-specific knowledge enrichment, we conduct a methodical study on knowledge enhancement strategies for transformers based language models. The culmination of this thesis is the presentation of an innovative method that fuses large language models with domain-specific knowledge graphs, targeting the optimisation of hospitality offers. This approach integrates domain KGs with feature engineering, enriching data representation in LLMs. Our scientific contributions span multiple dimensions: from devising methodologies for KG construction, especially in tourism, to the design and implementation of a novel ontology; from the analysis and comparison of techniques for enriching LLMs with specialized knowledge, to deploying such methods in a novel framework that effectively combines LLMs and KGs within the context of the tourism domain. In our research, we explore the potential benefits and challenges arising from the integration of knowledge engineering and artificial intelligence, with a specific emphasis on the tourism sector. We believe our findings offer a promising avenue and serve as a foundational platform for subsequent studies and practical implementations for the academic community and the tourism industry alike

Archivio istituzionale della ricerca - Università di Cagliari