2 research outputs found

Towards a Semantic Search Engine for Scientific Articles

Authors: Forestier Germain, Hassenforder Michel, Latard Bastien, Weber Jonathan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/09/2017

Because of the data deluge in scientific publishing, finding relevant information is becoming increasingly difficult for researchers and readers. Building an enhanced scientific search engine that takes semantic relations into account is a major challenge. As a starting point, semantic relations between keywords from scientific articles could be extracted in order to classify articles. This could later help users browse and search for content in a scientifically meaningful way. Indeed, by connecting keywords, the context of an article can be extracted. This paper presents ideas for building such a smart search engine and describes the initial contributions towards this ambitious goal.

arXiv.org e-Print Archive

Crossref

Automated Machine Learning for Information Retrieval in Scientific Articles

Authors: Brevilliers Mathieu, Forestier Germain, Hassenforder Michel, Idoumghar Lhassane, Latard Bastien, Lepagnot Julien, Rakhshani Hojjat, Weber Jonathan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/07/2020

The number of scientific conference and journal articles continues to increase, and new approaches are required to help users find relevant publications. This study investigates to what extent a new machine learning (ML) pipeline can identify links between similar scientific articles. The characteristics of intersections and unions of keywords, contextualized keywords (i.e., synsets), and neighbors are computed and used to train an ML model. Automated machine learning (AutoML) is then applied to ease the search for a new pipeline. Extensive experiments demonstrated that a newly designed ML model achieves an accuracy of 90% on a dataset of approximately 120,000 article pairs. These results suggest that applying ML to build new recommendation systems could, in the long term, have a positive impact on the literature.
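
The intersection and union features mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `pair_features`, the lowercase normalization, and the added Jaccard ratio are assumptions made for illustration, and the keyword lists are invented examples. The same idea would presumably apply to synsets and neighbors once each article is reduced to a set.

```python
def pair_features(keywords_a, keywords_b):
    """Compute simple set-overlap features for one pair of articles.

    Hypothetical helper: the paper's actual feature set is not
    specified in the abstract beyond "intersections and unions".
    """
    # Normalize keywords so that "Keywords" and "keywords" match (assumption).
    a = {k.strip().lower() for k in keywords_a}
    b = {k.strip().lower() for k in keywords_b}

    inter = a & b  # keywords shared by both articles
    union = a | b  # keywords appearing in either article

    return {
        "intersection_size": len(inter),
        "union_size": len(union),
        # Jaccard similarity; defined as 0.0 when both sets are empty.
        "jaccard": len(inter) / len(union) if union else 0.0,
    }


# Invented example pair, for illustration only.
features = pair_features(
    ["semantic search", "keywords", "classification"],
    ["Keywords", "AutoML", "classification", "recommendation"],
)
print(features)
```

A feature vector of this kind, computed for each article pair, could then be passed to any standard classifier to predict whether the two articles are linked.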

Crossref

Hal-Diderot