Search CORE

17 research outputs found

Which one is better: presentation-based or content-based math search?

Author: A.S. Youssef
B.R. Miller
J. Mišutka
M. Adeel
M. Kohlhase
M.E. Altamimi
M.Q. Nghiem
R. Miner
R. Zanibbi
S. Kamali
Publication venue
Publication date: 01/01/2014
Field of study

Mathematical content is a valuable information source and retrieving this content has become an important issue. This paper compares two searching strategies for math expressions: presentation-based and content-based approaches. Presentation-based search uses state-of-the-art math search system while content-based search uses semantic enrichment of math expressions to convert math expressions into their content forms and searching is done using these content-based expressions. By considering the meaning of math expressions, the quality of search system is improved over presentation-based systems

arXiv.org e-Print Archive

CiteSeerX

Crossref

A Survey on Retrieval of Mathematical Knowledge

Author: A Asperti
A Asperti
A Kohlhase
A Kohlhase
AM Youssef
AS Youssef
AS Youssef
BR Miller
BR Miller
BR Miller
D Delahaye
F Guidi
F Rabe
G Bancerek
G Bancerek
G Bancerek
I Normann
M Adeel
M Líška
M-Q Nghiem
ME Altamimi
O Caprotti
P Baumgartner
P Cairns
P Libbrecht
P Libbrecht
P Libbrecht
Q Zhang
R Miner
R Zanibbi
S Kamali
T Gauthier
Y Haralambous
Publication venue
Publication date: 01/01/2015
Field of study

We present a short survey of the literature on indexing and retrieval of mathematical knowledge, with pointers to 72 papers and tentative taxonomies of both retrieval problems and recurring techniques.Comment: CICM 2015, 20 page

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

MIaS: Math-Aware Retrieval in Digital Mathematical Libraries

Author: Aizawa Akiko
Aizawa Akiko
Białecki Andrzej
Cervone Davide
Líska Martin
Líska Martin
Richard
Rygl Jan
Růžička Michal
Růžička Michal
Sojka Petr
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018
Field of study

Digital mathematical libraries (DMLs) such as arXiv, Numdam, and EuDML contain mainly documents from STEM fields, where mathematical formulae are often more important than text for understanding. Conventional information retrieval (IR) systems are unable to represent formulae and they are therefore ill-suited for math information retrieval (MIR). To fill the gap, we have developed, and open-sourced the MIaS MIR system. MIaS is based on the full-text search engine Apache Lucene. On top of text retrieval, MIaS also incorporates a set of tools for preprocessing mathematical formulae. We describe the design of the system and present speed, and quality evaluation results. We show that MIaS is both efficient, and effective, as evidenced by our victory in the NTCIR-11 Math-2 task

arXiv.org e-Print Archive

Crossref

Univerzitní repozitář Masarykovy univerzity

Semantic formula search in digital mathematical libraries

Author: Elizarov Aleksandr Mihajlovich
Kirillovich Alexander Vitalevich
Lipachev Evgeny Konstantinovich
Nevzorova Olga Avenirovna
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

We are presenting semantic methods of search for mathematical objects in scientific publications. In particular, methods of search for mathematical formulas, as well as methods based on the logical structure of mathematical documents, are being discussed here. Based on the digital mathematical library Lobachevskii DML, created at Kazan Federal University in 2017, declared as Lobachevsky Year, we developed and tested new methods of search in digital collections of mathematical documents

Kazan Federal University Digital Repository

Navegador ontológico matemático-NOMAT

Author: De la Hoz Correa Eduardo Miguel
De-La-Hoz-Franco Emiro
FAJARDO TORO CARLOS HERNAN
Varela Arregoces Ernesto Eduardo
Publication venue: Revista Espacios
Publication date: 22/07/2017
Field of study

The query algorithms in search engines use indexing, contextual analysis and ontologies, among other techniques, for text search. However, they do not use equations due to their writing complexity. NOMAT is a prototype of mathematical expression search engine that seeks information both in thesaurus and internet, using ontological tool for filtering and contextualizing information and LaTeX editor for the symbols in these expressions. This search engine was created to support mathematical research. Compared to other Internet search engines, NOMAT does not require prior knowledge of LaTeX, because has an editing tool which enables writing directly the symbols that make up the mathematical expression of interest. The results obtained were accurate and contextualized, compared to other commercial and no-commercial search engines.Los algoritmos de consulta de los motores de búsqueda utilizan indexación, análisis contextual y ontologías, entre otras técnicas, para la búsqueda de texto. Sin embargo, no utilizan ecuaciones debido a su complejidad de escritura. Nomat es un prototipo de motor de búsqueda de expresión matemática que busca información tanto en tesauro como en Internet, utilizando la Herramienta ontológica para filtrar y contextualizar información y editor de látex para los símbolos de estas expresiones. Este buscador fue creado para apoyar la investigación matemática. En comparación con otros motores de búsqueda de Internet, Nomat no requiere conocimientos previos de látex, ya que cuenta con una herramienta de edición que permite escribir directamente los símbolos que componen la expresión matemática de interés. Los resultados obtenidos fueron precisos y contextualizados, en comparación con otros motores de búsqueda comerciales y no comerciales

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio Digital CUC

Recommended from our members

Mathematical Information Retrieval based on type embeddings and query expansion

Author: Stathopoulos YA
Teufel SH
Publication venue: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Publication date: 01/12/2016
Field of study

We present an approach to mathematical information retrieval (MIR) that exploits a special kind of technical terminology, referred to as a mathematical type. In this paper, we present and evaluate a type detection mechanism and show its positive effect on the retrieval of research-level mathematics. Our best model, which performs query expansion with a type-aware embedding space, strongly outperforms standard IR models with state-of-the-art query expansion (vector space-based and language modelling-based), on a relatively new corpus of research-level queries

Apollo (Cambridge)

数学情報アクセスのための数式表現の検索と曖昧性解消

Author: Giovanni Yoko Kristianto
ギオヴァニヨコクリスティアント
Publication venue: 情報理工学系研究科コンピュータ科学専攻
Publication date: 23/03/2017
Field of study

学位の種別: 課程博士審査委員会委員 : （主査）東京大学准教授渋谷哲朗, 東京大学教授萩谷昌己, 東京大学准教授蓮尾一郎, 東京大学准教授鶴岡慶雅, 東京工業大学准教授藤井敦University of Tokyo(東京大学

Discovering Mathematical Objects of Interest -- A Study of Mathematical Notations

Author: Formánek David
Grün Christian
Hueske Fabian
Kohlhase Andrea
Kohlhase Andrea
Kohlhase Andrea
Kohlhase Michael
Kristianto Giovanni Yoko
Lipani Aldo
Lohia Ashish
Schubotz Moritz
Schubotz Moritz
Schubotz Moritz
Schubotz Moritz
Youssef Abdou
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 19/02/2020
Field of study

Mathematical notation, i.e., the writing system used to communicate concepts in mathematics, encodes valuable information for a variety of information search and retrieval systems. Yet, mathematical notations remain mostly unutilized by today's systems. In this paper, we present the first in-depth study on the distributions of mathematical notation in two large scientific corpora: the open access arXiv (2.5B mathematical objects) and the mathematical reviewing service for pure and applied mathematics zbMATH (61M mathematical objects). Our study lays a foundation for future research projects on mathematical information retrieval for large scientific corpora. Further, we demonstrate the relevance of our results to a variety of use-cases. For example, to assist semantic extraction systems, to improve scientific search engines, and to facilitate specialized math recommendation systems. The contributions of our presented research are as follows: (1) we present the first distributional analysis of mathematical formulae on arXiv and zbMATH; (2) we retrieve relevant mathematical objects for given textual search queries (e.g., linking

P_{n}^{(\alpha, \beta)}\!\left(x\right)

with `Jacobi polynomial'); (3) we extend zbMATH's search engine by providing relevant mathematical formulae; and (4) we exemplify the applicability of the results by presenting auto-completion for math inputs as the first contribution to math recommendation systems. To expedite future research projects, we have made available our source code and data.Comment: Proceedings of The Web Conference 2020 (WWW'20), April 20--24, 2020, Taipei, Taiwa

arXiv.org e-Print Archive

Crossref