10,547 research outputs found

Three Steps to Heaven: Semantic Publishing in a Real World Workflow

Author: Bizer
Brazma
Knuth
Lamport
Lord
Phillip Lord
Robert Stevens
Shadbolt
Shotton
Simon Cockell
Publication venue: 'MDPI AG'
Publication date: 01/01/2012

Semantic publishing offers the promise of computable papers, enriched visualisation and a realisation of the linked data ideal. In reality, however, the publication process contrives to prevent richer semantics while culminating in a 'lumpen' PDF. In this paper, we discuss a web-first approach to publication, and describe a three-tiered approach which integrates with the existing authoring tooling. Critically, although it adds limited semantics, it does provide value to all the participants in the process: the author, the reader and the machine.
Comment: Published as part of SePublica 201

arXiv.org e-Print Archive

Multidisciplinary Digital Publishing Institute

Crossref

Directory of Open Access Journals

The University of Manchester - Institutional Repository

VMEXT: A Visualization Tool for Mathematical Expression Trees

Author: B Gipp
BR Miller
HS Cohl
M Schubotz
N Meuschke
Q Zhang
R Miner
S Kamali
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017

Mathematical expressions can be represented as a tree consisting of terminal symbols, such as identifiers or numbers (leaf nodes), and functions or operators (non-leaf nodes). Expression trees are an important mechanism for storing and processing mathematical expressions as well as the most frequently used visualization of the structure of mathematical expressions. Typically, researchers and practitioners manually visualize expression trees using general-purpose tools. This approach is laborious, redundant, and error-prone. Manual visualizations represent a user's notion of what the markup of an expression should be, but not necessarily what the actual markup is. This paper presents VMEXT, a free and open source tool to directly visualize expression trees from parallel MathML. VMEXT simultaneously visualizes the presentation elements and the semantic structure of mathematical expressions to enable users to quickly spot deficiencies in the Content MathML markup that do not affect the presentation of the expression. Identifying such discrepancies previously required reading the verbose and complex MathML markup. VMEXT also allows one to visualize similar and identical elements of two expressions. Visualizing expression similarity can support developers in designing retrieval approaches and enable improved interaction concepts for users of mathematical information retrieval systems. We demonstrate VMEXT's visualizations in two web-based applications. The first application presents the visualizations alone. The second application shows a possible integration of the visualizations in systems for mathematical knowledge management and mathematical information retrieval. The application converts LaTeX input to parallel MathML, computes basic similarity measures for mathematical expressions, and visualizes the results using VMEXT.
Comment: 15 pages, 4 figures, Intelligent Computer Mathematics - 10th International Conference CICM 2017, Edinburgh, UK, July 17-21, 2017, Proceeding
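
The tree representation described in the abstract above (terminal symbols as leaf nodes, functions or operators as non-leaf nodes) can be illustrated with a minimal Python sketch. This is not VMEXT's code and does not parse MathML; the `Node` class and the helpers `render`, `subtrees`, and `shared_subtrees` are hypothetical names chosen for this illustration, and "identical elements" is approximated here as structurally equal subtrees.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """One node of an expression tree (hypothetical illustration class).

    Non-leaf nodes carry a function or operator label; leaf nodes carry
    an identifier or a number and have no children.
    """
    label: str
    children: tuple = ()


def render(node, depth=0):
    """Return the tree as indented lines, one node per line."""
    lines = ["  " * depth + node.label]
    for child in node.children:
        lines.extend(render(child, depth + 1))
    return lines


def subtrees(node):
    """Collect every subtree of an expression, including the root itself."""
    found = {node}
    for child in node.children:
        found |= subtrees(child)
    return found


def shared_subtrees(a, b):
    """Assumed similarity notion: subtrees that occur in both expressions."""
    return subtrees(a) & subtrees(b)


# a + b * c
expr1 = Node("+", (Node("a"), Node("*", (Node("b"), Node("c")))))
# d - b * c
expr2 = Node("-", (Node("d"), Node("*", (Node("b"), Node("c")))))

print("\n".join(render(expr1)))
common = sorted(n.label for n in shared_subtrees(expr1, expr2))
print(common)  # ['*', 'b', 'c']
```

The two example expressions share the product `b * c` and its two leaves, which is the kind of overlap a similarity visualization would highlight.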

arXiv.org e-Print Archive

KOPS - The Institutional Repository of the University of Konstanz

Crossref

Improving the Representation and Conversion of Mathematical Formulae by Considering their Textual Context

Author: Aizawa A.
Cajori F.
Cohl H. S.
Dehaye P.
Ginev D.
Ion P. D. F.
Nghiem M.-Q.
Padovani L.
Schubotz M.
Stamerjohanns H.
Watt S. M.
Youssef A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018

Mathematical formulae represent complex semantic information in a concise form. Especially in Science, Technology, Engineering, and Mathematics, mathematical formulae are crucial to communicate information, e.g., in scientific papers, and to perform computations using computer algebra systems. Enabling computers to access the information encoded in mathematical formulae requires machine-readable formats that can represent both the presentation and content, i.e., the semantics, of formulae. Exchanging such information between systems additionally requires conversion methods for mathematical representation formats. We analyze how the semantic enrichment of formulae improves the format conversion process and show that considering the textual context of formulae reduces the error rate of such conversions. Our main contributions are: (1) providing an openly available benchmark dataset for the mathematical format conversion task consisting of a newly created test collection, an extensive, manually curated gold standard and task-specific evaluation metrics; (2) performing a quantitative evaluation of state-of-the-art tools for mathematical format conversions; (3) presenting a new approach that considers the textual context of formulae to reduce the error rate for mathematical format conversions. Our benchmark dataset facilitates future research on mathematical format conversions as well as research on many problems in mathematical information retrieval. Because we annotated and linked all components of formulae, e.g., identifiers, operators and other entities, to Wikidata entries, the gold standard can, for instance, be used to train methods for formula concept discovery and recognition. Such methods can then be applied to improve mathematical information retrieval systems, e.g., for semantic formula search, recommendation of mathematical content, or detection of mathematical plagiarism.
Comment: 10 pages, 4 figure

arXiv.org e-Print Archive

KOPS - The Institutional Repository of the University of Konstanz

Crossref

PubMed Central

Which one is better: presentation-based or content-based math search?

Author: A.S. Youssef
B.R. Miller
J. Mišutka
M. Adeel
M. Kohlhase
M.E. Altamimi
M.Q. Nghiem
R. Miner
R. Zanibbi
S. Kamali
Publication date: 01/01/2014

Mathematical content is a valuable information source, and retrieving this content has become an important issue. This paper compares two searching strategies for math expressions: presentation-based and content-based approaches. Presentation-based search uses a state-of-the-art math search system, while content-based search uses semantic enrichment to convert math expressions into their content forms and then searches over these content-based expressions. By considering the meaning of math expressions, content-based search improves search quality over presentation-based systems.
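
The difference between the two strategies in the abstract above can be shown with a toy Python sketch. It is not the paper's system: the `CONTENT_FORMS` lookup table stands in for semantic enrichment and is entirely hypothetical, as are the function names and the sample expressions. Presentation-based search is reduced here to exact string matching, which is an assumption made only to keep the example small.

```python
# Hypothetical lookup standing in for semantic enrichment:
# each presentation string is mapped to an assumed content form.
CONTENT_FORMS = {
    "a/b": ("divide", "a", "b"),
    r"\frac{a}{b}": ("divide", "a", "b"),
    "a+b": ("plus", "a", "b"),
}


def presentation_search(query, collection):
    """Match only expressions written exactly like the query."""
    return [expr for expr in collection if expr == query]


def content_search(query, collection):
    """Match expressions whose content form equals the query's content form."""
    target = CONTENT_FORMS.get(query)
    if target is None:
        return []
    return [expr for expr in collection if CONTENT_FORMS.get(expr) == target]


collection = ["a/b", r"\frac{a}{b}", "a+b"]

print(presentation_search("a/b", collection))  # ['a/b']
print(content_search("a/b", collection))       # ['a/b', '\\frac{a}{b}']
```

In this toy setting the content-based lookup also retrieves the fraction written with `\frac`, because both notations map to the same assumed content form.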

arXiv.org e-Print Archive

CiteSeerX

Crossref

$OntoMath^{PRO}$ Ontology: A Linked Data Hub for Mathematics

Author: C. Bizer
C. David
C. Lange
E. Sirin
E.V. Biryaltsev
F. Kamareddine
H. Barendregt
H.S. Barrows
M. Doerr
M. Kohlhase
N. Sloane
O. Nevzorova
O.A. Nevzorova
Publication date: 01/01/2014

In this paper, we present an ontology of mathematical knowledge concepts that covers a wide range of the fields of mathematics and introduces a balanced representation between comprehensive and sensible models. We demonstrate the applications of this representation in information extraction, semantic search, and education. We argue that the ontology can be a core of future integration of math-aware data sets in the Web of Data and, therefore, provide mappings onto relevant datasets, such as DBpedia and ScienceWISE.
Comment: 15 pages, 6 images, 1 table, Knowledge Engineering and the Semantic Web - 5th International Conferenc

arXiv.org e-Print Archive

Crossref