A Multimodal Approach to Exploit Similarity in Documents

C.L. Sable; D.V. Mikhailov; G. Park; J. Ah-Pine; J. Qin; L. Aronovich; L. Denoyer; N. Bouguila; Q. Ye; W. Chan

Publication date: 1 January 2014
Publisher: 'Springer Science and Business Media LLC'

Abstract

Automated document classification extracts information through a systematic analysis of document content. It is an active research field of growing importance because of the large volume of electronic documents produced on the World Wide Web and made accessible by widespread technologies, including mobile ones. Several application areas benefit from automated document classification, including document archiving, invoice processing in business environments, press releases, and search engines. Current tools classify or “tag” text and images separately. In this paper we show how linking image-based and text-based content improves fundamental document management tasks such as retrieving information from a database or automatically classifying documents. We investigate a model of conceptual spaces that uses joint information from the text and the images that make up complex documents. We present a formal model, the computable algorithms, and the dataset from which we drew a subset for our experiments, together with the related tests and results.

Available Versions

Crossref

info:doi/10.1007%2F978-3-319-0...

Last time updated on 22/07/2021

Catalogo dei prodotti della ricerca

oai:iris.univr.it:11562/906384

Last time updated on 09/07/2019