3,888 research outputs found

Towards content-based retrieval of technical drawings through high-dimensional indexing

Authors: Bakergem; Berchtold; Chang; Cvetkovic; Fonseca; Joaquim A Jorge; Manuel J Fonseca; Park
Publication venue: Elsevier BV

Crossref

Improving Engineering Information Access and Knowledge Discovery through Model-Based Information Navigation

Author: Jones David
Publication date: 23/01/2019

Explore Bristol Research

The hunt for submarines in classical art: mappings between scientific invention and artistic interpretation

Authors: Christie Polly; Pringle Mike; Shepherd Rupert
Publication venue: AHDS Visual Arts
Publication date: 01/01/2007

This is a report to the AHRC's ICT in Arts and Humanities Research Programme. This report stems from a project which aimed to produce a series of mappings between advanced imaging information and communications technologies (ICT) and needs within visual arts research. A secondary aim was to demonstrate the feasibility of a structured approach to establishing such mappings. The project was carried out over 2006, from January to December, by the visual arts centre of the Arts and Humanities Data Service (AHDS Visual Arts). It was funded by the Arts and Humanities Research Council (AHRC) as one of the Strategy Projects run under the aegis of its ICT in Arts and Humanities Research programme. The programme, which runs from October 2003 until September 2008, aims ‘to develop, promote and monitor the AHRC’s ICT strategy, and to build capacity nation-wide in the use of ICT for arts and humanities research’. As part of this, the Strategy Projects were intended to contribute to the programme in two ways: knowledge-gathering projects would inform the programme’s Fundamental Strategic Review of ICT, conducted for the AHRC in the second half of 2006, focusing ‘on critical strategic issues such as e-science and peer-review of digital resources’. Resource-development projects would ‘build tools and resources of broad relevance across the range of the AHRC’s academic subject disciplines’. This project fell into the knowledge-gathering strand. The project ran under the leadership of Dr Mike Pringle, Director, AHDS Visual Arts, and the day-to-day management of Polly Christie, Projects Manager, AHDS Visual Arts. The research was carried out by Dr Rupert Shepherd

UCA Research Online

Methodological considerations concerning manual annotation of musical audio in function of algorithm development

Authors: De Baets Bernard; Leman Marc; Lesaffre Micheline; Martens Jean-Pierre
Publication date: 01/01/2004

In research on musical audio-mining, annotated music databases are needed which allow the development of computational tools that extract from the musical audio stream the kind of high-level content that users can deal with in Music Information Retrieval (MIR) contexts. The notion of musical content, and therefore the notion of annotation, is ill-defined, however, both in the syntactic and semantic sense. As a consequence, annotation has been approached from a variety of perspectives (but mainly linguistic-symbolic oriented), and a general methodology is lacking. This paper is a step towards the definition of a general framework for manual annotation of musical audio in function of a computational approach to musical audio-mining that is based on algorithms that learn from annotated data.

CiteSeerX

Ghent University Academic Bibliography

A Multimodal Approach for Semantic Patent Image Retrieval

Authors: Bruns Gerrit; Ewerth Ralph; Pustu-Iren Kader
Publication venue: Aachen, Germany : RWTH Aachen
Publication date: 01/01/2021

Patent images such as technical drawings contain valuable information and are frequently used by experts to compare patents. However, current approaches to patent information retrieval are largely focused on textual information. Consequently, we review previous work on patent retrieval with a focus on illustrations in figures. In this paper, we report on work in progress for a novel approach for patent image retrieval that uses deep multimodal features. Scene text spotting and optical character recognition are employed to extract numerals from an image to subsequently identify references to corresponding sentences in the patent document. Furthermore, we use a neural state-of-the-art CLIP model to extract structural features from illustrations and additionally derive textual features from the related patent text using a sentence transformer model. To fuse our multimodal features for similarity search we apply re-ranking according to averaged or maximum scores. In our experiments, we compare the impact of different modalities on the task of similarity search for patent images. The experimental results suggest that patent image retrieval can be successfully performed using the proposed feature sets, while the best results are achieved when combining the features of both modalities.
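The re-ranking step mentioned in the abstract above (fusing per-modality scores by their average or their maximum) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `fuse_and_rerank`, the candidate identifiers, and the score values are hypothetical, and the sketch assumes each modality has already produced a similarity score per candidate on a comparable scale.

```python
from typing import Dict, List, Tuple


def fuse_and_rerank(
    image_scores: Dict[str, float],
    text_scores: Dict[str, float],
    mode: str = "mean",
) -> List[Tuple[str, float]]:
    """Fuse two per-candidate score maps and return candidates, best first.

    Hypothetical helper: `mode` selects averaged ("mean") or maximum ("max")
    score fusion, the two strategies named in the abstract.
    """
    if mode not in ("mean", "max"):
        raise ValueError("mode must be 'mean' or 'max'")
    fused = {}
    # Assumption: only candidates scored by both modalities are fused.
    for doc_id in image_scores.keys() & text_scores.keys():
        img, txt = image_scores[doc_id], text_scores[doc_id]
        fused[doc_id] = (img + txt) / 2 if mode == "mean" else max(img, txt)
    # Sort by descending fused score; break ties by identifier for determinism.
    return sorted(fused.items(), key=lambda item: (-item[1], item[0]))


# Made-up similarity scores for three candidate figures.
image_scores = {"fig_a": 0.9, "fig_b": 0.4, "fig_c": 0.6}
text_scores = {"fig_a": 0.2, "fig_b": 0.8, "fig_c": 0.7}

print([doc for doc, _ in fuse_and_rerank(image_scores, text_scores, "mean")])
print([doc for doc, _ in fuse_and_rerank(image_scores, text_scores, "max")])
```

With these made-up scores the two strategies order the candidates differently: averaging favours candidates that both modalities rate reasonably well, while the maximum favours a candidate that a single modality rates very highly.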

Repositorium für Naturwissenschaften und Technik

LiveSketch: Query Perturbations for Guided Sketch-based Visual Search

Authors: Bui Tu; Collomosse John; Jin Hailin
Publication date: 01/01/2019

LiveSketch is a novel algorithm for searching large image collections using hand-sketched queries. LiveSketch tackles the inherent ambiguity of sketch search by creating visual suggestions that augment the query as it is drawn, making query specification an iterative rather than one-shot process that helps disambiguate users' search intent. Our technical contributions are: a triplet convnet architecture that incorporates an RNN based variational autoencoder to search for images using vector (stroke-based) queries; real-time clustering to identify likely search intents (and so, targets within the search embedding); and the use of backpropagation from those targets to perturb the input stroke sequence, so suggesting alterations to the query in order to guide the search. We show improvements in accuracy and time-to-task over contemporary baselines using a 67M image corpus. Comment: Accepted to CVPR 201
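The abstract above refers to a triplet convnet architecture. As general background, triplet networks are commonly trained with a margin-based triplet loss that pulls an anchor embedding towards a matching (positive) example and pushes it away from a non-matching (negative) one. The sketch below shows that standard loss only; the abstract does not state the exact loss, distance, or margin used in LiveSketch, so the squared Euclidean distance, the default margin, and the function names are assumptions made for illustration.

```python
from typing import Sequence


def squared_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Squared Euclidean distance between two embeddings of equal length."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def triplet_loss(
    anchor: Sequence[float],
    positive: Sequence[float],
    negative: Sequence[float],
    margin: float = 0.2,  # assumed value, not taken from the paper
) -> float:
    """Standard margin-based triplet loss: max(0, d(a, p) - d(a, n) + margin)."""
    return max(
        0.0,
        squared_distance(anchor, positive)
        - squared_distance(anchor, negative)
        + margin,
    )


# Hypothetical 2-D embeddings: a sketch query, a matching image, a non-matching image.
sketch = [0.0, 0.0]
matching_image = [0.0, 1.0]
other_image = [3.0, 4.0]

# The positive is already much closer than the negative, so the loss is zero.
print(triplet_loss(sketch, matching_image, other_image))
# Swapping the roles violates the margin and yields a positive loss.
print(triplet_loss(sketch, other_image, matching_image, margin=1.0))
```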

arXiv.org e-Print Archive

Crossref

University of Surrey

Surrey Research Insight

Model-based information navigation for engineering documents

Authors: Barrie Jeff; Hicks Ben; Jones David; Matthews Jason; Robinson Kevin; Snider Chris; Yon Jason
Publication venue: Elsevier BV
Publication date: 01/10/2020

Engineering and the manner in which engineers think is largely visual and functional, and yet engineers are typically provided with search engines that are text-based. While software based on a visual and functional ethos exists (CAD, for example), when searching for information engineers are still required to enter a text query into a search box. This process holds potential incompatibilities both with the nature of the data (i.e. 3D models) and with the way in which engineers think and work. Consequently, the proposition tested in this paper is that a model-based approach to information access, i.e. a representation of an organisation's information around a model of an artefact such as a CAD model, can improve engineering information retrieval. In an A-B test with a traditional text-based search engine, and using study questions derived from real-world information seeking scenarios based on the activities of a world-leading aircraft manufacturer, the results presented in this paper suggest that there is merit to such an approach. Specifically, this paper shows that there is no significant difference in time to complete a search between model-based and text-based interfaces in spite of the addition of a new stage in the search process (navigating a 3D model); that the system structure of the model-based interface allows for non-text based documents to be indexed, making up for the inherent limitations in traditional text-based search; and that participants enjoy using the model-based interface and find it intuitive, easy and simple to use. Further, this paper also finds that those with more experience of, or familiarity with, the product structure and those in managerial positions are more likely to find information using a model-based interface than those who are not, who perform better using a text-based interface.

UWE Bristol Research Repository

Explore Bristol Research

A Convolutional Neural Network-based Patent Image Retrieval Method for Design Ideation

Authors: Hu Jie; Jiang Shuo; Luo Jianxi; Magee Christopher L.; Pava Guillermo Ruiz
Publication venue: ASME International
Publication date: 19/05/2020

The patent database is often used in searches of inspirational stimuli for innovative design opportunities because of its large size, extensive variety and rich design information in patent documents. However, most patent mining research only focuses on textual information and ignores visual information. Herein, we propose a convolutional neural network (CNN)-based patent image retrieval method. The core of this approach is a novel neural network architecture named Dual-VGG that is aimed to accomplish two tasks: visual material type prediction and international patent classification (IPC) class label prediction. In turn, the trained neural network provides the deep features in the image embedding vectors that can be utilized for patent image retrieval and visual mapping. The accuracy of both training tasks and patent image embedding space are evaluated to show the performance of our model. This approach is also illustrated in a case study of robot arm design retrieval. Compared to traditional keyword-based searching and Google image searching, the proposed method discovers more useful visual information for engineering design. Comment: 11 pages, 11 figure
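The abstract above describes using image embedding vectors for retrieval. A minimal sketch of how retrieval over such embeddings might work is a nearest-neighbour search by cosine similarity. Everything here is illustrative: the abstract does not say which similarity measure the authors use, and the function names, the tiny 2-D vectors, and the image labels are hypothetical stand-ins for real network outputs.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def retrieve(
    query: Sequence[float],
    index: Dict[str, Sequence[float]],
    k: int = 3,
) -> List[str]:
    """Return the names of the k indexed embeddings most similar to the query."""
    ranked = sorted(
        index.items(),
        # Descending similarity; ties broken by name for determinism.
        key=lambda item: (-cosine_similarity(query, item[1]), item[0]),
    )
    return [name for name, _ in ranked[:k]]


# Hypothetical embeddings; a real system would use high-dimensional CNN features.
index = {
    "robot_arm": [1.0, 0.0],
    "gearbox": [0.0, 1.0],
    "gripper": [0.9, 0.1],
}

print(retrieve([1.0, 0.0], index, k=2))
```

A linear scan like this is adequate for small collections; larger indexes typically rely on an approximate nearest-neighbour structure instead.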

arXiv.org e-Print Archive

Crossref

Sketch-based interaction and modeling: where do we stand?

Authors: Akman Alican; Bonnici Alexandra; Calleja Gabriel; Camilleri Kenneth P.; Fehling Patrick; Ferreira Alfredo; Hermuth Florian; Israel Johann Habakuk; Landwehr Tom; Liu Juncheng; Padfield Natasha M. J.; Rosin Paul L.; Sezgin T. Metin
Publication venue: Cambridge University Press (CUP)
Publication date: 30/11/2019

Sketching is a natural and intuitive communication tool used for expressing concepts or ideas which are difficult to communicate through text or speech alone. Sketching is therefore used for a variety of purposes, from the expression of ideas on two-dimensional (2D) physical media, to object creation, manipulation, or deformation in three-dimensional (3D) immersive environments. This variety in sketching activities brings about a range of technologies which, while having similar scope, namely that of recording and interpreting the sketch gesture to effect some interaction, adopt different interpretation approaches according to the environment in which the sketch is drawn. In fields such as product design, sketches are drawn at various stages of the design process, and therefore, designers would benefit from sketch interpretation technologies which support these differing interactions. However, research typically focuses on one aspect of sketch interpretation and modeling such that literature on available technologies is fragmented and dispersed. In this paper, we bring together the relevant literature describing technologies which can support the product design industry, namely technologies which support the interpretation of sketches drawn on 2D media, sketch-based search interactions, as well as sketch gestures drawn in 3D media. This paper, therefore, gives a holistic view of the algorithmic support that can be provided in the design process. In so doing, we highlight the research gaps and future research directions required to provide full sketch-based interaction support.

Online Research @ Cardiff