15,333 research outputs found
Slovenian Virtual Gallery on the Internet
The Slovenian Virtual Gallery (SVG) is a World Wide Web based multimedia collection of pictures, text, clickable-maps and video clips presenting Slovenian fine art from the gothic period up to the present days. Part of SVG is a virtual gallery space where pictures hang on the walls while another part is devoted to current exhibitions of selected Slovenian art galleries. The first version of this application was developed in the first half of 1995. It was based on a file system for storing all the data and custom developed software for search, automatic generation of HTML documents, scaling of pictures and remote management of the system. Due to the fast development of Web related tools a new version of SVG was developed in 1997 based on object-oriented relational database server technology. Both implementations are presented and compared in this article with issues related to the transion between the two versions. At the end, we will also discuss some extensions to SVG. We will present the GUI (Graphical User Interface) developed specially for presentation of current exhibitions over the Web which is based on GlobalView panoramic navigation extension to developed Internet Video Server (IVS). And since SVG operates with a lot of image data, we will confront with the problem of Image Content Retrieval
CAD-model-based vision for space applications
A pose acquisition system operating in space must be able to perform well in a variety of different applications including automated guidance and inspections tasks with many different, but known objects. Since the space station is being designed with automation in mind, there will be CAD models of all the objects, including the station itself. The construction of vision models and procedures directly from the CAD models is the goal of this project. The system that is being designed and implementing must convert CAD models to vision models, predict visible features from a given view point from the vision models, construct view classes representing views of the objects, and use the view class model thus derived to rapidly determine the pose of the object from single images and/or stereo pairs
Cross-Paced Representation Learning with Partial Curricula for Sketch-based Image Retrieval
In this paper we address the problem of learning robust cross-domain
representations for sketch-based image retrieval (SBIR). While most SBIR
approaches focus on extracting low- and mid-level descriptors for direct
feature matching, recent works have shown the benefit of learning coupled
feature representations to describe data from two related sources. However,
cross-domain representation learning methods are typically cast into non-convex
minimization problems that are difficult to optimize, leading to unsatisfactory
performance. Inspired by self-paced learning, a learning methodology designed
to overcome convergence issues related to local optima by exploiting the
samples in a meaningful order (i.e. easy to hard), we introduce the cross-paced
partial curriculum learning (CPPCL) framework. Compared with existing
self-paced learning methods which only consider a single modality and cannot
deal with prior knowledge, CPPCL is specifically designed to assess the
learning pace by jointly handling data from dual sources and modality-specific
prior information provided in the form of partial curricula. Additionally,
thanks to the learned dictionaries, we demonstrate that the proposed CPPCL
embeds robust coupled representations for SBIR. Our approach is extensively
evaluated on four publicly available datasets (i.e. CUFS, Flickr15K, QueenMary
SBIR and TU-Berlin Extension datasets), showing superior performance over
competing SBIR methods
Furniture models learned from the WWW: using web catalogs to locate and categorize unknown furniture pieces in 3D laser scans
In this article, we investigate how autonomous robots can exploit the high quality information already available from the WWW concerning 3-D models of office furniture. Apart from the hobbyist effort in Google 3-D Warehouse, many companies providing office furnishings already have the models for considerable portions of the objects found in our workplaces and homes. In particular, we present an approach that allows a robot to learn generic models of typical office furniture using examples found in the Web. These generic models are then used by the robot to locate and categorize unknown furniture in real indoor environments
A strategy for the visual recognition of objects in an industrial environment.
This thesis is concerned with the problem of recognizing industrial
objects rapidly and flexibly. The system design is based on a
general strategy that consists of a generalized local feature detector,
an extended learning algorithm and the use of unique structure of
the objects. Thus, the system is not designed to be limited to the
industrial environment.
The generalized local feature detector uses the gradient image of
the scene to provide a feature description that is insensitive to a
range of imaging conditions such as object position, and overall light
intensity. The feature detector is based on a representative point
algorithm which is able to reduce the data content of the image
without restricting the allowed object geometry. Thus, a major advantage
of the local feature detector is its ability to describe and
represent complex object structure. The reliance on local features
also allows the system to recognize partially visible objects.
The task of the learning algorithm is to observe the feature
description generated by the feature detector in order to select
features that are reliable over the range of imaging conditions of
interest. Once a set of reliable features is found for each object,
the system finds unique relational structure which is later used to
recognize the objects. Unique structure is a set of descriptions of
unique subparts of the objects of interest. The present implementation
is limited to the use of unique local structure. The recognition
routine uses these unique descriptions to recognize objects in new
images. An important feature of this strategy is the transference of
a large amount of processing required for graph matching from the
recognition stage to the learning stage, which allows the recognition
routine to execute rapidly.
The test results show that the system is able to function with a
significant level of insensitivity to operating conditions; The system
shows insensitivity to its 3 main assumptions -constant scale, constant
lighting, and 2D images- displaying a degree of graceful degradation
when the operating conditions degrade. For example, for one
set of test objects, the recognition threshold was reached when the
absolute light level was reduced by 70%-80%, or the object scale was
reduced by 30%-40%, or the object was tilted away from the learned 2D
plane by 300-400. This demonstrates a very important feature of the
learning strategy: It shows that the generalizations made by the system
are not only valid within the domain of the sampled set of images,
but extend outside this domain. The test results also show that the
recognition routine is able to execute rapidly, requiring 10ms-500ms
(on a PDP11/24 minicomputer) in the special case when ideal operating
conditions are guaranteed. (Note: This does not include pre-processing
time). This thesis describes the strategy, the architecture and the
implementation of the vision system in detail, and gives detailed test
results. A proposal for extending the system to scale independent 3D
object recognition is also given
- …