1,814 research outputs found
Quantifying appearance retention in carpets using geometrical local binary patterns
Quality assessment in carpet manufacturing is performed by humans who evaluate the appearance retention (AR) grade on carpet samples. To quantify the AR grades objectively, different research based on computer vision have been developed. Among them Local Binary Pattern (LBP) and its variations has shown promising results. Nevertheless, the requirements of quality assessment on a wide range of carpets have not been met yet. One of the difficulties is to distinguish between consecutive AR grades in carpets. For this, we adopt an extension of LBP called Geometrical Local Binary Patterns (GLBP) that we recently proposed. The basis of GLBP is to evaluate the grey scale differences between adjacent points defined on a path in a neighbourhood. Symmetries of the paths in the GLBPs are evaluated. The proposed technique is compared with an invariant rotational mirror based LBP technique. The results show that the GLBP technique performs better to distinguish consecutive AR grades in carpets
Kestävät jalat - kettujen jalkaterveyden kehityshanke. Raportti nro 3
Luken kirjat, raportit, oppaat ja esitteet. Raportti nro 3201
Expanding the Family of Grassmannian Kernels: An Embedding Perspective
Modeling videos and image-sets as linear subspaces has proven beneficial for
many visual recognition tasks. However, it also incurs challenges arising from
the fact that linear subspaces do not obey Euclidean geometry, but lie on a
special type of Riemannian manifolds known as Grassmannian. To leverage the
techniques developed for Euclidean spaces (e.g, support vector machines) with
subspaces, several recent studies have proposed to embed the Grassmannian into
a Hilbert space by making use of a positive definite kernel. Unfortunately,
only two Grassmannian kernels are known, none of which -as we will show- is
universal, which limits their ability to approximate a target function
arbitrarily well. Here, we introduce several positive definite Grassmannian
kernels, including universal ones, and demonstrate their superiority over
previously-known kernels in various tasks, such as classification, clustering,
sparse coding and hashing
Face Detection with Effective Feature Extraction
There is an abundant literature on face detection due to its important role
in many vision applications. Since Viola and Jones proposed the first real-time
AdaBoost based face detector, Haar-like features have been adopted as the
method of choice for frontal face detection. In this work, we show that simple
features other than Haar-like features can also be applied for training an
effective face detector. Since, single feature is not discriminative enough to
separate faces from difficult non-faces, we further improve the generalization
performance of our simple features by introducing feature co-occurrences. We
demonstrate that our proposed features yield a performance improvement compared
to Haar-like features. In addition, our findings indicate that features play a
crucial role in the ability of the system to generalize.Comment: 7 pages. Conference version published in Asian Conf. Comp. Vision
201
LBP and irregular graph pyramids
In this paper, a new codification of Local Binary Patterns (LBP) is given using graph pyramids. The LBP code characterizes the topological category (local max, min, slope, saddle) of the gray level landscape around the center region. Given a 2D grayscale image I, our goal is to obtain a simplified image which can be seen as “minimal” representation in terms of topological characterization of I. For this, a method is developed based on merging regions and Minimum Contrast Algorithm
The Conditional Lucas & Kanade Algorithm
The Lucas & Kanade (LK) algorithm is the method of choice for efficient dense
image and object alignment. The approach is efficient as it attempts to model
the connection between appearance and geometric displacement through a linear
relationship that assumes independence across pixel coordinates. A drawback of
the approach, however, is its generative nature. Specifically, its performance
is tightly coupled with how well the linear model can synthesize appearance
from geometric displacement, even though the alignment task itself is
associated with the inverse problem. In this paper, we present a new approach,
referred to as the Conditional LK algorithm, which: (i) directly learns linear
models that predict geometric displacement as a function of appearance, and
(ii) employs a novel strategy for ensuring that the generative pixel
independence assumption can still be taken advantage of. We demonstrate that
our approach exhibits superior performance to classical generative forms of the
LK algorithm. Furthermore, we demonstrate its comparable performance to
state-of-the-art methods such as the Supervised Descent Method with
substantially less training examples, as well as the unique ability to "swap"
geometric warp functions without having to retrain from scratch. Finally, from
a theoretical perspective, our approach hints at possible redundancies that
exist in current state-of-the-art methods for alignment that could be leveraged
in vision systems of the future.Comment: 17 pages, 11 figure
MinMax Radon Barcodes for Medical Image Retrieval
Content-based medical image retrieval can support diagnostic decisions by
clinical experts. Examining similar images may provide clues to the expert to
remove uncertainties in his/her final diagnosis. Beyond conventional feature
descriptors, binary features in different ways have been recently proposed to
encode the image content. A recent proposal is "Radon barcodes" that employ
binarized Radon projections to tag/annotate medical images with content-based
binary vectors, called barcodes. In this paper, MinMax Radon barcodes are
introduced which are superior to "local thresholding" scheme suggested in the
literature. Using IRMA dataset with 14,410 x-ray images from 193 different
classes, the advantage of using MinMax Radon barcodes over \emph{thresholded}
Radon barcodes are demonstrated. The retrieval error for direct search drops by
more than 15\%. As well, SURF, as a well-established non-binary approach, and
BRISK, as a recent binary method are examined to compare their results with
MinMax Radon barcodes when retrieving images from IRMA dataset. The results
demonstrate that MinMax Radon barcodes are faster and more accurate when
applied on IRMA images.Comment: To appear in proceedings of the 12th International Symposium on
Visual Computing, December 12-14, 2016, Las Vegas, Nevada, US
A good balance of costs and benefits: convincing a university administration to support the installation of an interactive multi-application display system on campus
Interactive digital signage systems allow passers-by to take (temporary) control of a public display in order to select content and applications of interest, or even upload content of their own. Not surprisingly, display owners are hesitant to embrace such interactivity, given the uncertainty of what will be shown on their displays. In this paper we summarize our experience of deploying an interactive multi-application display system in the context of a university environment, and in particular our engagements with display owners (i.e., university administration) in order to convince them and get their support for the installation and deployment of such a system. We present the results of semi-structured interviews with display owners regarding their motivations, needs, and concerns with respect to the deployment of such a system at our university. While one cannot generalize from our results, we nevertheless believe that our experiences offer helpful advice to developers of such systems (and/or researchers interested in designing and studying them) in order to aid them in successfully gathering the support of these important stakeholders
- …