Search CORE

775 research outputs found

Plant image retrieval using color, shape and texture features

Author: Kebapcı Hanife
Kebapci Hanife
Unal Gozde
Yanikoglu Berrin
Yanıkoğlu Berrin
Ünal Gözde
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/09/2011
Field of study

We present a content-based image retrieval system for plant image retrieval, intended especially for the house plant identification problem. A plant image consists of a collection of overlapping leaves and possibly flowers, which makes the problem challenging.We studied the suitability of various well-known color, shape and texture features for this problem, as well as introducing some new texture matching techniques and shape features. Feature extraction is applied after segmenting the plant region from the background using the max-flow min-cut technique. Results on a database of 380 plant images belonging to 78 different types of plants show promise of the proposed new techniques and the overall system: in 55% of the queries, the correct plant image is retrieved among the top-15 results. Furthermore, the accuracy goes up to 73% when a 132-image subset of well-segmented plant images are considered

Sabanci University Research Database

Ensemble of Different Approaches for a Reliable Person Re-identification System

Author: Braham Sheryl
Ghidoni Stefano
Menegatti Emanuele
Munaro Matteo
Nanni Loris
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

An ensemble of approaches for reliable person re-identification is proposed in this paper. The proposed ensemble is built combining widely used person re-identification systems using different color spaces and some variants of state-of-the-art approaches that are proposed in this paper. Different descriptors are tested, and both texture and color features are extracted from the images; then the different descriptors are compared using different distance measures (e.g., the Euclidean distance, angle, and the Jeffrey distance). To improve performance, a method based on skeleton detection, extracted from the depth map, is also applied when the depth map is available. The proposed ensemble is validated on three widely used datasets (CAVIAR4REID, IAS, and VIPeR), keeping the same parameter set of each approach constant across all tests to avoid overfitting and to demonstrate that the proposed system can be considered a general-purpose person re-identification system. Our experimental results show that the proposed system offers significant improvements over baseline approaches. The source code used for the approaches tested in this paper will be available at https://www.dei.unipd.it/node/2357 and http://robotics.dei.unipd.it/reid/

Elsevier - Publisher Connector

Directory of Open Access Journals

Archivio istituzionale della ricerca - Università di Padova

Missouri State University: BearWorks

3DFeat-Net: Weakly Supervised Local 3D Features for Point Cloud Registration

Author: A Gordo
AE Johnson
D Holz
DG Lowe
F Pomerleau
Gim Hee Lee
H Bay
JANE BROMLEY
Kwang Moo Yi
PJ Besl
R Hänsch
S Salti
W Maddern
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/07/2018
Field of study

In this paper, we propose the 3DFeat-Net which learns both 3D feature detector and descriptor for point cloud matching using weak supervision. Unlike many existing works, we do not require manual annotation of matching point clusters. Instead, we leverage on alignment and attention mechanisms to learn feature correspondences from GPS/INS tagged 3D point clouds without explicitly specifying them. We create training and benchmark outdoor Lidar datasets, and experiments show that 3DFeat-Net obtains state-of-the-art performance on these gravity-aligned datasets.Comment: 17 pages, 6 figures. Accepted in ECCV 201

arXiv.org e-Print Archive

Crossref

Creating 3D object descriptors using a genetic algorithm

Author: Wegrzyn Dominik
Publication venue
Publication date: 01/06/2013
Field of study

In the technological world that we live in, the need for computer vision became almost as important as human vision. We are surrounded be all kinds of machines that need to have their own virtual eyes. The most developed cars have software that can analyze trafﬁc signs in order to warn the driver about the eventsontheroad. Whenwesendaspacerovertootherplanetitisimportantthatitcananalyzetheground in order to avoid obstacles that would lead to its destruction. Thereisstillmuchworktobedoneintheﬁeldofcomputervisionwiththeviewtoimprovetheperformance and speed of recognition tasks. There are many available descriptors used for 3D point cloud recognition and some of them are explained in this thesis. The aim of this work is to design descriptors that can match correctly 3D point clouds. The idea is to use artiﬁcial intelligence, in the form of a GA to obtain optimized parameters for the descriptors. For this purpose the PCL [RC11] is used, which deals with the manipulation of 3D points data. The created descriptors are explained and experiments are done to illustrate their performance. The main conclusions are that there is still much work to be done in shape recognition. The descriptor developed in this thesis that use only color information is better than the descriptors that use only shape data. Although we have achieved descriptors withgoodperformanceinthisthesis,therecouldbeawaytoimprovethemevenmore. As the descriptor that use only color data is better than the shape-only descriptors, we can expect that there is a better way to represent the shape of an object. Humans can recognize better objects by shape than by color, what makes us wonder if there is a way to improve the techniques used for shape description

UBibliorum repositorio digital da ubi

Algorithms for people re-identification from RGB-D videos exploiting skeletal information

Author: Tartaro Dizmen Deniz
Publication venue
Publication date: 08/04/2022
Field of study

In this thesis a novel methodology to face people re-identification problem is proposed. Re-identification is a complex research topic representing a fundamental issue especially for intelligent video surveillance applications. Its goal is to determine the occurrences of the same person in different video sequences or images, usually by choosing from a high number of candidates within a datasetope

Padua Thesis and Dissertation Archive

Multi-script text versus non-text classification of regions in scene images

Author: Schomaker Lambert
Sriman Bowornrat
Publication venue: 'Elsevier BV'
Publication date: 01/07/2019
Field of study

Text versus non-text region classification is an essential but difficult step in scene-image analysis due to the considerable shape complexity of text and background patterns. There exists a high probability of confusion between background elements and letter parts. This paper proposes a feature-based classification of image blocks using the color autocorrelation histogram (CAH) and the scale-invariant feature transform (SIFT) algorithm, yielding a combined scale and color-invariant feature suitable for scene-text classification. For the evaluation, features were extracted from different color spaces, applying color-histogram autocorrelation. The color features are adjoined with a SIFT descriptor. Parameter tuning is performed and evaluated. For the classification, a standard nearest-neighbor (1NN) and a support-vector machine (SVM) were compared. The proposed method appears to perform robustly and is especially suitable for Asian scripts such as Kannada and Thai, where urban scene-text fonts are characterized by a high curvature and salient color variations

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

SIFT applied to CBIR

Author: ALMEIDA J.
GOLDENSTEINS S. K.
TORRES R. S.
Publication venue: Faculdade Salesiana Maria Auxiliadora
Publication date: 01/12/2009
Field of study

Content-Based Image Retrieval (CBIR) is a challenging task. Common approaches use only low-level features. Notwithstanding, such CBIR solutions fail on capturing some local features representing the details and nuances of scenes. Many techniques in image processing and computer vision can capture these scene semantics. Among them, the Scale Invariant Features Transform~(SIFT) has been widely used in a lot of applications. This approach relies on the choice of several parameters which directly impact its effectiveness when applied to retrieve images. In this paper, we discuss the results obtained in several experiments proposed to evaluate the application of the SIFT in CBIR tasks

Directory of Open Access Journals