12 research outputs found

    Implementasi Metode Kombinasi Histogram of Oriented Gradients dan Hierarchical Centroid untuk Sketch Based Image Retrieval

    Get PDF
    Teknik pencarian gambar yang saat ini umum digunakan masih berbasis teks atau text based search seperti pada mesin pencarian Google Image, Yahoo, dan lain sebagainnya. Namun metode ini masih kurang efektif karena nama dari sebuah file tidak dapat merepresentasikan isinya, oleh karena itu diperlukan pemilihan kata kunci yang benar-benar tepat agar hasil yang diinginkan dapat ditampilkan dengan baik. Salah satu teknik pencarian gambar yang saat ini sedang diteliti adalah Sketch-Based Image Retrieval (SBIR). Dengan teknik ini user dapat menginputkan sketsa gambar atau user dapat menggambarkan obyek pada area yang disediakan lalu sistem akan melakukan pencocokkan sketsa dengan database gambar. Untuk mengimplementasikan teknik ini digunakan metode kombinasi Histogram of Oriented Gradient dan Hierarchical Centroid. Tahapan implementasi teknik tersebut yaitu, yang pertama melakukan preprocessing pada gambar dengan cara mendeteksi tepi obyek lalu membuat citra menjadi hitam putih. Yang kedua melakukan ektraksi fitur menggunakan Histogram of Oriented Gradients dan Hierarchical Centroid dan menghasilkan fitur vektor. Yang terakhir menghitung jarak kedekatan antara gambar yang diuji dengan gambar yang terdapat dalam database menggunakan Euclidean Distance. Hasil Euclidean Distance kemudian diurutkan secara ascending dan dikembalikan sejumlah gambar yang jaraknya terdekat. Hasil temu kembali menghasilkan nilai Average Normalized Modified Retrieval Rank sebesar 0,35 dan nilai presisi dan recall sebesar 78 % dan akurasi sebesar 96%

    Deep Sketch Hashing: Fast Free-hand Sketch-Based Image Retrieval

    Full text link
    Free-hand sketch-based image retrieval (SBIR) is a specific cross-view retrieval task, in which queries are abstract and ambiguous sketches while the retrieval database is formed with natural images. Work in this area mainly focuses on extracting representative and shared features for sketches and natural images. However, these can neither cope well with the geometric distortion between sketches and images nor be feasible for large-scale SBIR due to the heavy continuous-valued distance computation. In this paper, we speed up SBIR by introducing a novel binary coding method, named \textbf{Deep Sketch Hashing} (DSH), where a semi-heterogeneous deep architecture is proposed and incorporated into an end-to-end binary coding framework. Specifically, three convolutional neural networks are utilized to encode free-hand sketches, natural images and, especially, the auxiliary sketch-tokens which are adopted as bridges to mitigate the sketch-image geometric distortion. The learned DSH codes can effectively capture the cross-view similarities as well as the intrinsic semantic correlations between different categories. To the best of our knowledge, DSH is the first hashing work specifically designed for category-level SBIR with an end-to-end deep architecture. The proposed DSH is comprehensively evaluated on two large-scale datasets of TU-Berlin Extension and Sketchy, and the experiments consistently show DSH's superior SBIR accuracies over several state-of-the-art methods, while achieving significantly reduced retrieval time and memory footprint.Comment: This paper will appear as a spotlight paper in CVPR201

    Online sketch-based image retrieval using keyshape mining of geometrical objects

    Get PDF
    Online image retrieval has become an active information-sharing due to the massive use of the Internet. The key challenging problems are the semantic gap between the low-level visual features and high-semantic perception and interpretation, due to understating complexity of images and the hand-drawn query input representation which is not a regular input in addition to the huge amount of web images. Besides, the state-of-art research is highly desired to combine multiple types of different feature representations to close the semantic gap. This study developed a new schema to retrieve images directly from the web repository. It comprises three major phases. Firstly a new online input representation based on pixel mining to detect sketch shape features and correlate them with the semantic sketch objects meaning was designed. Secondly, training process was developed to obtain common templates using Singular Value Decomposition (SVD) technique to detect common sketch template. The outcome of this step is a sketch of variety templates dictionary. Lastly, the retrieval phase matched and compared the sketch with image repository using metadata annotation to retrieve the most relevant images. The sequence of processes in this schema converts the drawn input sketch to a string form which contains the sketch object elements. Then, the string is matched with the templates dictionary to specify the sketch metadata name. This selected name will be sent to a web repository to match and retrieve the relevant images. A series of experiments was conducted to evaluate the performance of the schema against the state of the art found in literature using the same datasets comprising one million images from FlickerIm and 0.2 million images from ImageNet. There was a significant retrieval in all cases of 100% precision for the first five retrieved images whereas the state of the art only achieved 88.8%. The schema has addressed many low features obstacles to retrieve more accurate images such as imperfect sketches, rotation, transpose and scaling. The schema has solved all these problems by using a high level semantic to retrieve accurate images from large databases and the web

    Identificación de personas mediante cámaras cenitales: generación de algoritmos de reconocimiento de personas captadas cenitalmente

    Get PDF
    En este proyecto se lleva acabo el diseño y la implementación de un sistema de detección de personas captadas mediante cámaras localizadas cenitalmente . El objet ivo es realizar una comparación entre algoritmos existentes para la detección de personas , y estimar que tipo de características y que tipo de clasificación se ajusta mejor a la detección que se quiere realizar. Se ha realizado la comparación entre dos mét odos de detección de objetos , entrenados con el fin de detectar personas captadas cenitalmente, analizando la tas a de errores que tiene cada método, la cantidad de falsos positivos y de falsos negativos detectados por cada método, tanto en un set de imágen es como en una secuencia de video, para poder conocer su precisión y sus debilidades. Todas las pruebas se realizaron con imágenes captadas en zonas de interior. La captación se ha realizado mediante una cámara “ LI - OV560 - USB - 72” y un PC. Todo el software s e ha desarrollado en lenguaje de programación C++ y el procesado de imagen se ha realizado con ayuda de la librería externa OpenCV, una librería de visión por computador de código abierto disponible para varios sistemas operativos. El software desarrollado consta de una aplicación que permite la identificación de personas en tiempo r eal, pudiendo alternar entre los dos métodos que se a nalizan a lo largo del proyecto.This Project develops the design and implementation of a people detector system , capturing people from aeri al view. The objective is perform a comparison between existent algorithms to detect people , and find out which kind of features and which kind of classification fits better to get the desired detection. A comparison between two object detection methods is made, training each one with t he purpose of detect people in aeri al view, analysing the error rate of each method, the number of false positives and false negatives detected by each method, tested with a set of images and also w ith a video sequence, so we can know the precision and the weakness of each method. All the test were made with interior images. The images were captured by one camera “ LI - OV560 - USB - 72” and a PC. All the software has been developed in C++ programming lang uage , and the image processing has be en done with the OpenCV external library, an open source library for computer vision available in different operative systems . The developed software c onsist mainly of an application that allows the people identificati on in real time, giving the choice to choose between the two methods analysed throughout this p roject
    corecore