16 research outputs found

    Object Detection using Deep Learning with Hierarchical Multi Swarm Optimization

    Get PDF
    Till now there is a huge research had in the field of visual information retrieval, but with the growth of data and with less processing speed we are not meeting the needs of current problem. The main focus of this paper is to identify the objects with salient features and object highlighting. Till now object identification is done with the pixel based or with the region based. Different methodologies are compared in this work and they will be processed with the learning work. Multi scale contrast is one of the pixel based technology where object borders are identified but not the object. This can be done with the histogram contrast. Still it is not covering all the features of the object and it is not clear in identifying the objects at high contrast regions. To solve this issue region based contrasting method is used which is the better solution for all this object identification. After extracting the features and identifying the object, now auto classification or identification of the object should be done. The other part of the work mainly concentrates on the learning system which uses most popular neural network algorithms. Identifying the drawbacks of neural network algorithms and proposing the new methodology identify the objects is done in this paper

    Penerapan CNN dengan Filter Gabor sebagai feature extractor untuk Content-Based Image Retrieval

    Get PDF
    Abstrakā€” Seiring dengan perkembangan teknologi informasi, kebutuhan dalam pencarian informasi menjadi hal yang penting. Jika pencarian informasi selama ini dilakukan pada data berjenis teks, maka pada perkembangan teknologi saat ini, memungkinkan adanya pencarian informasi dalam bentuk citra digital. Hal tersebut terjadi karena adanya peningkatan jumlah pustaka digital dalam bentuk citra. Sebuah metode pengembalian citra menjadi komponen utama untuk memecahkan masalah tersebut. CBIR merupakan sistem pengembalian citra yang akan membantu dalam proses pencarian citra dengan memanfaatkan fitur-fiturnya. Penggunaan ekstraksif fitur yang tepat diperlukan untuk mendapatkan fitur tersebut. Pemilihan ekstraksi ftur akan sangat memengaruhi hasil dari CBIR. Salah satu metode yang dapat melakukan ekstraksi fitur pada citra adalah CNN. Metode yang masih dalam satu jenis dalam deep learning ini mampu mempelajari fitur citra untuk dimanfaatkan ke dala bidang visi komputer. Karena itu, CNN menjadi perhatian menarik dalam penelitian ini untuk melakukan CBIR. Penggunaan filter Gabor yang mampu mendapatkan tekstur citra dengan baik juga akan diimplementasikan sebagai filter pada lapisan konvolusi CNN. Dengan menggunakan CNN dan filter gabor, penelitian ini mampu mendapatkan nilai mAP sebesar 0,895 terhadap data uji dengan dataset GHIM10k. Penelitian ini juga membandingkan beberapa metode pengukuran jarak untuk mendapatkan sistem CBIR terbaik. Kata Kunciā€” Content Based Image Retrieval; Convolutional Neural Networks; pengukuran jarak; filter Gabor; visi komputer

    A fast image retrieval method designed for network big data

    Get PDF
    In the field of big data applications, image information is widely used. The value density of information utilization in big data is very low, and how to extract useful information quickly is very important. So we should transform the unstructured image data source into a form that can be analyzed. In this paper, we proposed a fast image retrieval method which designed for big data. First of all, the feature extraction method is necessary and the feature vectors can be obtained for every image. Then, it is the most important step for us to encode the image feature vectors and make them into database, which can optimize the feature structure. Finally, the corresponding similarity matching is used to determined the retrieval results. There are three main contributions for image retrieval in this paper. New feature extraction method, reasonable elements ranking and appropriate distance metric can improve the algorithm performance. Experiments show that our method has a great improvement in the effective performance of feature extraction and can also get better search matching results

    Weakly Labelled AudioSet Tagging with Attention Neural Networks

    Full text link
    Audio tagging is the task of predicting the presence or absence of sound classes within an audio clip. Previous work in audio tagging focused on relatively small datasets limited to recognising a small number of sound classes. We investigate audio tagging on AudioSet, which is a dataset consisting of over 2 million audio clips and 527 classes. AudioSet is weakly labelled, in that only the presence or absence of sound classes is known for each clip, while the onset and offset times are unknown. To address the weakly-labelled audio tagging problem, we propose attention neural networks as a way to attend the most salient parts of an audio clip. We bridge the connection between attention neural networks and multiple instance learning (MIL) methods, and propose decision-level and feature-level attention neural networks for audio tagging. We investigate attention neural networks modeled by different functions, depths and widths. Experiments on AudioSet show that the feature-level attention neural network achieves a state-of-the-art mean average precision (mAP) of 0.369, outperforming the best multiple instance learning (MIL) method of 0.317 and Google's deep neural network baseline of 0.314. In addition, we discover that the audio tagging performance on AudioSet embedding features has a weak correlation with the number of training samples and the quality of labels of each sound class.Comment: 13 page

    Hierarchical visual perception and two-dimensional compressive sensing for effective content-based color image retrieval

    Get PDF
    Content-based image retrieval (CBIR) has been an active research theme in the computer vision community for over two decades. While the field is relatively mature, significant research is still required in this area to develop solutions for practical applications. One reason that practical solutions have not yet been realized could be due to a limited understanding of the cognitive aspects of the human vision system. Inspired by three cognitive properties of human vision, namely, hierarchical structuring, color perception and embedded compressive sensing, a new CBIR approach is proposed. In the proposed approach, the Hue, Saturation and Value (HSV) color model and the Similar Gray Level Co-occurrence Matrix (SGLCM) texture descriptors are used to generate elementary features. These features then form a hierarchical representation of the data to which a two-dimensional compressive sensing (2D CS) feature mining algorithm is applied. Finally, a weighted feature matching method is used to perform image retrieval. We present a comprehensive set of results of applying our proposed Hierarchical Visual Perception Enabled 2D CS approach using publicly available datasets and demonstrate the efficacy of our techniques when compared with other recently published, state-of-the-art approaches

    An Automatic Image Content Retrieval Method for better Mobile Device Display User Experiences

    Get PDF
    A growing number of commercially available mobile phones come with integrated high-resolution digital cameras. That enables a new class of dedicated applications to image analysis such as mobile visual search, image cropping, object detection, content-based image retrieval, image classification. In this paper, a new mobile application for image content retrieval and classification for mobile device display is proposed to enrich the visual experience of users. The mobile application can extract a certain number of images based on the content of an image with visual saliency methods aiming at detecting the most critical regions in a given image from a perceptual viewpoint. First, the most critical areas from a perceptual perspective are extracted using the local maxima of a 2D saliency function. Next, a salient region is cropped using the bounding box centred on the local maxima of the thresholded Saliency Map of the image. Then, each image crop feds into an Image Classification system based on SVM and SIFT descriptors to detect the class of object present in the image. ImageNet repository was used as the reference for semantic category classification. Android platform was used to implement the mobile application on a client-server architecture. A mobile client sends the photo taken by the camera to the server, which processes the image and returns the results (image contents such as image crops and related target classes) to the mobile client. The application was run on thousands of pictures and showed encouraging results towards a better user visual experience with mobile displays

    A Novel Adaptive LBP-Based Descriptor for Color Image Retrieval

    Get PDF
    In this paper, we present two approaches to extract discriminative features for color image retrieval. The proposed local texture descriptors, based on Radial Mean Local Binary Pattern (RMLBP), are called Color RMCLBP (CRMCLBP) and Prototype Data Model (PDM). RMLBP is a robust to noise descriptor which has been proposed to extract texture features of gray scale images for texture classification. For the first descriptor, the Radial Mean Completed Local Binary Pattern is applied to channels of the color space, independently. Then, the final descriptor is achieved by concatenating the histogram of the CRMCLBP_S/M/C component of each channel. Moreover, to enhance the performance of the proposed method, the Particle Swarm Optimization (PSO) algorithm is used for feature weighting. The second proposed descriptor, PDM, uses the three outputs of CRMCLBP (CRMCLBP_S, CRMCLBP_M, CRMCLBP_C) as discriminative features for each pixel of a color image. Then, a set of representative feature vectors are selected from each image by applying k-means clustering algorithm. This set of selected prototypes are compared by means of a new similarity measure to find the most relevant images. Finally, the weighted versions of PDM is constructed using PSO algorithm. Our proposed methods are tested on Wang, Corel-5k, Corel-10k and Holidays datasets. The results show that our proposed methods makes an admissible tradeoff between speed and retrieval accuracy. The first descriptor enhances the state-of-the-art color texture descriptors in both aspects. The second one is a very fast retrieval algorithm which extracts discriminative features

    Penerapan CNN dengan Filter Gabor sebagai feature extractor untuk Content-Based Image Retrieval

    Get PDF
    Abstrakā€” Seiring dengan perkembangan teknologi informasi, kebutuhan dalam pencarian informasi menjadi hal yang penting. Jika pencarian informasi selama ini dilakukan pada data berjenis teks, maka pada perkembangan teknologi saat ini, memungkinkan adanya pencarian informasi dalam bentuk citra digital. Hal tersebut terjadi karena adanya peningkatan jumlah pustaka digital dalam bentuk citra. Sebuah metode pengembalian citra menjadi komponen utama untuk memecahkan masalah tersebut. CBIR merupakan sistem pengembalian citra yang akan membantu dalam proses pencarian citra dengan memanfaatkan fitur-fiturnya. Penggunaan ekstraksif fitur yang tepat diperlukan untuk mendapatkan fitur tersebut. Pemilihan ekstraksi ftur akan sangat memengaruhi hasil dari CBIR. Salah satu metode yang dapat melakukan ekstraksi fitur pada citra adalah CNN. Metode yang masih dalam satu jenis dalam deep learning ini mampu mempelajari fitur citra untuk dimanfaatkan ke dala bidang visi komputer. Karena itu, CNN menjadi perhatian menarik dalam penelitian ini untuk melakukan CBIR. Penggunaan filter Gabor yang mampu mendapatkan tekstur citra dengan baik juga akan diimplementasikan sebagai filter pada lapisan konvolusi CNN. Dengan menggunakan CNN dan filter gabor, penelitian ini mampu mendapatkan nilai mAP sebesar 0,895 terhadap data uji dengan dataset GHIM10k. Penelitian ini juga membandingkan beberapa metode pengukuran jarak untuk mendapatkan sistem CBIR terbaik. Kata Kunciā€” Content Based Image Retrieval; Convolutional Neural Networks; pengukuran jarak; filter Gabor; visi komputer
    corecore