150,392 research outputs found

    A binary neural k-nearest neighbour technique

    Get PDF
    K-Nearest Neighbour (k-NN) is a widely used technique for classifying and clustering data. K-NN is effective but is often criticised for its polynomial run-time growth as k-NN calculates the distance to every other record in the data set for each record in turn. This paper evaluates a novel k-NN classifier with linear growth and faster run-time built from binary neural networks. The binary neural approach uses robust encoding to map standard ordinal, categorical and real-valued data sets onto a binary neural network. The binary neural network uses high speed pattern matching to recall the k-best matches. We compare various configurations of the binary approach to a conventional approach for memory overheads, training speed, retrieval speed and retrieval accuracy. We demonstrate the superior performance with respect to speed and memory requirements of the binary approach compared to the standard approach and we pinpoint the optimal configurations

    PENGGUNAAN FITUR WARNA DAN TEKSTUR UNTUK CONTENT BASED IMAGE RETRIEVAL CITRA BUNGA

    Get PDF
    Pencarian gambar berdasarkan gambar pada database, seringkali dilakukan untuk mengatasi duplikasi pada suatu karya. Content Based Image Retrieval (CBIR) Citra Bunga adalah engine pada komputer untuk melakukan pencarian gambar berdasarkan gambar pada database. Penelitian pada Content Based Image Retrieval (CBIR) Citra Bunga telah dilakukan oleh banyak peneliti. Permasalahan terjadi ketika memilih metode pendekatan seperti preprocessing, ekstraksi fitur dan similarity measure pada CBIR Citra Bunga. Pendekatan yang tidak sesuai dengan data yang diuji, tidak akan memberikan hasil yang optimal. Untuk mengetahui tingkat keberhasilan pendekatan yang digunakan pada CBIR Citra Bunga, digunakan perhitungan nilai precision. Pada penelitian ini, dataset yang akan digunakan adalah dataset Oxford Flower 17. Berdasarkan penelitian sebelumnya, untuk mendapatkan nilai precision yang lebih baik, penelitian ini akan menggunakan ekstraksi fitur warna Hue Saturation Value (HSV), ekstraksi fitur tekstur Gray Level Co-occurrence Matrix (GLCM), dan gabungan kedua fitur dengan pendekatan histogram. Pada penelitian CBIR Citra Bunga ini, terdapat tiga proses yaitu segmentasi menggunakan thresholding, proses ekstraksi fitur, dan pengukuran tingkat kemiripan citra dengan Euclidean Distance. Pengujian pada sistem dilakukan berdasarkan citra yang tersegmentasi dan tidak tersegmentasi. Pengujian sistem dengan hasil Mean Average Precision (MAP) terbesar dihasilkan oleh proses ekstraksi fitur GLCM tidak tersegmentasi sebesar 87,32%, dan untuk nilai MAP terbesar pada citra tersegmentasi dihasilkan pada proses ekstraksi fitur HSV sebesar 83,35%. Kata kunci: Content Based Image Retrieval, ekstraksi fitur HSV, ekstraksi fitur GLCM, thresholding, Euclidean Distance, Mean Average Precision (MAP);--- Searching images based on images in the database, often done to overcome duplication of a work. Content Based Image Retrieval (CBIR) Flower Image is the engine on the computer To perform image-based image search on the database. Research on Content Based Image Retrieval (CBIR) Flower Image has been done by many researchers. Problems occur when choosing approaches such as preprocessing, feature extraction and similarity measure in CBIR Flower Image. Approaches which don't correspond with the data image test, would not provide optimal results. To know the success rate of approach used in CBIR Flower Image, the calculation of precision value is used. In this study, the dataset that will be used is dataset Oxford Flower 17. Based on previous research, to get better precision value, this research will use Hue Saturation Value (HSV) feature extraction, feature extraction of Gray Level Co-occurrence Matrix (GLCM) texture, and combination of both features with histogram approach. In this research, there are three processes: segmentation using thresholding, feature extraction process, and measurement of image similarity level with Euclidean Distance. For testing the system, is based on segmented image and non-segmented image. The result of the largest Mean Average Precision (MAP) produced in this study, resulted from the process of unsegmented image by the GLCM feature extraction of 87.32%, and for the largest MAP value in the segmented image produced by the HSV feature extraction process of 83.35%. Keywords: Content Based Image Retrieval, feature extraction HSV, feature extraction GLCM, thresholding, Euclidean Distance, Mean Average Precision (MAP

    Unsupervised Generative Adversarial Cross-modal Hashing

    Full text link
    Cross-modal hashing aims to map heterogeneous multimedia data into a common Hamming space, which can realize fast and flexible retrieval across different modalities. Unsupervised cross-modal hashing is more flexible and applicable than supervised methods, since no intensive labeling work is involved. However, existing unsupervised methods learn hashing functions by preserving inter and intra correlations, while ignoring the underlying manifold structure across different modalities, which is extremely helpful to capture meaningful nearest neighbors of different modalities for cross-modal retrieval. To address the above problem, in this paper we propose an Unsupervised Generative Adversarial Cross-modal Hashing approach (UGACH), which makes full use of GAN's ability for unsupervised representation learning to exploit the underlying manifold structure of cross-modal data. The main contributions can be summarized as follows: (1) We propose a generative adversarial network to model cross-modal hashing in an unsupervised fashion. In the proposed UGACH, given a data of one modality, the generative model tries to fit the distribution over the manifold structure, and select informative data of another modality to challenge the discriminative model. The discriminative model learns to distinguish the generated data and the true positive data sampled from correlation graph to achieve better retrieval accuracy. These two models are trained in an adversarial way to improve each other and promote hashing function learning. (2) We propose a correlation graph based approach to capture the underlying manifold structure across different modalities, so that data of different modalities but within the same manifold can have smaller Hamming distance and promote retrieval accuracy. Extensive experiments compared with 6 state-of-the-art methods verify the effectiveness of our proposed approach.Comment: 8 pages, accepted by 32th AAAI Conference on Artificial Intelligence (AAAI), 201

    Ad-hoc Biomedical Information Retrieval for Global Pandemics: A Study of Methods Based on the TREC-COVID test collection

    Get PDF
    openThe TREC_COVID Challenge has the goal to create search engines to effectively and efficiently retrieve information produced at a rate never seen before, in the biomedical field. This work focuses on the effectiveness of the information retrieval. The search engine is based on Elasticsearch. A multitude of information retrieval techniques are tested, with the goal of identifying the ones leading to a performance improvement. The techniques' effectiveness is measured using the evaluation measures: P@20, MAP, and BPref. The techniques explored that yield improvement in the search are: custom analyzers, filters, relevance feedback and reciprocal rank fusion. Other tested techniques, that yield negligible results, are: field boosting, bigrams and distance feature. Ultimately, the results are compared to the ones obtained by others in the Challenge.The TREC_COVID Challenge has the goal to create search engines to effectively and efficiently retrieve information produced at a rate never seen before, in the biomedical field. This work focuses on the effectiveness of the information retrieval. The search engine is based on Elasticsearch. A multitude of information retrieval techniques are tested, with the goal of identifying the ones leading to a performance improvement. The techniques' effectiveness is measured using the evaluation measures: P@20, MAP, and BPref. The techniques explored that yield improvement in the search are: custom analyzers, filters, relevance feedback and reciprocal rank fusion. Other tested techniques, that yield negligible results, are: field boosting, bigrams and distance feature. Ultimately, the results are compared to the ones obtained by others in the Challenge

    RETRIEVAL, ACTION AND THE REPRESENTATION OF DISTANCE IN COGNITIVE MAPS

    Get PDF
    This thesis examines the context effects on retrieval, and the influence of action on the representation of distance in cognitive maps. It is proposed that bias in distance estimation is a function of the contexts of retrieval that trigger the representation of action in memory during evaluation tasks. The proposal is consistent with embodied cognition evidence that suggests that actions are implicitly a part of the representation, and will be naturally extracted as part of the retrieval process. The experimental work presented examines two different contextual cues; the frequency of visitation to landmarks, and the importance of activity performed at landmarks. Each cue primes differently the conceptualisation of landmarks prior to making distance estimation. This priming facilitates memory access, which fleshes out relevant spatial information from cognitive maps that are used in distance estimation and route description. This proposal was examined in a series of four experiments that employed structured interviews. Participants had to rate landmarks based on frequency of visitation criteria or importance of activity criteria, or both. They then made verbal distance estimations and route descriptions. The results found implicate the involvement of action representation. The involvement of action in cognitive process was empirically investigated in three further experiments. A new methodology was developed featuring the use of a blindfold, linguistic descriptions, and control of actual movements. Blindfolded participants learned new environments through verbal descriptions by imagining themselves walking in time with the metronome beats. During turns, they were carefully moved. Following instructions, they performed an action at mid-route. Their memories for the newly learned environments were tested through recalls and measured again with the metronome beats. The results found were consistent with explanations based on network-map theory. They implicate attentional processes as an intrinsic part of the cognitive mechanism, and the strings of the network-map as the actual motor program that executes the movement. These results are discussed in relation to the nature of cognitive maps
    • …
    corecore