490 research outputs found

    Exploiting multimedia in creating and analysing multimedia Web archives

    No full text
    The data contained on the web and the social web are inherently multimedia and consist of a mixture of textual, visual and audio modalities. Community memories embodied on the web and social web contain a rich mixture of data from these modalities. In many ways, the web is the greatest resource ever created by human-kind. However, due to the dynamic and distributed nature of the web, its content changes, appears and disappears on a daily basis. Web archiving provides a way of capturing snapshots of (parts of) the web for preservation and future analysis. This paper provides an overview of techniques we have developed within the context of the EU funded ARCOMEM (ARchiving COmmunity MEMories) project to allow multimedia web content to be leveraged during the archival process and for post-archival analysis. Through a set of use cases, we explore several practical applications of multimedia analytics within the realm of web archiving, web archive analysis and multimedia data on the web in general

    Pengenalan Citra Logo Kendaraan Menggunakan Metode Gray Level Co-Occurence Matrix (Glcm) dan Jst-Backpropagation

    Get PDF
    A car is a vehicle that has a varied shape or model but the difference is the brand or logo. Vehicle logos have their own meaning and meaning for car industry companies. The logo should have a practical and effective or efficient function so that the logo form is part of the marketing and branding program of the car industry company [1]. There are three types of car logos that are now known, in the form of symbols, text, or a combination between the two. The logo is always in the front and back of the car body and usually has a lighter color than the color of the vehicle. One that supports the development of technology is how to recognize a vehicle either from the brand, shape, model and color of the vehicle. Some references that are deemed feasible to help this research include utilizing the weaknesses and weaknesses of the results of previous research, including a paper entitled. Scale Invariant Feature Transform (SIFT) [2]. SIFT is combined with Logistic Regression [3] based on Gradient Orientation Histogram (HOG). Logo Recognition Using Probabilistic Neural Networks [4]. Therefore, the researchers wanted to focus on the logo recognition using the extraction of the Gray Level Co-occurrence Matrix (GLCM) feature. Testing and training testing using ANN-Backpropagation. From the results of this study the best accuracy obtained 95.7%, so that GLCM and ANN-Backpropagation can recognize the image of the vehicle logo.DOI : 10.29408/jit.v1i1.89

    Vehicle Logo Recognition by Spatial-SIFT Combined with Logistic Regression

    Get PDF
    An efficient recognition framework requires both good feature representation and effective classification methods. This paper proposes such a framework based on a spatial Scale Invariant Feature Transform (SIFT) combined with a logistic regression classifier. The performance of the proposed framework is compared to that of state-of-the-art methods based on the Histogram of Orientation Gradients, SIFT features, Support Vector Machine and K-Nearest Neighbours classifiers. By testing with the largest vehicle logo data-set, it is shown that the proposed framework can achieve a classification accuracy of 99.93%, the best among all studied methods. Moreover, the proposed framework shows robustness when noise is added in both training and testing images

    Vehicle logo recognition using histograms of oriented gradient descriptor and sparsity score

    Get PDF
    Most of vehicle have the similar structures and designs. It is extremely complicated and difficult to identify and classify vehicle brands based on their structure and shape. As we requirea quick and reliable response, so vehicle logos are an alternative method of determining the type of a vehicle. In this paper, we propose a method for vehicle logo recognition based on featureĀ  selection method in a hybrid way. Vehicle logo images are first characterized by histograms of oriented gradient descriptors and the final features vector are then applied feature selection method to reduce the irrelevant information. Moreover, we release a new benchmark dataset for vehicle logo recognition and retrieval task namely, VLR-40. The experimental results are evaluated on this database which show the efficiency of the proposed approach

    Vehicle make and model recognition for intelligent transportation monitoring and surveillance.

    Get PDF
    Vehicle Make and Model Recognition (VMMR) has evolved into a significant subject of study due to its importance in numerous Intelligent Transportation Systems (ITS), such as autonomous navigation, traffic analysis, traffic surveillance and security systems. A highly accurate and real-time VMMR system significantly reduces the overhead cost of resources otherwise required. The VMMR problem is a multi-class classification task with a peculiar set of issues and challenges like multiplicity, inter- and intra-make ambiguity among various vehicles makes and models, which need to be solved in an efficient and reliable manner to achieve a highly robust VMMR system. In this dissertation, facing the growing importance of make and model recognition of vehicles, we present a VMMR system that provides very high accuracy rates and is robust to several challenges. We demonstrate that the VMMR problem can be addressed by locating discriminative parts where the most significant appearance variations occur in each category, and learning expressive appearance descriptors. Given these insights, we consider two data driven frameworks: a Multiple-Instance Learning-based (MIL) system using hand-crafted features and an extended application of deep neural networks using MIL. Our approach requires only image level class labels, and the discriminative parts of each target class are selected in a fully unsupervised manner without any use of part annotations or segmentation masks, which may be costly to obtain. This advantage makes our system more intelligent, scalable, and applicable to other fine-grained recognition tasks. We constructed a dataset with 291,752 images representing 9,170 different vehicles to validate and evaluate our approach. Experimental results demonstrate that the localization of parts and distinguishing their discriminative powers for categorization improve the performance of fine-grained categorization. Extensive experiments conducted using our approaches yield superior results for images that were occluded, under low illumination, partial camera views, or even non-frontal views, available in our real-world VMMR dataset. The approaches presented herewith provide a highly accurate VMMR system for rea-ltime applications in realistic environments.\\ We also validate our system with a significant application of VMMR to ITS that involves automated vehicular surveillance. We show that our application can provide law inforcement agencies with efficient tools to search for a specific vehicle type, make, or model, and to track the path of a given vehicle using the position of multiple cameras

    Online Vehicle Logo Recognition Using Cauchy Prior Logistic Regression

    Get PDF
    Vehicle logo recognition is an important part of vehicle identification in intelligent transportation systems. State-of-the-art vehicle logo recognition approaches typically consider training models on large datasets. However, there might only be a small training dataset to start with and more images can be obtained during the real-time applications. This paper proposes an online image recognition framework which provides solutions for both small and large datasets. Using this recognition framework, models are built efficiently using a weight updating scheme. Another novelty of this work is that the Cauchy prior logistic regression with conjugate gradient descent is proposed to deal with the multinomial classification tasks. The Cauchy prior results in a quicker convergence speed for the weight updating process which could decrease the computational cost for both online and offline methods. By testing with a publicly available dataset, the Cauchy prior logistic regression deceases the classification time by 59%. An accuracy of up to 98.80% is achieved when the proposed framework is applied

    Use of Coherent Point Drift in computer vision applications

    Get PDF
    This thesis presents the novel use of Coherent Point Drift in improving the robustness of a number of computer vision applications. CPD approach includes two methods for registering two images - rigid and non-rigid point set approaches which are based on the transformation model used. The key characteristic of a rigid transformation is that the distance between points is preserved, which means it can be used in the presence of translation, rotation, and scaling. Non-rigid transformations - or affine transforms - provide the opportunity of registering under non-uniform scaling and skew. The idea is to move one point set coherently to align with the second point set. The CPD method finds both the non-rigid transformation and the correspondence distance between two point sets at the same time without having to use a-priori declaration of the transformation model used. The first part of this thesis is focused on speaker identification in video conferencing. A real-time, audio-coupled video based approach is presented, which focuses more on the video analysis side, rather than the audio analysis that is known to be prone to errors. CPD is effectively utilised for lip movement detection and a temporal face detection approach is used to minimise false positives if face detection algorithm fails to perform. The second part of the thesis is focused on multi-exposure and multi-focus image fusion with compensation for camera shake. Scale Invariant Feature Transforms (SIFT) are first used to detect keypoints in images being fused. Subsequently this point set is reduced to remove outliers, using RANSAC (RANdom Sample Consensus) and finally the point sets are registered using CPD with non-rigid transformations. The registered images are then fused with a Contourlet based image fusion algorithm that makes use of a novel alpha blending and filtering technique to minimise artefacts. The thesis evaluates the performance of the algorithm in comparison to a number of state-of-the-art approaches, including the key commercial products available in the market at present, showing significantly improved subjective quality in the fused images. The final part of the thesis presents a novel approach to Vehicle Make & Model Recognition in CCTV video footage. CPD is used to effectively remove skew of vehicles detected as CCTV cameras are not specifically configured for the VMMR task and may capture vehicles at different approaching angles. A LESH (Local Energy Shape Histogram) feature based approach is used for vehicle make and model recognition with the novelty that temporal processing is used to improve reliability. A number of further algorithms are used to maximise the reliability of the final outcome. Experimental results are provided to prove that the proposed system demonstrates an accuracy in excess of 95% when tested on real CCTV footage with no prior camera calibration
    • ā€¦
    corecore