Search CORE

686 research outputs found

Automated Visual Fin Identification of Individual Great White Sharks

Author: Burghardt Tilo
Hughes Benjamin
Publication venue
Publication date: 01/10/2016
Field of study

This paper discusses the automated visual identification of individual great white sharks from dorsal fin imagery. We propose a computer vision photo ID system and report recognition results over a database of thousands of unconstrained fin images. To the best of our knowledge this line of work establishes the first fully automated contour-based visual ID system in the field of animal biometrics. The approach put forward appreciates shark fins as textureless, flexible and partially occluded objects with an individually characteristic shape. In order to recover animal identities from an image we first introduce an open contour stroke model, which extends multi-scale region segmentation to achieve robust fin detection. Secondly, we show that combinatorial, scale-space selective fingerprinting can successfully encode fin individuality. We then measure the species-specific distribution of visual individuality along the fin contour via an embedding into a global `fin space'. Exploiting this domain, we finally propose a non-linear model for individual animal recognition and combine all approaches into a fine-grained multi-instance framework. We provide a system evaluation, compare results to prior work, and report performance and properties in detail.Comment: 17 pages, 16 figures. To be published in IJCV. Article replaced to update first author contact details and to correct a Figure reference on page

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

Explore Bristol Research

Ensemble of convolutional neural networks to improve animal audio classification

Author: Carlos N. Silla
Loris Nanni
Rafael B. Mangolin
Rafael L. Aguiar
Sheryl Brahnam
Yandre M. G. Costa
Publication venue
Publication date: 01/01/2020
Field of study

Abstract In this work, we present an ensemble for automated audio classification that fuses different types of features extracted from audio files. These features are evaluated, compared, and fused with the goal of producing better classification accuracy than other state-of-the-art approaches without ad hoc parameter optimization. We present an ensemble of classifiers that performs competitively on different types of animal audio datasets using the same set of classifiers and parameter settings. To produce this general-purpose ensemble, we ran a large number of experiments that fine-tuned pretrained convolutional neural networks (CNNs) for different audio classification tasks (bird, bat, and whale audio datasets). Six different CNNs were tested, compared, and combined. Moreover, a further CNN, trained from scratch, was tested and combined with the fine-tuned CNNs. To the best of our knowledge, this is the largest study on CNNs in animal audio classification. Our results show that several CNNs can be fine-tuned and fused for robust and generalizable audio classification. Finally, the ensemble of CNNs is combined with handcrafted texture descriptors obtained from spectrograms for further improvement of performance. The MATLAB code used in our experiments will be provided to other researchers for future comparisons at https://github.com/LorisNanni

Open Access Repository

Archivio istituzionale della ricerca - Università di Padova

Advances in Signal Processing for Maritime Applications

Author: Ehlers Frank
Fox Warren
Maiwald Dirk
Ulmke Martin
Wood Gary
Publication venue
Publication date: 01/01/2010
Field of study

Springer - Publisher Connector

Directory of Open Access Journals

Fraunhofer-ePrints

Open Access Repository

Stationary region predictor using a stationary camera

Author: Clarke W.A.
De Villiers B.Z.
Roodt Y.
Roos H.
Publication venue: SATNAC
Publication date: 01/01/2011
Field of study

A method to determine the stationery probability of regions or feature points in a video sequence is proposed in this paper. This is done by identifying feature points using the Harris corner detector, finding descriptors for the feature points and then tracking the feature points. The information gained from tracking the feature points is then used to determine the stationery probability of these features. This method is shown to successfully identify probable stationery and moving regions in video sequences

University of Johannesburg Institutional Repository

Finding Nemo’s Giant Cousin: Keypoint Matching for Robust Re-Identification of Giant Sunfish

Author: Moeslund Thomas B.
Nyegaard Marianne
Pedersen Malte
Publication venue
Publication date: 01/05/2023
Field of study

VBN

IST Austria Thesis

Author: Sharmanska Viktoriia
Publication venue: IST Austria
Publication date: 01/01/2015
Field of study

The human ability to recognize objects in complex scenes has driven research in the computer vision field over couple of decades. This thesis focuses on the object recognition task in images. That is, given the image, we want the computer system to be able to predict the class of the object that appears in the image. A recent successful attempt to bridge semantic understanding of the image perceived by humans and by computers uses attribute-based models. Attributes are semantic properties of the objects shared across different categories, which humans and computers can decide on. To explore the attribute-based models we take a statistical machine learning approach, and address two key learning challenges in view of object recognition task: learning augmented attributes as mid-level discriminative feature representation, and learning with attributes as privileged information. Our main contributions are parametric and non-parametric models and algorithms to solve these frameworks. In the parametric approach, we explore an autoencoder model combined with the large margin nearest neighbor principle for mid-level feature learning, and linear support vector machines for learning with privileged information. In the non-parametric approach, we propose a supervised Indian Buffet Process for automatic augmentation of semantic attributes, and explore the Gaussian Processes classification framework for learning with privileged information. A thorough experimental analysis shows the effectiveness of the proposed models in both parametric and non-parametric views

IST Austria: PubRep (Institute of Science and Technology)