1,700 research outputs found
SenseCam image localisation using hierarchical SURF trees
The SenseCam is a wearable camera that automatically takes photos of the wearer's activities, generating thousands of images per day.
Automatically organising these images for efficient search and retrieval is a challenging task, but can be simplified by providing
semantic information with each photo, such as the wearer's location during capture time. We propose a method for automatically determining the wearer's location using an annotated image database, described using SURF interest point descriptors. We show that SURF out-performs SIFT in matching SenseCam images and that matching can be done efficiently using hierarchical trees of SURF descriptors. Additionally, by re-ranking the top images using bi-directional SURF matches, location matching performance is improved further
Localization of JPEG double compression through multi-domain convolutional neural networks
When an attacker wants to falsify an image, in most of cases she/he will
perform a JPEG recompression. Different techniques have been developed based on
diverse theoretical assumptions but very effective solutions have not been
developed yet. Recently, machine learning based approaches have been started to
appear in the field of image forensics to solve diverse tasks such as
acquisition source identification and forgery detection. In this last case, the
aim ahead would be to get a trained neural network able, given a to-be-checked
image, to reliably localize the forged areas. With this in mind, our paper
proposes a step forward in this direction by analyzing how a single or double
JPEG compression can be revealed and localized using convolutional neural
networks (CNNs). Different kinds of input to the CNN have been taken into
consideration, and various experiments have been carried out trying also to
evidence potential issues to be further investigated.Comment: Accepted to CVPRW 2017, Workshop on Media Forensic
Indexing, browsing and searching of digital video
Video is a communications medium that normally brings together moving pictures with a synchronised audio track into a discrete piece or pieces of information. The size of a “piece ” of video can variously be referred to as a frame, a shot, a scene, a clip, a programme or an episode, and these are distinguished by their lengths and by their composition. We shall return to the definition of each of these in section 4 this chapter. In modern society, video is ver
The Effective of Image Retrieval in Jpeg Compressed Domain
We propose a new method of feature extraction in orderto improve the effective of image retrieving by using apartial Joint Photographic Experts Group (JPEG)compressed images algorithm. Prior to that, we prune theimages database by pre-query step based on coloursimilarity, in order to eliminate image candidates. Ourfeature extraction can be carried out directly to JPEGcompressed images. We extract two features of DCTcoefficients, DC feature and AC feature, from a JPEGcompressed image. Then we compute the Euclideandistances between the query image and the images in adatabase in terms of these two features. The image querysystem will give each retrieved image a rank to define itssimilarity to the query image. Moreover, instead of fullydecompressing JPEG images, our system only needs to dopartial entropy decoding. Therefore, our proposed schemecan accelerate the effectiveness of retrieving images.According to our experimental results, our system is notonly highly effective but is also capable of performingsatisfactoril
- …