8 research outputs found

    Towards a Unified Theory of Sensor Pattern Noise: An analysis of dark current, lens effects, and temperature bias in CMOS image sensors

    Get PDF
    Matching images to a discrete camera is of significance in forensic investigation. In the case of digital images, forensic matching is possible through the use of sensor noise present within every image. There exist misconceptions, however, around how this noise reacts under variables such as temperature and the use of different lens systems. This study aims to formulate a revised model of the additive noise for an image sensor to determine if a new method for matching images to sensors could be created which uses fewer resources than the existing methods, and takes into account a wider range of environmental conditions. Specifically, a revised noise model was needed to determine the effects of different lens systems and the impact of temperature on sensor noise. To determine the revised model, an updated literature search was conducted on the background theory relating to CMOS sensors, as the existing work focuses on CCD imaging sensors. This theory was then applied using six off the shelf CMOS imaging sensors with integrated lens systems. An image sensor was examined under scanning electron microscopy and through the use of Energydispersive x-ray spectroscopy the non-uniform structure of individual pixels was visually observed within the sensor. The lens systems were removed and made interchangeable through the use of a 3D printed camera housing. Lens effects were assessed by swapping lens systems between the cameras and using a pinhole lens to remove all optical effects. The temperature was controlled using an Arduino controlled Peltier plate device, and dark current images were obtained at different temperatures using a blackout lens. It was observed that dark current could be used to identify the temperature of the image sensor at the time of acquisition, contrary to the statements in existing literature that sensor pattern noise is temperature invariant. It was shown that the lens system contributes approximately a quarter of the signal power xii used for pattern matching between the image and sensor. Moreover, through the use of targeted signal processing methods and simple ”Goldilocks” filters processing times could be reduced by more than half by sacrificing precision without losing accuracy. This work indicates that sensor pattern noise, while already viable for forensic identification of images to a specific camera, can also be used for identification of an image to a specific lens system and an image sensors temperature. It has also shown that a tool using sensor pattern noise may have a viable future as a forensic method of triage when confronted with large image data sets. Such additional information could prove effective for forensic investigators, intelligence agencies and police when faced with any form of crime involving imaging technology such as fraud, child exploitation or terrorism.Thesis (Ph.D.) -- University of Adelaide, School of Electrical & Electronic Engineering, 201

    Classifiers and machine learning techniques for image processing and computer vision

    Get PDF
    Orientador: Siome Klein GoldensteinTese (doutorado) - Universidade Estadual de Campinas, Instituto da ComputaçãoResumo: Neste trabalho de doutorado, propomos a utilizaçãoo de classificadores e técnicas de aprendizado de maquina para extrair informações relevantes de um conjunto de dados (e.g., imagens) para solução de alguns problemas em Processamento de Imagens e Visão Computacional. Os problemas de nosso interesse são: categorização de imagens em duas ou mais classes, detecçãao de mensagens escondidas, distinção entre imagens digitalmente adulteradas e imagens naturais, autenticação, multi-classificação, entre outros. Inicialmente, apresentamos uma revisão comparativa e crítica do estado da arte em análise forense de imagens e detecção de mensagens escondidas em imagens. Nosso objetivo é mostrar as potencialidades das técnicas existentes e, mais importante, apontar suas limitações. Com esse estudo, mostramos que boa parte dos problemas nessa área apontam para dois pontos em comum: a seleção de características e as técnicas de aprendizado a serem utilizadas. Nesse estudo, também discutimos questões legais associadas a análise forense de imagens como, por exemplo, o uso de fotografias digitais por criminosos. Em seguida, introduzimos uma técnica para análise forense de imagens testada no contexto de detecção de mensagens escondidas e de classificação geral de imagens em categorias como indoors, outdoors, geradas em computador e obras de arte. Ao estudarmos esse problema de multi-classificação, surgem algumas questões: como resolver um problema multi-classe de modo a poder combinar, por exemplo, caracteríisticas de classificação de imagens baseadas em cor, textura, forma e silhueta, sem nos preocuparmos demasiadamente em como normalizar o vetor-comum de caracteristicas gerado? Como utilizar diversos classificadores diferentes, cada um, especializado e melhor configurado para um conjunto de caracteristicas ou classes em confusão? Nesse sentido, apresentamos, uma tecnica para fusão de classificadores e caracteristicas no cenário multi-classe através da combinação de classificadores binários. Nós validamos nossa abordagem numa aplicação real para classificação automática de frutas e legumes. Finalmente, nos deparamos com mais um problema interessante: como tornar a utilização de poderosos classificadores binarios no contexto multi-classe mais eficiente e eficaz? Assim, introduzimos uma tecnica para combinação de classificadores binarios (chamados classificadores base) para a resolução de problemas no contexto geral de multi-classificação.Abstract: In this work, we propose the use of classifiers and machine learning techniques to extract useful information from data sets (e.g., images) to solve important problems in Image Processing and Computer Vision. We are particularly interested in: two and multi-class image categorization, hidden messages detection, discrimination among natural and forged images, authentication, and multiclassification. To start with, we present a comparative survey of the state-of-the-art in digital image forensics as well as hidden messages detection. Our objective is to show the importance of the existing solutions and discuss their limitations. In this study, we show that most of these techniques strive to solve two common problems in Machine Learning: the feature selection and the classification techniques to be used. Furthermore, we discuss the legal and ethical aspects of image forensics analysis, such as, the use of digital images by criminals. We introduce a technique for image forensics analysis in the context of hidden messages detection and image classification in categories such as indoors, outdoors, computer generated, and art works. From this multi-class classification, we found some important questions: how to solve a multi-class problem in order to combine, for instance, several different features such as color, texture, shape, and silhouette without worrying about the pre-processing and normalization of the combined feature vector? How to take advantage of different classifiers, each one custom tailored to a specific set of classes in confusion? To cope with most of these problems, we present a feature and classifier fusion technique based on combinations of binary classifiers. We validate our solution with a real application for automatic produce classification. Finally, we address another interesting problem: how to combine powerful binary classifiers in the multi-class scenario more effectively? How to boost their efficiency? In this context, we present a solution that boosts the efficiency and effectiveness of multi-class from binary techniques.DoutoradoEngenharia de ComputaçãoDoutor em Ciência da Computaçã

    The Role of Distributed Computing in Big Data Science: Case Studies in Forensics and Bioinformatics

    Get PDF
    2014 - 2015The era of Big Data is leading the generation of large amounts of data, which require storage and analysis capabilities that can be only ad- dressed by distributed computing systems. To facilitate large-scale distributed computing, many programming paradigms and frame- works have been proposed, such as MapReduce and Apache Hadoop, which transparently address some issues of distributed systems and hide most of their technical details. Hadoop is currently the most popular and mature framework sup- porting the MapReduce paradigm, and it is widely used to store and process Big Data using a cluster of computers. The solutions such as Hadoop are attractive, since they simplify the transformation of an application from non-parallel to the distributed one by means of general utilities and without many skills. However, without any algorithm engineering activity, some target applications are not alto- gether fast and e cient, and they can su er from several problems and drawbacks when are executed on a distributed system. In fact, a distributed implementation is a necessary but not su cient condition to obtain remarkable performance with respect to a non-parallel coun- terpart. Therefore, it is required to assess how distributed solutions are run on a Hadoop cluster, and/or how their performance can be improved to reduce resources consumption and completion times. In this dissertation, we will show how Hadoop-based implementations can be enhanced by using carefully algorithm engineering activity, tuning, pro ling and code improvements. It is also analyzed how to achieve these goals by working on some critical points, such as: data local computation, input split size, number and granularity of tasks, cluster con guration, input/output representation, etc. i In particular, to address these issues, we choose some case studies coming from two research areas where the amount of data is rapidly increasing, namely, Digital Image Forensics and Bioinformatics. We mainly describe full- edged implementations to show how to design, engineer, improve and evaluate Hadoop-based solutions for Source Camera Identi cation problem, i.e., recognizing the camera used for taking a given digital image, adopting the algorithm by Fridrich et al., and for two of the main problems in Bioinformatics, i.e., alignment- free sequence comparison and extraction of k-mer cumulative or local statistics. The results achieved by our improved implementations show that they are substantially faster than the non-parallel counterparts, and re- markably faster than the corresponding Hadoop-based naive imple- mentations. In some cases, for example, our solution for k-mer statis- tics is approximately 30× faster than our Hadoop-based naive im- plementation, and about 40× faster than an analogous tool build on Hadoop. In addition, our applications are also scalable, i.e., execution times are (approximately) halved by doubling the computing units. Indeed, algorithm engineering activities based on the implementation of smart improvements and supported by careful pro ling and tun- ing may lead to a much better experimental performance avoiding potential problems. We also highlight how the proposed solutions, tips, tricks and insights can be used in other research areas and problems. Although Hadoop simpli es some tasks of the distributed environ- ments, we must thoroughly know it to achieve remarkable perfor- mance. It is not enough to be an expert of the application domain to build Hadop-based implementations, indeed, in order to achieve good performance, an expert of distributed systems, algorithm engi- neering, tuning, pro ling, etc. is also required. Therefore, the best performance depend heavily on the cooperation degree between the domain expert and the distributed algorithm engineer. [edited by Author]XIV n.s

    Experimental Analysis of the Pixel Non Uniformity (PNU) in SEM for Digital Forensics Purposes

    No full text
    Recent years saw an explosion in the number of the counterfeit or stolen images in scientific papers. In particular in the field of biomedical science publication this is becoming a serious problem for the health and economic issues caused by this fraud [1]. In this paper we investigate the possibility to extend a technique commonly used in image forensics to associate a given image with the camera used to take it. The original technique, proposed by Fridrich et al. in [3] uses the PNU, a unique fingerprint present in each photo and generated by natural the imperfection in the silicium slice that composes the Charge-Coupled Device (CCD) sensor. We analyze the quality of the PNU present in the residual noise by evaluating the quality of this noise using its variance. The experimental results shows that some PNU is still present in the residual noise, but is less than the one present in photo from digital cameras. This technique of evaluation is promisingly because is possible to use also to speedup the source camera identification process in videos by excluding the frames that not preserving enough PNU in the residual noise
    corecore