93 research outputs found

    Classifiers and machine learning techniques for image processing and computer vision

    Get PDF
    Orientador: Siome Klein GoldensteinTese (doutorado) - Universidade Estadual de Campinas, Instituto da ComputaçãoResumo: Neste trabalho de doutorado, propomos a utilizaçãoo de classificadores e técnicas de aprendizado de maquina para extrair informações relevantes de um conjunto de dados (e.g., imagens) para solução de alguns problemas em Processamento de Imagens e Visão Computacional. Os problemas de nosso interesse são: categorização de imagens em duas ou mais classes, detecçãao de mensagens escondidas, distinção entre imagens digitalmente adulteradas e imagens naturais, autenticação, multi-classificação, entre outros. Inicialmente, apresentamos uma revisão comparativa e crítica do estado da arte em análise forense de imagens e detecção de mensagens escondidas em imagens. Nosso objetivo é mostrar as potencialidades das técnicas existentes e, mais importante, apontar suas limitações. Com esse estudo, mostramos que boa parte dos problemas nessa área apontam para dois pontos em comum: a seleção de características e as técnicas de aprendizado a serem utilizadas. Nesse estudo, também discutimos questões legais associadas a análise forense de imagens como, por exemplo, o uso de fotografias digitais por criminosos. Em seguida, introduzimos uma técnica para análise forense de imagens testada no contexto de detecção de mensagens escondidas e de classificação geral de imagens em categorias como indoors, outdoors, geradas em computador e obras de arte. Ao estudarmos esse problema de multi-classificação, surgem algumas questões: como resolver um problema multi-classe de modo a poder combinar, por exemplo, caracteríisticas de classificação de imagens baseadas em cor, textura, forma e silhueta, sem nos preocuparmos demasiadamente em como normalizar o vetor-comum de caracteristicas gerado? Como utilizar diversos classificadores diferentes, cada um, especializado e melhor configurado para um conjunto de caracteristicas ou classes em confusão? Nesse sentido, apresentamos, uma tecnica para fusão de classificadores e caracteristicas no cenário multi-classe através da combinação de classificadores binários. Nós validamos nossa abordagem numa aplicação real para classificação automática de frutas e legumes. Finalmente, nos deparamos com mais um problema interessante: como tornar a utilização de poderosos classificadores binarios no contexto multi-classe mais eficiente e eficaz? Assim, introduzimos uma tecnica para combinação de classificadores binarios (chamados classificadores base) para a resolução de problemas no contexto geral de multi-classificação.Abstract: In this work, we propose the use of classifiers and machine learning techniques to extract useful information from data sets (e.g., images) to solve important problems in Image Processing and Computer Vision. We are particularly interested in: two and multi-class image categorization, hidden messages detection, discrimination among natural and forged images, authentication, and multiclassification. To start with, we present a comparative survey of the state-of-the-art in digital image forensics as well as hidden messages detection. Our objective is to show the importance of the existing solutions and discuss their limitations. In this study, we show that most of these techniques strive to solve two common problems in Machine Learning: the feature selection and the classification techniques to be used. Furthermore, we discuss the legal and ethical aspects of image forensics analysis, such as, the use of digital images by criminals. We introduce a technique for image forensics analysis in the context of hidden messages detection and image classification in categories such as indoors, outdoors, computer generated, and art works. From this multi-class classification, we found some important questions: how to solve a multi-class problem in order to combine, for instance, several different features such as color, texture, shape, and silhouette without worrying about the pre-processing and normalization of the combined feature vector? How to take advantage of different classifiers, each one custom tailored to a specific set of classes in confusion? To cope with most of these problems, we present a feature and classifier fusion technique based on combinations of binary classifiers. We validate our solution with a real application for automatic produce classification. Finally, we address another interesting problem: how to combine powerful binary classifiers in the multi-class scenario more effectively? How to boost their efficiency? In this context, we present a solution that boosts the efficiency and effectiveness of multi-class from binary techniques.DoutoradoEngenharia de ComputaçãoDoutor em Ciência da Computaçã

    Content-based image retrieval and its benefits for the stock photography market

    Get PDF
    The development of powerful low-cost desktop computer systems has changed the pre-press business where tight deadlines must be met per sistently. An increasing number of newspapers and magazines are acquiring, handling, and storing images digitally while the use of hardcopies and slides decreases. Today\u27s computers and high capacity storage-media enable stock pho tography agencies to build digital image databases, giving users fast access to large numbers of images. However, the transition from analog to digital image archives imposes new problems: with thousands of images at hand, the search for a particular image may turn into the search for the needle in a haystack. The first image Database Management Systems (DBMSs) were extended text DBMSs, which stored the image data along with a set of manually entered descriptive keywords. The major problem with this approach is that there is no generally agreed-upon language to describe images. Even sophis ticated DBMSs are unable to detect synonyms; hence, an image described with certain properties such as curvy may not be found if a user enters wavy as a search criterion. Furthermore, some image properties are hard to describe with keywords. A search is likely to fail if properties were not described at the database population stage when images are added to the database. Finally, assigning a sufficient set of keywords to every image adds a tremendous amount of labor to the population stage. Research at many scientific institutions and companies is geared towards overcoming the shortcomings of image DBMSs with keyword-based search engines. Pattern recognition which allows for comparing images based on their visual content is being introduced to image DBMSs, improving the accuracy of search engines. Sketches, sample images, and other means of describing the visual content of images may be used as search criteria in addition to keywords. This thesis project summarizes the basics of pattern recognition and its applications in image database management for contentbased image retrieval. The purpose of this thesis project is to determine the impact of contentbased image retrieval on the stock photography market in the near future. In order to obtain the necessary information, two different questionnaires were sent out to a number of selected stock photography agencies, newspapers, and magazines. The evaluation of the replies was conducted for the three groups separately. The replies from stock photography agencies showed a high interest in digital image archives. They also showed concerns about increased overhead with digital archives. The estimated amount of work required for categoriz ing images and assigning keywords ranged from fifty to ninety percent as compared to ten to fifty percent for scanning. All survey participants agreed that pattern recognition can improve the accuracy of keyword-based search engines. However, they all denied that this approach would reduce the need for assigning keywords. Different needs could be determined for newspaper and magazines. Newspapers rely heavily on keywords since images are often chosen based upon the circumstances under which they were taken while their visual con tent may be secondary. Therefore, newspapers\u27 profits from content-based image retrieval are minute. For magazines, the visual content of images seemed to have a higher priority and the appreciation for corresponding search capabilities was accordingly higher. To summarize, users of digital image archives can profit from contentbased image retrieval if the visual content is an important issue. For image providers, there are a number of reasons that delay the transition to contentbased image retrieval. Currently, there is only one shrink-wrapped commer cial product available that meets the needs of stock photography agencies. This product requires additional work for fully exhausting its capabilities. Finally, many companies have already built their image database and the transition to another system is time-consuming, expensive, and risky

    High imperceptibility and robustness watermarking scheme for brain MRI using Slantlet transform coupled with enhanced knight tour algorithm

    Get PDF
    This research introduces a novel and robust watermarking scheme for medical Brain MRI DICOM images, addressing the challenge of maintaining high imperceptibility and robustness simultaneously. The scheme ensures privacy control, content authentication, and protection against the detachment of vital Electronic Patient Record information. To enhance imperceptibility, a Dynamic Visibility Threshold parameter leveraging the Human Visual System is introduced. Embeddable Zones and Non-Embeddable Zones are defined to enhance robustness, and an enhanced Knight Tour algorithm based on Slantlet Transform shuffles the embedding sequence for added security. The scheme achieves remarkable results with a Peak Signal-to-Noise Ratio (PSNR) evaluation surpassing contemporary techniques. Extensive experimentation demonstrates resilience to various attacks, with low Bit Error Rate (BER) and high Normalized Cross-Correlation (NCC) values. The proposed technique outperforms existing methods, emphasizing its superior performance and effectiveness in medical image watermarking

    Digital watermarking and novel security devices

    Get PDF
    EThOS - Electronic Theses Online ServiceGBUnited Kingdo

    Medicines Reuse

    Get PDF
    This reprint examines the concept of medicines reuse, the idea that unused medication returned by one patient can be re-dispensed for use by another. Ten papers written by over 20 authors examine a range of issues related to medicines reuse including the circular economy of the pharmaceutical supply chain; the prevalence of unused medicines or medication waste within patients' homes; people's views about the causes of medication waste and the potential for medicines reuse; what might influence people to reuse medicines in the future; how sensing technologies might facilitate medicines reuse; and the effect of including sensing technologies on people's willingness to consider medicines reuse

    Discrete Wavelet Transforms

    Get PDF
    The discrete wavelet transform (DWT) algorithms have a firm position in processing of signals in several areas of research and industry. As DWT provides both octave-scale frequency and spatial timing of the analyzed signal, it is constantly used to solve and treat more and more advanced problems. The present book: Discrete Wavelet Transforms: Algorithms and Applications reviews the recent progress in discrete wavelet transform algorithms and applications. The book covers a wide range of methods (e.g. lifting, shift invariance, multi-scale analysis) for constructing DWTs. The book chapters are organized into four major parts. Part I describes the progress in hardware implementations of the DWT algorithms. Applications include multitone modulation for ADSL and equalization techniques, a scalable architecture for FPGA-implementation, lifting based algorithm for VLSI implementation, comparison between DWT and FFT based OFDM and modified SPIHT codec. Part II addresses image processing algorithms such as multiresolution approach for edge detection, low bit rate image compression, low complexity implementation of CQF wavelets and compression of multi-component images. Part III focuses watermaking DWT algorithms. Finally, Part IV describes shift invariant DWTs, DC lossless property, DWT based analysis and estimation of colored noise and an application of the wavelet Galerkin method. The chapters of the present book consist of both tutorial and highly advanced material. Therefore, the book is intended to be a reference text for graduate students and researchers to obtain state-of-the-art knowledge on specific applications

    Handbook of Digital Face Manipulation and Detection

    Get PDF
    This open access book provides the first comprehensive collection of studies dealing with the hot topic of digital face manipulation such as DeepFakes, Face Morphing, or Reenactment. It combines the research fields of biometrics and media forensics including contributions from academia and industry. Appealing to a broad readership, introductory chapters provide a comprehensive overview of the topic, which address readers wishing to gain a brief overview of the state-of-the-art. Subsequent chapters, which delve deeper into various research challenges, are oriented towards advanced readers. Moreover, the book provides a good starting point for young researchers as well as a reference guide pointing at further literature. Hence, the primary readership is academic institutions and industry currently involved in digital face manipulation and detection. The book could easily be used as a recommended text for courses in image processing, machine learning, media forensics, biometrics, and the general security area

    Study and Implementation of Watermarking Algorithms

    Get PDF
    Water Making is the process of embedding data called a watermark into a multimedia object such that watermark can be detected or extracted later to make an assertion about the object. The object may be an audio, image or video. A copy of a digital image is identical to the original. This has in many instances, led to the use of digital content with malicious intent. One way to protect multimedia data against illegal recording and retransmission is to embed a signal, called digital signature or copyright label or watermark that authenticates the owner of the data. Data hiding, schemes to embed secondary data in digital media, have made considerable progress in recent years and attracted attention from both academia and industry. Techniques have been proposed for a variety of applications, including ownership protection, authentication and access control. Imperceptibility, robustness against moderate processing such as compression, and the ability to hide many bits are the basic but rat..
    corecore