10 research outputs found

    A Novel Hybrid SVM-CNN Method for Extracting Characteristics and Classifying Cattle Branding

    Get PDF
    A tool that can perform the automatic identification of cattle brandings is essential for the government agencies responsible for the record, control and inspection of this activity. This article presents a novel hybrid method that uses Convolutional Neural Networks (CNN) to extract features from images and Support Vector Machines (SVM) to classify the brandings. The experiments were performed using a cattle branding image set provided by the City Hall of Bagé, Brazil. Metrics of Overall Accuracy, Recall, Precision, Kappa Coefficient, and Processing Time were used in order to assess the proposed tool. The results obtained here were satisfactory, reaching a Overall Accuracy of 93.11% in the first experiment with 39 brandings and 1,950 sample images, and 95.34% of accuracy in the second experiment, with the same 39 brandings, but with 2,730 sample images. The processing time attained in the experiments was 31.661s and 41.749s, respectively

    Métodos computacionais para identificar automaticamente estruturas da retina e quantificar a severidade do edema macular diabético em imagens de fundo de olho

    Get PDF
    Através das imagens de fundo do olho, os especialistas em oftalmologia podem detectar possíveis complicações relacionadas ao Diabetes como a diminuição ou até a perda da capacidade de visão. O Edema Macular Diabético (EMD) é uma das complicações que lideram os casos de danos à visão em pessoas em idade de trabalho. Sendo assim, esta tese apresenta métodos para automaticamente identificar os diferentes níveis de gravidade do Edema Macular Diabético visando auxiliar o especialista no diagnóstico dessa patologia. Como resultado final, propõe-se automaticamente e rapidamente identificar, a partir da imagem, se o paciente possui o EMD leve, moderado ou grave. Utilizando imagens de fundo do olho de um banco de dados livremente disponível na internet (ou seja, o DIARETDB1), o método proposto para a identificação automática do EMD obteve uma precisão de 94,29%. Alguns métodos intermediários necessários para a solução desse problema foram propostos e os resultados publicados na literatura científica.Through color eye fundus images, the eye care specialists can detect possible complications related to diabetes as the vision impairment or vision loss. The Diabetic Macular Edema (DME) is the most common cause of vision damage in working-age people. Therefore, this thesis presents an approach to automatically identify the different levels of severity of diabetic macular edema aiming to assist the expert in the diagnosis of this pathology. As a final result, a methodology to automatically and quickly identify, from the eye fundus image, if a patient has the EMD mild, moderate or severe EMD is proposed. In a preliminary evaluation of our DME grading scheme using publicly available eye fundus images (i.e., DIARETDB1 image database), an accuracy of 94.29% was obtained. Some intermediate methods needed to solve this problem have been proposed and the results published in scientific literature

    Métodos computacionais para identificar automaticamente estruturas da retina e quantificar a severidade do edema macular diabético em imagens de fundo de olho

    Get PDF
    Através das imagens de fundo do olho, os especialistas em oftalmologia podem detectar possíveis complicações relacionadas ao Diabetes como a diminuição ou até a perda da capacidade de visão. O Edema Macular Diabético (EMD) é uma das complicações que lideram os casos de danos à visão em pessoas em idade de trabalho. Sendo assim, esta tese apresenta métodos para automaticamente identificar os diferentes níveis de gravidade do Edema Macular Diabético visando auxiliar o especialista no diagnóstico dessa patologia. Como resultado final, propõe-se automaticamente e rapidamente identificar, a partir da imagem, se o paciente possui o EMD leve, moderado ou grave. Utilizando imagens de fundo do olho de um banco de dados livremente disponível na internet (ou seja, o DIARETDB1), o método proposto para a identificação automática do EMD obteve uma precisão de 94,29%. Alguns métodos intermediários necessários para a solução desse problema foram propostos e os resultados publicados na literatura científica.Through color eye fundus images, the eye care specialists can detect possible complications related to diabetes as the vision impairment or vision loss. The Diabetic Macular Edema (DME) is the most common cause of vision damage in working-age people. Therefore, this thesis presents an approach to automatically identify the different levels of severity of diabetic macular edema aiming to assist the expert in the diagnosis of this pathology. As a final result, a methodology to automatically and quickly identify, from the eye fundus image, if a patient has the EMD mild, moderate or severe EMD is proposed. In a preliminary evaluation of our DME grading scheme using publicly available eye fundus images (i.e., DIARETDB1 image database), an accuracy of 94.29% was obtained. Some intermediate methods needed to solve this problem have been proposed and the results published in scientific literature

    Análise de métodos para suavização de ruídos em imagens de tomografia computadorizada multislice de baixa radiação

    No full text
    The objective of this work is to identify a computational method to smooth noise in images of low radiation multislice computed tomography (MDCT). These images contains low quality, contaminated by noises, which are stochastic phenomena and impossible to predict their occurrence. In order to perform this task, some image processing techniques for noise smoothing were studied. PSNR, SNR, MSE and SSIM metrics were used to evaluate de quality of the processed images. The filters analyzed and simulated were the average, median, mode, gaussian and Wiener, from the spatial domain. After some simulations, it was verified that the gaussian filter technique presented superior results with an average PSNR of 25.64dB and an average SSIM of 0.76, for the best cases.Este trabalho tem como objetivo identificar um método computacional para suavizar ruídos de imagens de tomografia computadorizada multislice (MDCT) de baixa radiação. Essas imagens possuem baixa qualidade pois estão contaminadas por ruídos, que são fenômenos estocásticos, onde não é possível prever seu acontecimento. Dessa forma, foram pesquisadas e estudadas algumas técnicas de processamento de imagens para a suavização de ruídos. Para avaliar a qualidade das imagens processadas, foram utilizadas as métricas PSNR, SNR, MSE e SSIM. Foram analisados e simulados os filtros da média, mediana, moda, gaussiano e Wiener, do domínio espacial. Após a simulação dessas técnicas e uma comparação quantitativa dos valores médios das métricas de qualidade, verificou-se que a técnica do filtro gaussiano apresentou resultados superiores com um PSNR médio de 25.64dB e um SSIM médio de 0.76, para os melhores casos

    A New Approach for Detecting Fundus Lesions Using Image Processing and Deep Neural Network Architecture Based on YOLO Model

    No full text
    Diabetic Retinopathy is one of the main causes of vision loss, and in its initial stages, it presents with fundus lesions, such as microaneurysms, hard exudates, hemorrhages, and soft exudates. Computational models capable of detecting these lesions can help in the early diagnosis of the disease and prevent the manifestation of more severe forms of lesions, helping in screening and defining the best form of treatment. However, the detection of these lesions through computerized systems is a challenge due to numerous factors, such as the characteristics of size and shape of the lesions, noise and the contrast of images available in the public datasets of Diabetic Retinopathy, the number of labeled examples of these lesions available in the datasets and the difficulty of deep learning algorithms in detecting very small objects in digital images. Thus, to overcome these problems, this work proposes a new approach based on image processing techniques, data augmentation, transfer learning, and deep neural networks to assist in the medical diagnosis of fundus lesions. The proposed approach was trained, adjusted, and tested using the public DDR and IDRiD Diabetic Retinopathy datasets and implemented in the PyTorch framework based on the YOLOv5 model. The proposed approach reached in the DDR dataset an mAP of 0.2630 for the IoU limit of 0.5 and F1-score of 0.3485 in the validation stage, and an mAP of 0.1540 for the IoU limit of 0.5 and F1-score of 0.2521, in the test stage. The results obtained in the experiments demonstrate that the proposed approach presented superior results to works with the same purpose found in the literature

    The Pattern Recognition in Cattle Brand using Bag of Visual Words and Support Vector Machines Multi-Class

    No full text
    The recognition images of cattle brand in an automatic way is a necessity to governmental organs responsible for this activity. To help this process, this work presents a method that consists in using Bag of Visual Words for extracting of characteristics from images of cattle brand and Support Vector Machines Multi-Class for classification. This method consists of six stages: a) select database of images; b) extract points of interest (SURF); c) create vocabulary (K-means); d) create vector of image characteristics (visual words); e) train and sort images (SVM); f) evaluate the classification results. The accuracy of the method was tested on database of municipal city hall, where it achieved satisfactory results, reporting 86.02% of accuracy and 56.705 seconds of processing time, respectively

    Mobile Robot Navigation Using an Object Recognition Software with RGBD Images and the YOLO Algorithm

    No full text
    This work presents a vision system based on the YOLO algorithm to identify static objects that could be obstacles in the path of a mobile robot. In order to identify the objects and its distances, a Microsoft Kinect sensor was used. In addition, a Nvidia Jetson TX2 GPU was used to increase the image processing algorithm performance. Our experimental results indicate that the YOLO network has detected all the predefined obstacles for which it has been trained with good reliability and the calculus of the distance using the depth information returned by the Microsoft Kinect camera had an error below of 3,64%

    Article Users Activity Gesture Recognition on Kinect Sensor Using Convolutional Neural Networks and FastDTW for Controlling Movements of a Mobile Robot

    No full text
    In this paper, we use data from the Microsoft Kinect sensor that processes the captured image of a person using and extracting the joints information on every frame. Then, we propose the creation of an image derived from all the sequential frames of a gesture the movement, which facilitates training in a convolutional neural network. We trained a CNN using two strategies: combined training and individual training. The strategies were experimented in the convolutional neural network (CNN) using the MSRC-12 dataset, obtaining an accuracy rate of 86.67% in combined training and 90.78% of accuracy rate in the individual training.. Then, the trained neural network was used to classify data obtained from Kinect with a person, obtaining an accuracy rate of 72.08% in combined training and 81.25% in individualized training. Finally, we use the system to send commands to a mobile robot in order to control it
    corecore