25 research outputs found

    Impact of time variability in off-line writer identification and verification

    Full text link
    Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. F. Alonso-Fernández, J. Fiérrez, A. Gilpérez, J Ortega-García, "Impact of time variability in off-line writer identification and verification" in 6th International Symposium on Image and Signal Processing and Analysis (ISPA), Salzburg (Austria), 2009, pp. 540 - 545One of the biggest challenges in person recognition using biometric systems is the variability in the acquired data. In this paper, we evaluate the effects of an increasing time lapse between reference and test biometric data consisting of static images of handwritten signatures and texts. We use for our experiments two recognition approaches exploiting information at the global and local levels, and the BiosecurlD database, containing 3,724 signature images and 532 texts of 133 individuals acquired in four acquisition sessions distributed along a 4 months time span. We report results of the recognition systems working both in verification (one-to-one) and identification (one-to-many) mode. The results show the extent of the impact that the time separation between samples under comparison has on the recognition rates, being the local approach more robust to the time lapse than the global one. We also observe in our experiments that recognition based on handwritten texts provides higher accuracy than recognition based on signatures.This work has been supported by Spanish MCYT TEC2006-13141-C03-03 project

    Generalization Capacity of Handwritten Outlier Symbols Rejection with Neural Network

    No full text
    http://www.suvisoft.comDifferent problems of generalization of outlier rejection exist depending of the context. In this study we firstly define three different problems depending of the outlier availability during the learning phase of the classifier. Then we propose different solutions to reject outliers with two main strategies: add a rejection class to the classifier or delimit its knowledge to better reject what it has not learned. These solutions are compared with ROC curves to recognize handwritten digits and reject handwritten characters. We show that delimiting knowledge of the classifier is important and that using only a partial subset of outliers do not perform a good reject option

    A scale space approach for automatically segmenting words from historical handwritten documents

    Full text link

    Robust Visual Recognition Using Multilayer Generative Neural Networks

    Get PDF
    Deep generative neural networks such as the Deep Belief Network and Deep Boltzmann Machines have been used successfully to model high dimensional visual data. However, they are not robust to common variations such as occlusion and random noise. In this thesis, we explore two strategies for improving the robustness of DBNs. First, we show that a DBN with sparse connections in the first layer is more robust to variations that are not in the training set. Second, we develop a probabilistic denoising algorithm to determine a subset of the hidden layer nodes to unclamp. We show that this can be applied to any feedforward network classifier with localized first layer connections. By utilizing the already available generative model for denoising prior to recognition, we show significantly better performance over the standard DBN implementations for various sources of noise on the standard and Variations MNIST databases

    CaiMU: Caderno Multimodal e Ubíquo

    Get PDF
    Tese de mestrado em Engenharia Informática, apresentada à Universidade de Lisboa, através da Faculdade de Ciências, 2013Desde sempre que o processo de aprendizagem da escrita, neste caso para as crianças, é uma das fases mais importantes da sua vida, sendo que é uma “abertura” a um novo mundo de comunicação. É por isso imprescindível que o aluno consiga ter uma dupla atenção na absorção de todo este conhecimento e na execução contínua dos exercícios com o intuito de melhorar as suas capacidades motoras, visuais e percepcionais. Por outro lado, é também fundamental uma devida atenção da parte do professor na transmissão de conhecimentos: onde por vezes é necessário exemplificar de forma paciente todos os movimentos necessários para a aprendizagem da caligrafia como também tentar materializar números e o seu significado para a aprendizagem da aritmética; e na avaliação de conhecimentos executada por este. Um processo de transmissão e avaliação interessante diz respeito à utilização de ferramentas de ensino que automaticamente ajudam o professor em todo este processo. As ferramentas de ensino são muito pouco utilizadas pelos professores nas salas de aula. Usualmente os mais utilizados são os retroprojectores e os quadros multimédia. No entanto, estes são utilizados de modo tradicional para projectar matéria ou realizar exercícios no quadro, existindo por isso muito pouca interacção dos alunos com estes dispositivos, e o professor irá ter uma tarefa bastante complicada na transmissão e avaliação de todo o conhecimento referido. É, por isso, necessário existir um trabalho mútuo entre professor e aluno, para que ambos se sintam estimulados no decorrer de todo este processo. Assim a existência de uma ferramenta simples e lúdica, em que tanto o professor como o aluno sejam capaz de aperfeiçoar todo este processo é bastante útil. Neste trabalho realizou-se uma plataforma de ensino auxiliado por dispositivos tácteis – Tablet PCs, que suportará a construção de duas ferramentas protótipos: uma vocacionada para assistência do professor, que definirá exercícios e visualizará as execuções dos alunos; e outra capaz de fornecer uma interface apelativa a crianças para execução dos exercícios, de escrita ou aritmética, propostos, com mecanismos de classificação fiáveis de cada exercício e com um retorno visual que a ajude e oriente em tempo real. A arquitectura deste trabalho consiste em quatro componentes principais: a ferramenta do professor, a ferramenta do aluno, o repositório de dados e a plataforma CaiMU. A plataforma CaiMU é responsável por criar, manipular, armazenar e renderizar toda a informação criada pelas ferramentas do professor e aluno. Todas as suas funcionalidades visam conectar o repositório descrito às ferramentas do aluno e do professor. O repositório de dados é responsável por guardar todos os dados que são criados nas várias ferramentas para documentos XML. Estes dados são organizados de forma hierárquica. A ferramenta do professor agrupa um conjunto de interfaces gráficas que visam optimizar e facilitar a criação, edição e revisão de grafemas ou caracteres alfanuméricos, exercícios e aulas. Nesta ferramenta também é possível visualizar e analisar todos os exercícios que o aluno realizou. Por fim, a ferramenta do aluno tem o intuito de receber todos os dados sobre as aulas e exercícios que o professor disponibilizou. Estes, após a sua execução, serão avaliados e armazenados de forma automática pelo sistema para que, posteriormente, o professor os possa analisar. Foi também possível efectuar um conjunto de testes e entrevistas para avaliar toda a aplicação, validar opções de desenho e usabilidade de ambas as ferramentas com o intuito de melhorar e perceber possíveis características a desenvolver. Desta forma, toda a aplicação foi testada e validada cuidadosamente. Esta aplicação será capaz de reduzira dependência entre professor e aluno, optimizando a produtividade entre ambos. Esta redução libertará o professor para executar o seu papel perante alunos com mais dificuldades nesta aprendizagem. Com este trabalho foi então realizado: uma análise do ensino auxiliado por dispositivos; uma plataforma de auxílio ao ensino da escrita e aritmética que permite a ambos os tipos de utilizadores, professor e aluno, aumentarem a sua produtividade e evoluir os seus métodos de trabalho; um conjunto de testes e entrevistas para validar todas as opções tomadas na criação deste sistema.The purpose of this work was to create and improve an application for tactile devices that can be used in the process of writing and arithmetic learning by teachers and students in order to improve teaching methods and children’s motor, visual and perceptional skills, optimizing the productivity of teacher and student. For this technique to perfectly work it is necessary mutual work thus a dependency between both. This application which will support the construction of two prototype tools: one dedicated to teacher’s assistance that will manage the exercises he/she can create and manage the students’ executions, and another able to provide an appealing interface for children to perform the proposed writing or arithmetic exercises, proposed with reliable classification mechanisms for each exercise and with a visual return that helps the student and guide him/her in real time. These prototypes are helped by a platform responsible for create, manipulate, store and render all the information created by these tools. All features are connected by a repository responsible to store all data created to XML documents organized on a hierarchical way. To verify the correctness of this system, a control bed test and some interviews with teachers and students to evaluate all the application, validate design and usability options of both prototypes in order to improve and develop technical features. By this way, all the application was tested and validated carefully

    Advances in Interactive Speech Transcription

    Full text link
    [ES] Novedoso sistema interactivo para la transcripción del habla que compensa el esfuerzo del usuario y el error máximo tolerado en las transcripciones resultantes.[EN] Novel interactive speech transcription system that balances the user effort and the maximum allowed error tolerated for the final resulting transcriptions.Sánchez Cortina, I. (2012). Advances in Interactive Speech Transcription. http://hdl.handle.net/10251/17889Archivo delegad

    Hidden Markov models and neural networks for speech recognition

    Get PDF
    The Hidden Markov Model (HMMs) is one of the most successful modeling approaches for acoustic events in speech recognition, and more recently it has proven useful for several problems in biological sequence analysis. Although the HMM is good at capturing the temporal nature of processes such as speech, it has a very limited capacity for recognizing complex patterns involving more than first order dependencies in the observed data sequences. This is due to the first order state process and the assumption of state conditional independence between observations. Artificial Neural Networks (NNs) are almost the opposite: they cannot model dynamic, temporally extended phenomena very well, but are good at static classification and regression tasks. Combining the two frameworks in a sensible way can therefore lead to a more powerful model with better classification abilities. The overall aim of this work has been to develop a probabilistic hybrid of hidden Markov models and neural networks and ..

    Temperament in Bach's Well-tempered clavier : a historical survey and a new evaluation according to dissonance theory

    Get PDF
    After a historical survey of temperament in Bach's Well-Tempered Clavier by Johann Sebastian Bach, an analysis of the work has been made by applying a number of historical good temperaments as well as some recent proposals. The results obtained show that the global dissonance for all preludes and fugues in major keys can be minimized using the Kirnberger II temperament. The method of analysis used for this research is based on the mathematical theories of sensory dissonance, which have been developed by authors such as Hermann Ludwig Ferdinand von Helmholtz, Harry Partch, Reinier Plomp, Willem J. M. Levelt and William A. SetharesDesprés d'una visió històrica sobre el temperament a El clavecí ben temperat de Johann Sebastian Bach, s'ha realitzat una anàlisi de l'obra aplicant divesos bons temperaments històrics a més d'algunes propostes recents. Els resultats obtinguts demostren que la dissonància global per a tots els preludis i fugues en tonalitats majors pot minimitzar-se utilitzant el temperament Kirnberger II. El mètode d'anàlisi utilitzat per a aquesta recerca està basat en les teories matemàtiques de la dissonància sensorial desenvolupades per autors com Hermann Ludwig Ferdinand von Helmholtz, Harry Partch, Reinier Plomp, Willem J. M. Levelt i William A. Sethare

    THE SALIVARY IL-1? AS PERIODONTAL DISEASE DIAGNOSTIC MARKER

    Get PDF
    Abstract. Periodontal disease is preceeded from gingivitis, if not treated appropriately this inflamation disease will be worse . It turns into periodontitis . more severe . Periodontitis is characterized by loss of tissue attachment and alveolar bone destruction. Interleukin-1? (IL-1?) is one of the main cytokines related to pathology of inflammatory and potentially stimulate bone resorption. Saliva is oral fluid that may reflect the levels of IL-1? as a biological marker of inflammation in periodontal disease. The purpose of this study was to determine the difference between the levels of IL-1? GCF contained in the present in saliva . Samples have as many as 54 people, divided into three main groups, 18 patients with mild gingivitis, 18 with mild periodontitis patients and 18 healthy controls with ages ranging between 17-45 years. Unstimulated whole saliva were collected, then IL-1? levels were measured using ELISA. The average levels of IL-1? contained in saliva by Ramfjord PDI group has significant difference (p <0.05). There are differences between the levels of IL-1? contained in saliva on healthy group, mild gingivitis, and mild periodontitis
    corecore