Search CORE

21 research outputs found

Benchmark databases for video-based automatic sign language recognition

Author: Athitsos V.
Dreuw P.
Neidle Carol
Ney H.
Sclaroff Stanley
Publication venue: EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA
Publication date: 10/01/2023
Field of study

Boston University Institutional Repository (OpenBU)

Advanced Speech Communication System for Deaf People

Author: Córdoba Herralde Ricardo de
Ferreiros López Javier
Lebai Lutfi Syaheerah Binti
Lopez Ludeña Veronica
Martin Maganto Raquel
Pardo Muñoz José Manuel
San Segundo Hernández Rubén
Publication venue: E.T.S.I. Telecomunicación (UPM)
Publication date: 01/01/2010
Field of study

This paper describes the development of an Advanced Speech Communication System for Deaf People and its field evaluation in a real application domain: the renewal of Driver’s License. The system is composed of two modules. The first one is a Spanish into Spanish Sign Language (LSE: Lengua de Signos Española) translation module made up of a speech recognizer, a natural language translator (for converting a word sequence into a sequence of signs), and a 3D avatar animation module (for playing back the signs). The second module is a Spoken Spanish generator from sign writing composed of a visual interface (for specifying a sequence of signs), a language translator (for generating the sequence of words in Spanish), and finally, a text to speech converter. For language translation, the system integrates three technologies: an example based strategy, a rule based translation method and a statistical translator. This paper also includes a detailed description of the evaluation carried out in the Local Traffic Office in the city of Toledo (Spain) involving real government employees and deaf people. This evaluation includes objective measurements from the system and subjective information from questionnaire

Archivo Digital UPM

Spanish generation from Spanish Sign Language using a phrase-based translation system

Author: Echeverry Correa Julian David
Lopez Ludeña Veronica
Lucas Cuesta Juan Manuel
Martin Maganto Raquel
San Segundo Hernández Rubén
Publication venue: E.T.S.I. Telecomunicación (UPM)
Publication date: 01/01/2010
Field of study

This paper describes the development of a Spoken Spanish generator from Spanish Sign Language (LSE – Lengua de Signos Española) in a specific domain: the renewal of Identity Document and Driver’s license. The system is composed of three modules. The first one is an interface where a deaf person can specify a sign sequence in sign-writing. The second one is a language translator for converting the sign sequence into a word sequence. Finally, the last module is a text to speech converter. Also, the paper describes the generation of a parallel corpus for the system development composed of more than 4,000 Spanish sentences and their LSE translations in the application domain. The paper is focused on the translation module that uses a statistical strategy with a phrase-based translation model, and this paper analyses the effect of the alignment configuration used during the process of word based translation model generation. Finally, the best configuration gives a 3.90% mWER and a 0.9645 BLEU

Archivo Digital UPM

Computer-based tracking, analysis, and visualization of linguistically significant nonmanual events in American Sign Language (ASL)

Author: Liu Bo
Liu Jingjing
Metaxas Dimitris
Neidle Carol
Peng Xi
Vogler Christian
Publication venue: EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA
Publication date: 01/01/2014
Field of study

Our linguistically annotated American Sign Language (ASL) corpora have formed a basis for research to automate detection by computer of essential linguistic information conveyed through facial expressions and head movements. We have tracked head position and facial deformations, and used computational learning to discern specific grammatical markings. Our ability to detect, identify, and temporally localize the occurrence of such markings in ASL videos has recently been improved by incorporation of (1) new techniques for deformable model-based 3D tracking of head position and facial expressions, which provide significantly better tracking accuracy and recover quickly from temporary loss of track due to occlusion; and (2) a computational learning approach incorporating 2-level Conditional Random Fields (CRFs), suited to the multi-scale spatio-temporal characteristics of the data, which analyses not only low-level appearance characteristics, but also the patterns that enable identification of significant gestural components, such as periodic head movements and raised or lowered eyebrows. Here we summarize our linguistically motivated computational approach and the results for detection and recognition of nonmanual grammatical markings; demonstrate our data visualizations, and discuss the relevance for linguistic research; and describe work underway to enable such visualizations to be produced over large corpora and shared publicly on the Web

Boston University Institutional Repository (OpenBU)

Modelling a parallel corpus of French and French Belgian Sign Language (LSFB)

Author: Cleve Anthony
Gobert Maxime
Meurant Laurence
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2016
Field of study

Repository of the University of Namur

Language Resources for Spanish - Spanish Sign Language (LSE) translation

Author: Garcia Adolfo
Lopez Ludeña Veronica
Martin Maganto Raquel
San Segundo Hernández Rubén
Sánchez David
Publication venue: E.T.S.I. Telecomunicación (UPM)
Publication date: 01/01/2010
Field of study

This paper describes the development of a Spanish Spanish Sign Language (LSE) translation system. Firstly, it describes the first Spanish Spanish Sign Language (LSE) parallel corpus focused on two specific domains: the renewal of the Identity Document and Driver’s License. This corpus includes more than 4,000 Spanish sentences (in these domains), their LSE translation and a video for each LSE sentence with the sign language representation. This corpus also contains more than 700 sign descriptions in several sign writing specifications. The translation system developed with this corpus consists of two modules: a Spanish into LSE translation module that is composed of a speech recognizer (for decoding the spoken utterance into a word sequence), a natural language translator (for converting a word sequence into a sequence of signs) and a 3D avatar animation module (for playing back the signs). The second module is a Spanish generator from LSE made up of a visual interface (for specifying a sequence of signs in sign writing), a language translator (for generating the sequence of words in Spanish) and a text to speech converter. For each language translation, the system uses three technologies: an example based strategy, a rule based translation method and a statistical translator

Archivo Digital UPM

Factored Translation Models for improving a Speech into Sign Language Translation System

Author: Córdoba Herralde Ricardo de
Ferreiros López Javier
Lopez Ludeña Veronica
Montero Martínez Juan Manuel
Pardo Muñoz José Manuel
San Segundo Hernández Rubén
Publication venue: E.T.S.I. Telecomunicación (UPM)
Publication date: 01/01/2011
Field of study

This paper proposes the use of Factored Translation Models (FTMs) for improving a Speech into Sign Language Translation System. These FTMs allow incorporating syntactic-semantic information during the translation process. This new information permits to reduce significantly the translation error rate. This paper also analyses different alternatives for dealing with the non-relevant words. The speech into sign language translation system has been developed and evaluated in a specific application domain: the renewal of Identity Documents and Driver’s License. The translation system uses a phrase-based translation system (Moses). The evaluation results reveal that the BLEU (BiLingual Evaluation Understudy) has improved from 69.1% to 73.9% and the mSER (multiple references Sign Error Rate) has been reduced from 30.6% to 24.8%

Archivo Digital UPM

Sistema de Comunicación Oral para Personas Sordas

Author: Garcia Adolfo
Lopez Ludeña Veronica
Martin Maganto Raquel
San Segundo Hernández Rubén
Sánchez David
Publication venue: E.T.S.I. Telecomunicación (UPM)
Publication date: 01/01/2010
Field of study

Este artículo describe el desarrollo y la evaluación de un sistema de comunicación para personas sordas en un ámbito de aplicación específico: la renovación del permiso de conducir. El sistema de comunicación desarrollado está compuesto por dos módulos que permiten la comunicación en los dos sentidos. El primer módulo es un traductor de voz en castellano a Lengua de Signos Española (LSE) y está formado por un reconocedor de voz, un traductor de palabras en castellano a una secuencia de signos y un tercer módulo de representación de los signos mediante un agente animado. El segundo módulo es un generador de voz en castellano a partir de una secuencia de signos, y está formado por una interfaz donde se especifican los signos, un traductor (para convertir la secuencia de signos en una secuencia de palabras) y un conversor de texto a voz. En los dos módulos de traducción entre lenguas, se integran tres tecnologías: una basada en ejemplos, una basada en reglas y un traductor estadístico. Este artículo describe la evaluación del sistema llevada a cabo en la Jefatura Provincial de Tráfico de Toledo implicando a funcionarios de dicha jefatura y personas sordas

Archivo Digital UPM

Evaluating a Speech Communication System for deaf people

Author: García Alfonso
Lopez Ludeña Veronica
Martin Maganto Raquel
San Segundo Hernández Rubén
Sánchez David
Publication venue: E.T.S.I. Telecomunicación (UPM)
Publication date: 01/01/2011
Field of study

This paper describes the development of an Advanced Speech Communication System for Deaf People and its field evaluation in a real application domain: the renewal of Driver’s License. The system is composed of two modules. The first one is a Spanish into Spanish Sign Language (LSE: Lengua de Signos Española) translation module made up of a speech recognizer, a natural language translator (for converting a word sequence into a sequence of signs), and a 3D avatar animation module (for playing back the signs). The second module is a Spoken Spanish generator from sign-writing composed of a visual interface (for specifying a sequence of signs), a language translator (for generating the sequence of words in Spanish), and finally, a text to speech converter. For language translation, the system integrates three technologies: an example-based strategy, a rule-based translation method and a statistical translator. This paper also includes a detailed description of the evaluation carried out in the Local Traffic Office in the city of Toledo (Spain) involving real government employees and deaf people. This evaluation includes objective measurements from the system and subjective information from questionnaires. Finally, the paper reports an analysis of the main problems and a discussion about possible solutions

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM