4 research outputs found
Interactive Pattern Recognition applied to Natural Language Processing
This thesis is about Pattern Recognition. In the last decades, huge efforts have been
made to develop automatic systems able to rival human capabilities in this field. Although
these systems achieve high productivity rates, they are not precise enough in
most situations. Humans, on the contrary, are very accurate but comparatively quite
slower. This poses an interesting question: the possibility of benefiting from both
worlds by constructing cooperative systems.
This thesis presents diverse contributions to this kind of collaborative approach.
The point is to improve the Pattern Recognition systems by properly introducing a
human operator into the system. We call this Interactive Pattern Recognition (IPR).
Firstly, a general proposal for IPR will be stated. The aim is to develop a framework
to easily derive new applications in this area. Some interesting IPR issues are
also introduced. Multi-modality or adaptive learning are examples of extensions that
can naturally fit into IPR.
In the second place, we will focus on a specific application. A novel method to
obtain high quality speech transcriptions (CAST, Computer Assisted Speech Transcription).
We will start by proposing a CAST formalization and, next, we will cope
with different implementation alternatives. Practical issues, as the system response
time, will be also taken into account, in order to allow for a practical implementation
of CAST. Word graphs and probabilistic error correcting parsing are tools that will
be used to reach an alternative formulation that allows for the use of CAST in a real
scenario.
Afterwards, a special application within the general IPR framework will be discussed.
This is intended to test the IPR capabilities in an extreme environment, where
no input pattern is available and the system only has access to the user actions to produce
a hypothesis. Specifically, we will focus here on providing assistance in the
problem of text generation.RodrÃguez Ruiz, L. (2010). Interactive Pattern Recognition applied to Natural Language Processing [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/8479Palanci
Aportaciones al etiquetado y segmentación automática de diálogos en el corpus DIHANA
El siguiente trabajo estudia varios métodos para el etiquetado automático de segmentos
en los sistemas de diálogo hablados. Concretamente, se centra la experimentación en el
corpus de diálogo Dihana. El estudio aborda la eficacia de la prosodia (información
extraÃda de la señal, que caracteriza el habla) por sà misma para identificar actos de
diálogo y su combinación con las transcripciones de las intervenciones. También se
presenta un método de etiquetado basado en la transcripción que utiliza HMMs. Este
modelo se presenta en distintas versiones, fruto de realizar distintas asunciones en el
desarrollo del planteamiento por máxima verosimilitud. Se presenta también otro método
basado en la transcripción que utiliza técnicas de alineamiento tÃpicas de la traducción
automática.Tamarit Ballester, V. (2008). Aportaciones al etiquetado y segmentación automática de diálogos en el corpus DIHANA. http://hdl.handle.net/10251/13637Archivo delegad
Acoustic And Syntactical Modeling in the ATROS System
Current speech technology allows us to build efficient speech recognition systems. However, model learning of knowledge sources in a speech recognition system is not a closed problem. In addition, lower demand of computational requirements are crucial to building real-time systems