14 research outputs found
Face Recognition in Color Using Complex and Hypercomplex Representation
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-540-72847-4_29Color has plenty of discriminative information that can be used to improve the performance of face recognition algorithms, although it is difficult to use it because of its high variability. In this paper we investigate the use of the quaternion representation of a color image for face recognition. We also propose a new representation for color images based on complex numbers. These two color representation methods are compared with the traditional grayscale and RGB representations using an eigenfaces based algorithm for identity verification. The experimental results show that the proposed method gives a very significant improvement when compared to using only the illuminance information.Work supported by the Spanish Project DPI2004-08279-C02-02 and the Generalitat Valenciana - ConsellerÃa d’Empresa, Universitat i Ciència under an FPI scholarship.Villegas, M.; Paredes Palacios, R. (2007). Face Recognition in Color Using Complex and Hypercomplex Representation. En Pattern Recognition and Image Analysis: Third Iberian Conference, IbPRIA 2007, Girona, Spain, June 6-8, 2007, Proceedings, Part I. Springer Verlag (Germany). 217-224. https://doi.org/10.1007/978-3-540-72847-4_29S217224Yip, A., Sinha, P.: Contribution of color to face recognition. Perception 31(5), 995–1003 (2002)Torres, L., Reutter, J.Y., Lorente, L.: The importance of the color information in face recognition. In: ICIP, vol. 3, pp. 627–631 (1999)Jones III, C., Abbott, A.L.: Color face recognition by hypercomplex gabor analysis. In: FG2006, University of Southampton, UK, pp. 126–131 (2006)Hamilton, W.R.: On a new species of imaginary quantities connected with a theory of quaternions. In: Proc. Royal Irish Academy, vol. 2, pp. 424–434 (1844)Zhang, F.: Quaternions and matrices of quaternions. Linear Algebra And Its Applications 251(1-3), 21–57 (1997)Pei, S., Cheng, C.: A novel block truncation coding of color images by using quaternion-moment preserving principle. In: ISCAS, Atlanta, USA, vol. 2, pp. 684–687 (1996)Sangwine, S., Ell, T.: Hypercomplex fourier transforms of color images. In: ICIP, Thessaloniki, Greece, vol. 1, pp. 137–140 (2001)Bihan, N.L., Sangwine, S.J.: Quaternion principal component analysis of color images. In: ICIP, Barcelona, Spain, vol. 1, pp. 809–812 (2003)Chang, J.-H., Pei, S.-C., Ding, J.J.: 2d quaternion fourier spectral analysis and its applications. In: ISCAS, Vancouver, Canada, vol. 3, pp. 241–244 (2004)Li, S.Z., Jain, A.K.: 6. In: Handbook of Face Recognition. Springer (2005)Gross, R., Brajovic, V.: An image preprocessing algorithm for illumination invariant face recognition. In: Kittler, J., Nixon, M.S. (eds.) AVBPA 2003. LNCS, vol. 2688, p. 1055. Springer, Heidelberg (2003)Lee, K., Ho, J., Kriegman, D.: Nine points of light: Acquiring subspaces for face recognition under variable lighting. In: CVPR, vol. 1, pp. 519–526 (2001)Zhang, L., Samaras, D.: Face recognition under variable lighting using harmonic image exemplars. In: CVPR, vol. 1, pp. 19–25 (2003)Villegas, M., Paredes, R.: Comparison of illumination normalization methods for face recognition. In: COST 275, University of Hertfordshire, UK, pp. 27–30 (2005)Turk, M., Pentland, A.: Face recognition using eigenfaces. In: CVPR, Hawaii, pp. 586–591 (1991)Bihan, N.L., Mars, J.: Subspace method for vector-sensor wave separation based on quaternion algebra. In: EUSIPCO, Toulouse, France (2002)XM2VTS (CDS00{1,6}), http://www.ee.surrey.ac.uk/Reseach/VSSP/xm2vtsdbLuettin, J., Maître, G.: Evaluation protocol for the extended M2VTS database (XM2VTSDB). IDIAP-COM 05, IDIAP (1998
Multimodal Interactive Transcription of Handwritten Text Images
En esta tesis se presenta un nuevo marco interactivo y multimodal para la transcripción de
Documentos manuscritos. Esta aproximación, lejos de proporcionar la transcripción completa
pretende asistir al experto en la dura tarea de transcribir.
Hasta la fecha, los sistemas de reconocimiento de texto manuscrito disponibles no proporcionan
transcripciones aceptables por los usuarios y, generalmente, se requiere la intervención
del humano para corregir las transcripciones obtenidas. Estos sistemas han demostrado ser
realmente útiles en aplicaciones restringidas y con vocabularios limitados (como es el caso
del reconocimiento de direcciones postales o de cantidades numéricas en cheques bancarios),
consiguiendo en este tipo de tareas resultados aceptables. Sin embargo, cuando se trabaja
con documentos manuscritos sin ningún tipo de restricción (como documentos manuscritos
antiguos o texto espontáneo), la tecnologÃa actual solo consigue resultados inaceptables.
El escenario interactivo estudiado en esta tesis permite una solución más efectiva. En este
escenario, el sistema de reconocimiento y el usuario cooperan para generar la transcripción final
de la imagen de texto. El sistema utiliza la imagen de texto y una parte de la transcripción
previamente validada (prefijo) para proponer una posible continuación. Despues, el usuario
encuentra y corrige el siguente error producido por el sistema, generando asà un nuevo prefijo
mas largo. Este nuevo prefijo, es utilizado por el sistema para sugerir una nueva hipótesis. La
tecnologÃa utilizada se basa en modelos ocultos de Markov y n-gramas. Estos modelos son
utilizados aquà de la misma manera que en el reconocimiento automático del habla. Algunas
modificaciones en la definición convencional de los n-gramas han sido necesarias para tener
en cuenta la retroalimentación del usuario en este sistema.Romero Gómez, V. (2010). Multimodal Interactive Transcription of Handwritten Text Images [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/8541Palanci
Digital Image Sensor-Based Assessment of the Status of Oat (Avena sativa L.) Crops after Frost Damage
The aim of this paper is to classify the land covered with oat crops, and the quantification of frost damage on oats, while plants are still in the flowering stage. The images are taken by a digital colour camera CCD-based sensor. Unsupervised classification methods are applied because the plants present different spectral signatures, depending on two main factors: illumination and the affected state. The colour space used in this application is CIELab, based on the decomposition of the colour in three channels, because it is the closest to human colour perception. The histogram of each channel is successively split into regions by thresholding. The best threshold to be applied is automatically obtained as a combination of three thresholding strategies: (a) Otsu’s method, (b) Isodata algorithm, and (c) Fuzzy thresholding. The fusion of these automatic thresholding techniques and the design of the classification strategy are some of the main findings of the paper, which allows an estimation of the damages and a prediction of the oat production
Web Service SWePT: A Hybrid Opinion Mining Approach
[EN] The increasing use of social networks and online sites where people can express
their opinions has created a growing interest in Opinion Mining. One of the main tasks of
Opinion Mining is to determine whether an opinion is positive or negative. Therefore, the role
of the feelings expressed on the web has become crucial, mainly due to the concern of
businesses and government to automatically identify the semantic orientation of the views of
customers or citizens. This is also a concern, in the area of health to identify psychological
disorders. This research focuses on the development of a web application called SWePT (Web
Service for Polarity detection in Spanish Texts), which implements the Sequential Minimal
Optimization (SMO) algorithm, extracting its features from an affective lexicon in Mexican
Spanish. For this purpose, a corpus and an affective lexicon in Mexican Spanish were created.
The experiments using three (positive, neutral, negative) and five categories (very positive,
positive, neutral, negative, and very negative) allow us to demonstrate the effectiveness of the
presented method. SWePT has also been implemented in the Emotion-bracelet interface, which
shows the opinion of a user graphically.This work has been partially supported by the Sectorial Fund CONACyT-INEGI: Project with ref. 208471, INFOTEC, Mexico. And, also by the project CNDT-PYR2015-0016, CENIDET, Mexico. The work of the third author was in the framework of the SomEMBED MINECO TIN2015-71147-C2-1-P research project. The National Council for Science and Technology (CONACyT Mexico) has funded the research work of Delia Irazu Hernandez Farias (Grant No. 218109/313683 CVU-369616).Baca-Gomez, YR.; MartÃnez, A.; Rosso, P.; Estrada Esquivel, H.; Hernandez-Farias, DI. (2016). Web Service SWePT: A Hybrid Opinion Mining Approach. Journal of Universal Computer Science. 22(5):671-690. https://doi.org/10.3217/jucs-022-05-067167169022
PolÃticas de Copyright de Publicações CientÃficas em Repositórios Institucionais: O Caso do INESC TEC
A progressiva transformação das práticas cientÃficas, impulsionada pelo desenvolvimento das novas Tecnologias de Informação e Comunicação (TIC), têm possibilitado aumentar o acesso à informação, caminhando gradualmente para uma abertura do ciclo de pesquisa. Isto permitirá resolver a longo prazo uma adversidade que se tem colocado aos investigadores, que passa pela existência de barreiras que limitam as condições de acesso, sejam estas geográficas ou financeiras. Apesar da produção cientÃfica ser dominada, maioritariamente, por grandes editoras comerciais, estando sujeita à s regras por estas impostas, o Movimento do Acesso Aberto cuja primeira declaração pública, a Declaração de Budapeste (BOAI), é de 2002, vem propor alterações significativas que beneficiam os autores e os leitores. Este Movimento vem a ganhar importância em Portugal desde 2003, com a constituição do primeiro repositório institucional a nÃvel nacional. Os repositórios institucionais surgiram como uma ferramenta de divulgação da produção cientÃfica de uma instituição, com o intuito de permitir abrir aos resultados da investigação, quer antes da publicação e do próprio processo de arbitragem (preprint), quer depois (postprint), e, consequentemente, aumentar a visibilidade do trabalho desenvolvido por um investigador e a respetiva instituição. O estudo apresentado, que passou por uma análise das polÃticas de copyright das publicações cientÃficas mais relevantes do INESC TEC, permitiu não só perceber que as editoras adotam cada vez mais polÃticas que possibilitam o auto-arquivo das publicações em repositórios institucionais, como também que existe todo um trabalho de sensibilização a percorrer, não só para os investigadores, como para a instituição e toda a sociedade. A produção de um conjunto de recomendações, que passam pela implementação de uma polÃtica institucional que incentive o auto-arquivo das publicações desenvolvidas no âmbito institucional no repositório, serve como mote para uma maior valorização da produção cientÃfica do INESC TEC.The progressive transformation of scientific practices, driven by the development of new Information and Communication Technologies (ICT), which made it possible to increase access to information, gradually moving towards an opening of the research cycle. This opening makes it possible to resolve, in the long term, the adversity that has been placed on researchers, which involves the existence of barriers that limit access conditions, whether geographical or financial. Although large commercial publishers predominantly dominate scientific production and subject it to the rules imposed by them, the Open Access movement whose first public declaration, the Budapest Declaration (BOAI), was in 2002, proposes significant changes that benefit the authors and the readers. This Movement has gained importance in Portugal since 2003, with the constitution of the first institutional repository at the national level. Institutional repositories have emerged as a tool for disseminating the scientific production of an institution to open the results of the research, both before publication and the preprint process and postprint, increase the visibility of work done by an investigator and his or her institution. The present study, which underwent an analysis of the copyright policies of INESC TEC most relevant scientific publications, allowed not only to realize that publishers are increasingly adopting policies that make it possible to self-archive publications in institutional repositories, all the work of raising awareness, not only for researchers but also for the institution and the whole society. The production of a set of recommendations, which go through the implementation of an institutional policy that encourages the self-archiving of the publications developed in the institutional scope in the repository, serves as a motto for a greater appreciation of the scientific production of INESC TEC
Landscape of coordinated immune responses to H1N1 challenge in humans
Influenza is a significant cause of morbidity and mortality worldwide. Here we show changes in the abundance and activation states of more than 50 immune cell subsets in 35 individuals over 11 time points during human A/California/2009 (H1N1) virus challenge monitored using mass cytometry along with other clinical assessments. Peak change in monocyte, B cell, and T cell subset frequencies coincided with peak virus shedding, followed by marked activation of T and NI< cells. Results led to the identification of C038 as a critical regulator of plasmacytoid dendritic cell function in response to influenza virus. Machine learning using study-derived clinical parameters and single-cell data effectively classified and predicted susceptibility to infection. The coordinated immune cell dynamics defined in this study provide a framework for identifying novel correlates of protection in the evaluation of future influenza therapeutics
Seamless Multimodal Biometrics for Continuous Personalised Wellbeing Monitoring
Artificially intelligent perception is increasingly present in the lives of
every one of us. Vehicles are no exception, (...) In the near future, pattern
recognition will have an even stronger role in vehicles, as self-driving cars
will require automated ways to understand what is happening around (and within)
them and act accordingly. (...) This doctoral work focused on advancing
in-vehicle sensing through the research of novel computer vision and pattern
recognition methodologies for both biometrics and wellbeing monitoring. The
main focus has been on electrocardiogram (ECG) biometrics, a trait well-known
for its potential for seamless driver monitoring. Major efforts were devoted to
achieving improved performance in identification and identity verification in
off-the-person scenarios, well-known for increased noise and variability. Here,
end-to-end deep learning ECG biometric solutions were proposed and important
topics were addressed such as cross-database and long-term performance,
waveform relevance through explainability, and interlead conversion. Face
biometrics, a natural complement to the ECG in seamless unconstrained
scenarios, was also studied in this work. The open challenges of masked face
recognition and interpretability in biometrics were tackled in an effort to
evolve towards algorithms that are more transparent, trustworthy, and robust to
significant occlusions. Within the topic of wellbeing monitoring, improved
solutions to multimodal emotion recognition in groups of people and
activity/violence recognition in in-vehicle scenarios were proposed. At last,
we also proposed a novel way to learn template security within end-to-end
models, dismissing additional separate encryption processes, and a
self-supervised learning approach tailored to sequential data, in order to
ensure data security and optimal performance. (...)Comment: Doctoral thesis presented and approved on the 21st of December 2022
to the University of Port
Correspondence of three-dimensional objects
First many thanks go to Prof. Hans du Buf, for his supervision based
on his experience, for providing a stimulating and cheerful research environment
in his laboratory, for letting me participate in the projects that
produced results for papers, thus made me more aware of the state of the
art in Computer Vision, especially in the area of 3D recognition. Also for
his encouraging support and his way to always nd time for discussions,
and last but not the least for the cooking recipes...
Many thanks go also to my laboratory fellows, to Jo~ao Rodrigues, who
invited me to participate in FCT and QREN projects, Jaime Carvalho
Martins and Miguel Farrajota, for discussing scienti c and technical
problems, but also almost all problems in the world.
To all persons, that worked in, or visited the Vision Laboratory, especially
those with whom I have worked with, almost on a daily basis.
A special thanks to the Instituto Superior de Engenharia at UAlg and
my colleagues at the Department of Electrical Engineering, for allowing
me to suspend lectures in order to be present at conferences.
To my family, my wife and my kids
Novel statistical approaches to text classification, machine translation and computer-assisted translation
Esta tesis presenta diversas contribuciones en los campos de la
clasificación automática de texto, traducción automática y traducción
asistida por ordenador bajo el marco estadÃstico.
En clasificación automática de texto, se propone una nueva aplicación
llamada clasificación de texto bilingüe junto con una serie de modelos
orientados a capturar dicha información bilingüe. Con tal fin se
presentan dos aproximaciones a esta aplicación; la primera de ellas se
basa en una asunción naive que contempla la independencia entre las
dos lenguas involucradas, mientras que la segunda, más sofisticada,
considera la existencia de una correlación entre palabras en
diferentes lenguas. La primera aproximación dió lugar al desarrollo de
cinco modelos basados en modelos de unigrama y modelos de n-gramas
suavizados. Estos modelos fueron evaluados en tres tareas de
complejidad creciente, siendo la más compleja de estas tareas
analizada desde el punto de vista de un sistema de ayuda a la
indexación de documentos. La segunda aproximación se caracteriza por
modelos de traducción capaces de capturar correlación entre palabras
en diferentes lenguas. En nuestro caso, el modelo de traducción
elegido fue el modelo M1 junto con un modelo de unigramas. Este
modelo fue evaluado en dos de las tareas más simples superando la
aproximación naive, que asume la independencia entre palabras en
differentes lenguas procedentes de textos bilingües.
En traducción automática, los modelos estadÃsticos de traducción
basados en palabras M1, M2 y HMM son extendidos bajo el marco de la
modelización mediante mixturas, con el objetivo de definir modelos de
traducción dependientes del contexto. Asimismo se extiende un
algoritmo iterativo de búsqueda basado en programación dinámica,
originalmente diseñado para el modelo M2, para el caso de mixturas de
modelos M2. Este algoritmo de búsqueda nCivera Saiz, J. (2008). Novel statistical approaches to text classification, machine translation and computer-assisted translation [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/2502Palanci