CentralNet: a Multilayer Approach for Multimodal Fusion
This paper proposes a novel multimodal fusion approach, aiming to produce the
best possible decisions by integrating information coming from multiple media.
While most past multimodal approaches work either by projecting the features of
different modalities into the same space, or by coordinating the
representations of each modality through the use of constraints, our approach
borrows from both visions. More specifically, assuming each modality can be
processed by a separate deep convolutional network, allowing decisions to be
taken independently from each modality, we introduce a central network linking
the modality-specific networks. This central network not only provides a common
feature embedding but also regularizes the modality-specific networks through
multi-task learning. The proposed approach is validated on 4 different computer
vision tasks, on which it consistently improves the accuracy of existing
multimodal fusion approaches.
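The central-network idea in the abstract above can be sketched as a per-layer fusion step: the next central representation is a learned weighted sum of the previous central features and each modality's hidden features. This is a minimal illustrative sketch, not the paper's exact architecture; the function name and weights are assumptions.

```python
def central_fusion(central_prev, modality_feats, weights):
    """One CentralNet-style fusion step (illustrative sketch).

    central_prev: previous central feature vector (list of floats).
    modality_feats: list of per-modality feature vectors, same length
                    as central_prev.
    weights: one scalar for the central features plus one per modality.
    Returns the next central feature vector as a weighted sum.
    """
    assert len(weights) == len(modality_feats) + 1
    # Start from the weighted previous central features...
    out = [weights[0] * c for c in central_prev]
    # ...then add each modality's weighted contribution elementwise.
    for w, feat in zip(weights[1:], modality_feats):
        out = [o + w * f for o, f in zip(out, feat)]
    return out
```

In the paper the weights are trained jointly with the modality-specific networks; here they are plain scalars so the fusion rule itself is visible.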
Empowerment through journalism: social change through youth media production in northeast Brazil [abstract]
Journalism is a process through which people can begin to understand their realities, and it can be a powerful force in democratic societies for or against change. Specifically, youth journalism engages students in identifying themes that elicit social and emotional involvement and a high level of motivation to participate. This thesis explores how journalism can be used as a tool of empowerment, building the capacity of youth to become aware of their own realities and communicate these realities to others through a newspaper. I also explore how the production is linked to social justice by analyzing how it allows the youth of Daruê Malungo, a Center for Arts and Education in Recife, Brazil, to examine the visible and invisible systems shaping their interactions and identities. My methodology for this research included teaching a journalism class using Paulo Freire's theory in the Pedagogy of the Oppressed and the development of a newspaper made by the students. I argue that the newspaper produced by the students at Daruê Malungo allowed them to navigate experiences of difference in terms of race, class, privilege, and oppression. Their production was linked to social justice because it was a cry, "um lamento" as the students decided to name their newspaper, for social action in terms of the racial prejudice that still surrounds them, the violence and drug problems in their community, the lack of education they receive, the pollution and abuse of the environment, and an explanation of how they express themselves through their culture. This journalism production created a space for youth development and empowerment, in which students said they were no longer afraid to speak: they were given the opportunity to tell their community, their country, and the world what was important to them and why they wanted change. School for International Training
Exploiting feature representations through similarity learning, post-ranking and ranking aggregation for person re-identification
Person re-identification has received special attention from the human analysis
community in the last few years. To address the challenges in this field, many
researchers have proposed different strategies, which basically exploit either
cross-view invariant features or cross-view robust metrics. In this work, we
propose to exploit a post-ranking approach and combine different feature
representations through ranking aggregation. Spatial information, which
potentially benefits the person matching, is represented using a 2D body model,
from which color and texture information are extracted and combined. We also
consider background/foreground information, automatically extracted via Deep
Decompositional Network, and the usage of Convolutional Neural Network (CNN)
features. To describe the matching between images we use the polynomial feature
map, also taking into account local and global information. The Discriminant
Context Information Analysis based post-ranking approach is used to improve
initial ranking lists. Finally, the Stuart ranking aggregation method is
employed to combine complementary ranking lists obtained from different feature
representations. Experimental results demonstrate that we improve the
state-of-the-art on the VIPeR and PRID450s datasets, achieving top-1
recognition rates of 67.21% and 75.64%, respectively, as well as obtaining
competitive results on the CUHK01 dataset. Comment: Preprint submitted to Image and Vision Computing.
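The final step of the pipeline above combines complementary ranking lists from different feature representations. The paper uses the Stuart aggregation method; as a simple stand-in to show the general idea, the sketch below aggregates by mean rank (function name and interface are hypothetical).

```python
def aggregate_rankings(rankings):
    """Combine several ranking lists into one (mean-rank sketch,
    standing in for the Stuart method used in the paper).

    rankings: list of lists, each an ordering of the same gallery ids
              with the best match first.
    Returns the ids sorted by their average position across lists.
    """
    ids = rankings[0]
    # Average each id's position (0 = best) over all ranking lists.
    mean_rank = {g: sum(r.index(g) for r in rankings) / len(rankings)
                 for g in ids}
    return sorted(ids, key=lambda g: mean_rank[g])
```

An id ranked near the top by most feature representations ends up near the top of the aggregate list, even if one representation misranks it.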
Duplications in nomenclature
Overcoming Calibration Problems in Pattern Labeling with Pairwise Ratings: Application to Personality Traits
We address the problem of calibration of workers whose task is to label patterns with continuous variables, which arises for instance in labeling images or videos of humans with continuous traits. Worker bias is particularly difficult to evaluate and correct when many workers contribute just a few labels, a situation arising typically when labeling is crowd-sourced. In the scenario of labeling short videos of people facing a camera with personality traits, we evaluate the feasibility of the pairwise ranking method to alleviate bias problems. Workers are exposed to pairs of videos at a time and must order them by preference. The variable levels are reconstructed by fitting a Bradley-Terry-Luce model with maximum likelihood. This method may, at first sight, seem prohibitively expensive because for N videos, p=N(N−1)/2 pairs must potentially be processed by workers rather than N videos. However, by performing extensive simulations, we determine an empirical law for the scaling of the number of pairs needed as a function of the number of videos in order to achieve a given accuracy of score reconstruction, and show that the pairwise method is affordable. We apply the method to the labeling of a large-scale dataset of 10,000 videos used in the ChaLearn Apparent Personality Trait challenge.
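The score reconstruction described above can be sketched concretely: under the Bradley-Terry-Luce model, item w beats item l with probability sigmoid(s_w − s_l), and the scores s can be fit by maximizing the likelihood of the observed pairwise outcomes. The gradient-ascent fitter below is a minimal sketch assuming comparisons arrive as (winner, loser) index pairs; the function name and hyperparameters are illustrative.

```python
import math

def fit_btl(n_items, comparisons, iters=1000, lr=1.0):
    """Fit Bradley-Terry-Luce scores by maximum likelihood via
    gradient ascent (minimal sketch).

    comparisons: list of (winner, loser) index pairs; for N items,
    up to N(N-1)/2 distinct pairs may be compared.
    """
    s = [0.0] * n_items
    n = float(len(comparisons))
    for _ in range(iters):
        grad = [0.0] * n_items
        for w, l in comparisons:
            # P(w beats l) under the BTL model.
            p = 1.0 / (1.0 + math.exp(-(s[w] - s[l])))
            grad[w] += (1.0 - p) / n
            grad[l] -= (1.0 - p) / n
        s = [si + lr * g for si, g in zip(s, grad)]
        # Scores are identifiable only up to a shift; center them.
        mean = sum(s) / n_items
        s = [si - mean for si in s]
    return s
```

The log-likelihood is concave, so plain gradient ascent with a modest step size recovers the latent ordering once each pair has been compared enough times, which is exactly the trade-off the abstract's simulations quantify.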
Convolutional Neural Network Super Resolution for Face Recognition in Surveillance Monitoring
Due to the importance of security in society, monitoring activities and recognizing specific people through surveillance video cameras play an important role. One of the main issues in such activity arises from the fact that cameras do not meet the resolution requirements of many face recognition algorithms. In order to solve this issue, in this paper we propose a new system which super-resolves the image using a deep convolutional network, followed by Hidden Markov Model and Singular Value Decomposition based face recognition. The proposed system has been tested on many well-known face databases such as the FERET, HeadPose, and Essex University databases, as well as our recently introduced iCV Face Recognition database (iCV-F). The experimental results show that the recognition rate improves considerably after applying super resolution.
Video Transformers: A Survey
Transformer models have shown great success handling long-range interactions,
making them a promising tool for modeling video. However, they lack inductive
biases and scale quadratically with input length. These limitations are further
exacerbated by the high dimensionality introduced with the temporal dimension.
While there are surveys analyzing the advances of Transformers for vision, none
focus on an in-depth analysis of video-specific designs. In this survey we
analyze the main contributions and trends of works leveraging Transformers to
model video. Specifically, we first delve into how videos are handled at the
input level. Then, we study the architectural changes made to deal with video
more efficiently, reduce redundancy, re-introduce useful inductive biases, and
capture long-term temporal dynamics. In addition, we provide an overview of
different training regimes and explore effective self-supervised learning
strategies for video. Finally, we conduct a performance comparison on the most
common benchmark for Video Transformers (i.e., action classification), finding
them to outperform 3D ConvNets even with less computational complexity.
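The quadratic scaling mentioned in the abstract is easy to see in a back-of-the-envelope count: self-attention computes one score per (query, key) token pair, and for video the token count is frames times patches per frame. The numbers below (14x14 ViT-style patches) are illustrative assumptions.

```python
def attention_score_count(frames, patches_per_frame):
    """Number of pairwise attention scores in one full self-attention
    layer over a video clip: quadratic in the token count."""
    tokens = frames * patches_per_frame
    return tokens * tokens  # one score per (query, key) pair

# Doubling the clip length quadruples the attention cost.
base = attention_score_count(8, 196)     # e.g. 8 frames, 14x14 patches
double = attention_score_count(16, 196)
```

This is why the video-specific designs surveyed above focus on reducing redundancy and restricting attention patterns along the temporal dimension.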
- …