Search CORE

44 research outputs found

Improving Spatial Codification in Semantic Segmentation

Author: Giró-i-Nieto Xavier
Marqués Ferran
McGuinness Kevin
O'Connor Noel E.
Ventura Carles
Vilaplana Verónica
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/05/2015
Field of study

This paper explores novel approaches for improving the spatial codification for the pooling of local descriptors to solve the semantic segmentation problem. We propose to partition the image into three regions for each object to be described: Figure, Border and Ground. This partition aims at minimizing the influence of the image context on the object description and vice versa by introducing an intermediate zone around the object contour. Furthermore, we also propose a richer visual descriptor of the object by applying a Spatial Pyramid over the Figure region. Two novel Spatial Pyramid configurations are explored: Cartesian-based and crown-based Spatial Pyramids. We test these approaches with state-of-the-art techniques and show that they improve the Figure-Ground based pooling in the Pascal VOC 2011 and 2012 semantic segmentation challenges.Comment: Paper accepted at the IEEE International Conference on Image Processing, ICIP 2015. Quebec City, 27-30 September. Project page: https://imatge.upc.edu/web/publications/improving-spatial-codification-semantic-segmentatio

arXiv.org e-Print Archive

Crossref

Irish Universities

DCU Online Research Access Service

Simple vs complex temporal recurrences for video saliency prediction

Author: Giró-i-Nieto Xavier
Linardos Panagiotis
McGuinness Kevin
Mohedano Eva
Nieto Juan Jose
O'Connor Noel E.
Publication venue: 'British Machine Vision Association and Society for Pattern Recognition'
Publication date: 01/01/2019
Field of study

This paper investigates modifying an existing neural network architecture for static saliency prediction using two types of recurrences that integrate information from the temporal domain. The first modification is the addition of a ConvLSTM within the architecture, while the second is a conceptually simple exponential moving average of an internal convolutional state. We use weights pre-trained on the SALICON dataset and fine-tune our model on DHF1K. Our results show that both modifications achieve state-of-the-art results and produce similar saliency maps. Source code is available at https://git.io/fjPiB

arXiv.org e-Print Archive

UPCommons. Portal del coneixement obert de la UPC

Irish Universities

DCU Online Research Access Service

Assessing knee OA severity with CNN attention-based end-to-end architectures

Author: Antony Joseph
Giró-i-Nieto Xavier
Gorriz Marc
McGuinness Kevin
O'Connor Noel E.
Publication venue: JMLR
Publication date: 01/01/2019
Field of study

This work proposes a novel end-to-end convolutional neural network (CNN) architecture to automatically quantify the severity of knee osteoarthritis (OA) using X-Ray images, which incorporates trainable attention modules acting as unsupervised fine-grained detectors of the region of interest (ROI). The proposed attention modules can be applied at different levels and scales across any CNN pipeline helping the network to learn relevant attention patterns over the most informative parts of the image at different resolutions. We test the proposed attention mechanism on existing state-of-the-art CNN architectures as our base models, achieving promising results on the benchmark knee OA datasets from the osteoarthritis initiative (OAI) and multicenter osteoarthritis study (MOST). All code from our experiments will be publicly available on the github repository: https://github.com/marc-gorriz/KneeOA-CNNAttentio

arXiv.org e-Print Archive

UPCommons. Portal del coneixement obert de la UPC

Irish Universities

DCU Online Research Access Service

Shallow and deep convolutional networks for saliency prediction

Author: Giró-i-Nieto Xavier
McGuinness Kevin
O'Connor Noel E.
Pan Junting
Sayrol Elisa
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

The prediction of salient areas in images has been traditionally addressed with hand-crafted features based on neuroscience principles. This paper, however, addresses the problem with a completely data-driven approach by training a convolutional neural network (convnet). The learning process is formulated as a minimization of a loss function that measures the Euclidean distance of the predicted saliency map with the provided ground truth. The recent publication of large datasets of saliency prediction has provided enough data to train end-to-end architectures that are both fast and accurate. Two designs are proposed: a shallow convnet trained from scratch, and a another deeper solution whose first three layers are adapted from another network trained for classification. To the authors knowledge, these are the first end-to-end CNNs trained and tested for the purpose of saliency prediction

arXiv.org e-Print Archive

Crossref

UPCommons. Portal del coneixement obert de la UPC

Irish Universities

DCU Online Research Access Service

Bags of local convolutional features for scalable instance search

Author: Giró-i-Nieto Xavier
Marqués Ferran
McGuinness Kevin
Mohedano Eva
O'Connor Noel E.
Salvador Amaia
Publication venue
Publication date: 01/01/2016
Field of study

This work proposes a simple instance retrieval pipeline based on encoding the convolutional features of CNN using the bag of words aggregation scheme (BoW). Assigning each local array of activations in a convolutional layer to a visual word produces an assignment map, a compact representation that relates regions of an image with a visual word. We use the assignment map for fast spatial reranking, obtain- ing object localizations that are used for query expansion. We demonstrate the suitability of the BoW representation based on local CNN features for instance retrieval, achieving competitive performance on the Oxford and Paris buildings benchmarks. We show that our proposed system for CNN feature aggregation with BoW outperforms state-of-the-art techniques using sum pooling at a subset of the challenging TRECVid INS benchmark

arXiv.org e-Print Archive

Crossref

UPCommons. Portal del coneixement obert de la UPC

Irish Universities

DCU Online Research Access Service

EEG-based saliency maps

Author: Giró-i-Nieto Xavier
Healy Graham
McGuinness Kevin
Mohedano Eva
O'Connor Noel E.
Smeaton Alan F.
Publication venue
Publication date: 11/06/2015
Field of study

Irish Universities

DCU Online Research Access Service

Desarrollo e implementación de un sistema de detección de fallas de materiales basado en el método de golpeteo

Author: Filoni Pablo T.
Giró Juan Francisco
Stuardi José E.
Publication venue
Publication date: 10/09/2021
Field of study

Se desarrolla analíticamente e implementa en forma práctica un sistema de predicción de fallas de materiales compuestos. El procedimiento se enmarca en la categoría END (Ensayos No Destructivos) y debido a su portabilidad, es aplicable a materiales que forman parte de estructuras ya construidas. A partir del análisis de las señales obtenidas experimentalmente mediante el golpeteo con un martillo modal -construido ex profeso-, se determinan los parámetros característicos que permiten la evaluación del estado del material compuesto, su tipificación y una eventual cuantificación del daño. Un software específicamente diseñado administra y procesa las señales obtenidas en los ensayos, y permite mediante su interfaz gráfica la rápida interpretación de los resultados. El sistema fue probado en muestras de materiales con fallas típicas, demostrando su efectividad. El método constituye una herramienta sólida para la detección temprana de fallas y para la toma de medidas correctivas sobre estructuras en servicio.Sociedad Argentina de Informática e Investigación Operativ

Servicio de Difusión de la Creación Intelectual

SalGAN: visual saliency prediction with generative adversarial networks

Author: Canton Ferrer Cristian
Giró-i-Nieto Xavier
McGuinness Kevin
O'Connor Noel E.
Pan Junting
Sayrol Elisa
Torres Jordi
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 26/07/2017
Field of study

We introduce SalGAN, a deep convolutional neural network for visual saliency prediction trained with adversarial examples. The first stage of the network consists of a generator model whose weights are learned by back-propagation computed from a binary cross entropy (BCE) loss over downsampled versions of the saliency maps. The resulting prediction is processed by a discriminator network trained to solve a binary classification task between the saliency maps generated by the generative stage and the ground truth ones. Our experiments show how adversarial training allows reaching state-of-the-art performance across different metrics when combined with a widely-used loss function like BCE. Our results can be reproduced with the source code and trained models available at https://imatge-upc.github. io/saliency-salgan-2017/

arXiv.org e-Print Archive

Irish Universities

DCU Online Research Access Service

Exploring EEG for object detection and retrieval

Author: Giró-i-Nieto Xavier
Healy Graham
McGuinness Kevin
Mohedano Eva
O'Connor Noel E.
Porta Caubet Sergi
Salvador Amaia
Smeaton Alan F.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 26/06/2015
Field of study

This paper explores the potential for using Brain Computer Interfaces (BCI) as a relevance feedback mechanism in contentbased image retrieval. Several experiments are performed using a rapid serial visual presentation (RSVP) of images at different rates (5Hz and 10Hz) on 8 users with different degrees of familiarization with BCI and the dataset. We compare the feedback from the BCI and mouse-based interfaces in a subset of TRECVid images, finding that, when users have limited time to annotate the images, both interfaces are comparable in performance. Comparing our best users in a retrieval task, we found that EEG-based relevance feedback can outperform mouse-based feedback

DCU Online Research Access Service