17 research outputs found
Segmentation of Optic Disc and Optic Cup in Retinal Fundus Images Using Coupled Shape Regression
Accurate segmentation of optic cup and disc in retinal fundus images is required to derive the cup-to-disc ratio (CDR) parameter which is the main indicator for Glaucoma assessment. In this paper, we propose a coupled regression method for accurate segmentation of optic cup and disc in retinal colour fundus image. The proposed coupled regression framework consists of a parameter regressor which directly predicts CDR from a given image, as well as an ensemble shape regressor which iteratively estimates the OD-OC boundary by taking into account the CDR estimated by the parameter regressor. The parameter regressor and the shape regressor are then coupled together within a feedback loop so that estimation of one reinforces the other. Both parameter regressor and the ensemble shape regressor are modeled using Boosted Regression Trees. The proposed optic cup and disc segmentation method is applied on an image set of 50 patients and demonstrates high segmentation accuracy. A comparative study shows that our proposed method outperforms state of the art methods for cup segmentation
Analysis of MVD and color edge detection for depth maps enhacement
Prjecte final de carrera realitzat en col.laboració amb Fraunhofer Heinrich Hertz InstituteMVD (Multiview Video plus Depth) data consists of two components: color
video and depth maps sequences. Depth maps represent the spatial arrangement
(or three dimensional geometry) of the scene. The MVD representation
is used for rendering virtual views in FVV (Free Viewpoint Video) and for
3DTV (3-dimensional TeleVision) applications. Distortions of the silhouettes
of objects in the depth maps are a problem when rendering a stereo video
pair. This Master thesis presents a system to improve the depth component
of MVD . For this purpose, it introduces a new method called correlation
histograms for analyzing the two components of depth-enhanced 3D video
representations with special emphasis on the improved depth component.
This document gives a description of this new method and presents an analysis
of six di erent MVD data sets with di erent features. Moreover, a modular
and exible system for improving depth maps is introduced. The idea
behind is to use the color video component for extracting edges of the scene
and to re-shape the depth component according to the edge information.
The mentioned system basically describes a framework. Hence, it is capable
to admit changes on speci c tasks if the concrete target is respected. After
the improvement process, the MVD data is analyzed again via correlation
histograms in order to obtain characteristics of the depth improvement.
The achieved results show that correlation histograms are a good method
for analyzing the impact of processing MVD data. It is also con rmed that
the presented system is modular and exible, as it works with three di erent
degrees of change, introducing modi cations in depth maps, according
to the input characteristics. Hence, this system can be used as a framework
for depth map improvement. The results show that contours with 1-pixel
width jittering in depth maps have been correctly re-shaped. Additionally,
constant background and foreground areas of depth maps have also been improved
according to the degree of change, attaining better results in terms of
temporal consistency. However, future work can focus on unresolved problems,
such as jittering with more than one pixel width or by making the
system more dynamic
Analysis of MVD and color edge detection for depth maps enhacement
Prjecte final de carrera realitzat en col.laboració amb Fraunhofer Heinrich Hertz InstituteMVD (Multiview Video plus Depth) data consists of two components: color
video and depth maps sequences. Depth maps represent the spatial arrangement
(or three dimensional geometry) of the scene. The MVD representation
is used for rendering virtual views in FVV (Free Viewpoint Video) and for
3DTV (3-dimensional TeleVision) applications. Distortions of the silhouettes
of objects in the depth maps are a problem when rendering a stereo video
pair. This Master thesis presents a system to improve the depth component
of MVD . For this purpose, it introduces a new method called correlation
histograms for analyzing the two components of depth-enhanced 3D video
representations with special emphasis on the improved depth component.
This document gives a description of this new method and presents an analysis
of six di erent MVD data sets with di erent features. Moreover, a modular
and exible system for improving depth maps is introduced. The idea
behind is to use the color video component for extracting edges of the scene
and to re-shape the depth component according to the edge information.
The mentioned system basically describes a framework. Hence, it is capable
to admit changes on speci c tasks if the concrete target is respected. After
the improvement process, the MVD data is analyzed again via correlation
histograms in order to obtain characteristics of the depth improvement.
The achieved results show that correlation histograms are a good method
for analyzing the impact of processing MVD data. It is also con rmed that
the presented system is modular and exible, as it works with three di erent
degrees of change, introducing modi cations in depth maps, according
to the input characteristics. Hence, this system can be used as a framework
for depth map improvement. The results show that contours with 1-pixel
width jittering in depth maps have been correctly re-shaped. Additionally,
constant background and foreground areas of depth maps have also been improved
according to the degree of change, attaining better results in terms of
temporal consistency. However, future work can focus on unresolved problems,
such as jittering with more than one pixel width or by making the
system more dynamic
Multiple-camera capture system implementation
The project consists in studying and analyzing different techniques for the acquisition of 3D scenes using a set of different cameras observing the scene from multiple views. Algorithms for camera calibration will be also considered and implemented. Moreover, algorithms for estimating the depth of the objects in the scene, using the information provided by two, three or more cameras; will also be develope
Gaze-Based Human-Robot Interaction by the Brunswick Model
We present a new paradigm for human-robot interaction based on social signal processing, and in particular on the Brunswick model. Originally, the Brunswick model copes with face-to-face dyadic interaction, assuming that the interactants are communicating through a continuous exchange of non verbal social signals, in addition to the spoken messages. Social signals have to be interpreted, thanks to a proper recognition phase that considers visual and audio information. The Brunswick model allows to quantitatively evaluate the quality of the interaction using statistical tools which measure how effective is the recognition phase. In this paper we cast this theory when one of the interactants is a robot; in this case, the recognition phase performed by the robot and the human have to be revised w.r.t. the original model. The model is applied to Berrick, a recent open-source low-cost robotic head platform, where the gazing is the social signal to be considered
Towards Learning Representations in Visual Computing Tasks
abstract: The performance of most of the visual computing tasks depends on the quality of the features extracted from the raw data. Insightful feature representation increases the performance of many learning algorithms by exposing the underlying explanatory factors of the output for the unobserved input. A good representation should also handle anomalies in the data such as missing samples and noisy input caused by the undesired, external factors of variation. It should also reduce the data redundancy. Over the years, many feature extraction processes have been invented to produce good representations of raw images and videos.
The feature extraction processes can be categorized into three groups. The first group contains processes that are hand-crafted for a specific task. Hand-engineering features requires the knowledge of domain experts and manual labor. However, the feature extraction process is interpretable and explainable. Next group contains the latent-feature extraction processes. While the original feature lies in a high-dimensional space, the relevant factors for a task often lie on a lower dimensional manifold. The latent-feature extraction employs hidden variables to expose the underlying data properties that cannot be directly measured from the input. Latent features seek a specific structure such as sparsity or low-rank into the derived representation through sophisticated optimization techniques. The last category is that of deep features. These are obtained by passing raw input data with minimal pre-processing through a deep network. Its parameters are computed by iteratively minimizing a task-based loss.
In this dissertation, I present four pieces of work where I create and learn suitable data representations. The first task employs hand-crafted features to perform clinically-relevant retrieval of diabetic retinopathy images. The second task uses latent features to perform content-adaptive image enhancement. The third task ranks a pair of images based on their aestheticism. The goal of the last task is to capture localized image artifacts in small datasets with patch-level labels. For both these tasks, I propose novel deep architectures and show significant improvement over the previous state-of-art approaches. A suitable combination of feature representations augmented with an appropriate learning approach can increase performance for most visual computing tasks.Dissertation/ThesisDoctoral Dissertation Computer Science 201
Correspondence of three-dimensional objects
First many thanks go to Prof. Hans du Buf, for his supervision based
on his experience, for providing a stimulating and cheerful research environment
in his laboratory, for letting me participate in the projects that
produced results for papers, thus made me more aware of the state of the
art in Computer Vision, especially in the area of 3D recognition. Also for
his encouraging support and his way to always nd time for discussions,
and last but not the least for the cooking recipes...
Many thanks go also to my laboratory fellows, to Jo~ao Rodrigues, who
invited me to participate in FCT and QREN projects, Jaime Carvalho
Martins and Miguel Farrajota, for discussing scienti c and technical
problems, but also almost all problems in the world.
To all persons, that worked in, or visited the Vision Laboratory, especially
those with whom I have worked with, almost on a daily basis.
A special thanks to the Instituto Superior de Engenharia at UAlg and
my colleagues at the Department of Electrical Engineering, for allowing
me to suspend lectures in order to be present at conferences.
To my family, my wife and my kids
Recommended from our members
Automatic Multilevel Feature Abstraction in Adaptable Machine Vision Systems
Vision is a complex task which can be accomplished with apparent ease by biological systems, but for which the design of artificial systems is difficult. Although machine vision systems can be successfully designed for a specific task, under certain conditions, they are likely to fail if circumstances change. This was the motivation for the research into ways in which systems can be self-designing and adaptable to new visual tasks. The research was conducted in three vital areas of concern for machine vision systems.
The first area is finding a suitable architecture for forming an appropriate representation for the current task. The research investigated the application of Hypernetworks theory to building a multilevel, generally-applicable representation, through repeated application of a fundamental 'self-similarity' principle, that parts of objects assembled under a particular relation at one level, form whole objects at the next. Results show that this is potentially a powerful approach for autonomously generating an adaptable system-architecture suitable for multiple visual tasks.
The second area is the autonomous extraction of suitable low-level features, which the research investigated through random generation of minimally-constrained pixel-configurations and algorithmic generation of homogeneous and heterogeneous polygons. The results suggest that, despite the simplicity of the features making them vulnerable to image transformations, these are promising approaches worth developing further.
The third area is automatic feature selection. The research explored management of 'dimensionality' and of 'combinatorial explosion', as well as how to locate relevant features at multiple representation levels, in the context of 'emergence' of structure. Results indicate that this approach can find useful 'intermediate-level' constructs through analysis of the connectivity of the simplices representing objects at higher levels.
The research concludes that the proposed novel approaches to tackling the above issues, in particular the application of hypernetworks to the formation of multilevel representations and the resulting emergence of higher-level structure, is fruitful
Patient-Specific Implants in Musculoskeletal (Orthopedic) Surgery
Most of the treatments in medicine are patient specific, aren’t they? So why should we bother with individualizing implants if we adapt our therapy to patients anyway? Looking at the neighboring field of oncologic treatment, you would not question the fact that individualization of tumor therapy with personalized antibodies has led to the thriving of this field in terms of success in patient survival and positive responses to alternatives for conventional treatments. Regarding the latest cutting-edge developments in orthopedic surgery and biotechnology, including new imaging techniques and 3D-printing of bone substitutes as well as implants, we do have an armamentarium available to stimulate the race for innovation in medicine. This Special Issue of Journal of Personalized Medicine will gather all relevant new and developed techniques already in clinical practice. Examples include the developments in revision arthroplasty and tumor (pelvic replacement) surgery to recreate individual defects, individualized implants for primary arthroplasty to establish physiological joint kinematics, and personalized implants in fracture treatment, to name but a few