4 research outputs found

    Are iterations and curvature useful for tensor voting?

    No full text
    Tensor voting is an efficient algorithm for perceptual grouping and feature extraction, particularly for contour extraction. In this paper two studies on tensor voting are presented. First the use of iterations is investigated, and second, a new method for integrating curvature information is evaluated. In opposition to other grouping methods, tensor voting claims the advantage to be non-iterative. Although noniterative tensor voting methods provide good results in many cases, the algorithm can be iterated to deal with more complex data configurations. The experiments conducted demonstrate that iterations substantially improve the process of feature extraction and help to overcome limitations of the original algorithm. As a further contribution we propose a curvature improvement for tensor voting. On the contrary to the curvature - augmented tensor voting proposed by Tang and Medioni, our method takes advantage of the curvature calculation already performed by the classical tensor voting and evaluates the full curvature, sign and amplitude. Some new curvature - modified voting fields are also proposed. Results show a lower degree of artifacts, smoother curves, a high tolerance to scale parameter changes and also more noise - robustness. 漏 2004 Springer-VerlagPeer Reviewe

    New contributions in overcomplete image representations inspired from the functional architecture of the primary visual cortex = Nuevas contribuciones en representaciones sobrecompletas de im谩genes inspiradas por la arquitectura funcional de la corteza visual primaria

    Get PDF
    The present thesis aims at investigating parallelisms between the functional architecture of primary visual areas and image processing methods. A first objective is to refine existing models of biological vision on the base of information theory statements and a second is to develop original solutions for image processing inspired from natural vision. The available data on visual systems contains physiological and psychophysical studies, Gestalt psychology and statistics on natural images The thesis is mostly centered in overcomplete representations (i.e. representations increasing the dimensionality of the data) for multiple reasons. First because they allow to overcome existing drawbacks of critically sampled transforms, second because biological vision models appear overcomplete and third because building efficient overcomplete representations raises challenging and actual mathematical problems, in particular the problem of sparse approximation. The thesis proposes first a self-invertible log-Gabor wavelet transformation inspired from the receptive field and multiresolution arrangement of the simple cells in the primary visual cortex (V1). This transform shows promising abilities for noise elimination. Second, interactions observed between V1 cells consisting in lateral inhibition and in facilitation between aligned cells are shown efficient for extracting edges of natural images. As a third point, the redundancy introduced by the overcompleteness is reduced by a dedicated sparse approximation algorithm which builds a sparse representation of the images based on their edge content. For an additional decorrelation of the image information and for improving the image compression performances, edges arranged along continuous contours are coded in a predictive manner through chains of coefficients. This offers then an efficient representation of contours. Fourth, a study on contour completion using the tensor voting framework based on Gestalt psychology is presented. There, the use of iterations and of the curvature information allow to improve the robustness and the perceptual quality of the existing method. La presente tesis doctoral tiene como objetivo indagar en algunos paralelismos entre la arquitectura funcional de las 谩reas visuales primarias y el tratamiento de im谩genes. Un primer objetivo consiste en mejorar los modelos existentes de visi贸n biol贸gica bas谩ndose en la teor铆a de la informaci贸n. Un segundo es el desarrollo de nuevos algoritmos de tratamiento de im谩genes inspirados de la visi贸n natural. Los datos disponibles sobre el sistema visual abarcan estudios fisiol贸gicos y psicof铆sicos, psicolog铆a Gestalt y estad铆sticas de las im谩genes naturales. La tesis se centra principalmente en las representaciones sobrecompletas (i.e. representaciones que incrementan la dimensionalidad de los datos) por las siguientes razones. Primero porque permiten sobrepasar importantes desventajas de las transformaciones ortogonales; segundo porque los modelos de visi贸n biol贸gica necesitan a menudo ser sobrecompletos y tercero porque construir representaciones sobrecompletas eficientes involucra problemas matem谩ticos relevantes y novedosos, en particular el problema de las aproximaciones dispersas. La tesis propone primero una transformaci贸n en ond铆culas log-Gabor auto-inversible inspirada del campo receptivo y la organizaci贸n en multiresoluci贸n de las c茅lulas simples del cortex visual primario (V1). Esta transformaci贸n ofrece resultados prometedores para la eliminaci贸n del ruido. En segundo lugar, las interacciones observadas entre las c茅lulas de V1 que consisten en la inhibici贸n lateral y en la facilitaci贸n entre c茅lulas alineadas se han mostrado eficientes para extraer los bordes de las im谩genes naturales. En tercer lugar, la redundancia introducida por la transformaci贸n sobrecompleta se reduce gracias a un algoritmo dedicado de aproximaci贸n dispersa el cual construye una representaci贸n dispersa de las im谩genes sobre la base de sus bordes. Para una decorrelaci贸n adicional y para conseguir m谩s altas tasas de compresi贸n, los bordes alineados a lo largo de contornos continuos est谩n codificado de manera predictiva por cadenas de coeficientes, lo que ofrece una representacion eficiente de los contornos. Finalmente se presenta un estudio sobre el cierre de contornos utilizando la metodolog铆a de tensor voting. Proponemos el uso de iteraciones y de la informaci贸n de curvatura para mejorar la robustez y la calidad perceptual de los m茅todos existentes
    corecore