10 research outputs found

    Generalizing Common Tasks in Automated Skin Lesion Diagnosis

    Full text link

    Computer Vision for Timber Harvesting

    Get PDF

    Realistic Virtual Cuts

    Get PDF

    Quantifying Texture Scale in Accordance With Human Perception

    Get PDF
    Visual texture has multiple perceptual attributes (e.g. regularity, isotropy, etc.), including scale. The scale of visual texture has been defined as the size of the repeating unit (or texel) of which the texture is composed. Not all textures are formed through the placement of a clearly discernible repeating unit (e.g. irregular and stochastic textures). There is currently no rigorous definition for texture scale that is applicable to textures of a wide range of regularities. We hypothesised that texture scale ought to extend to these less regular textures. Non-overlapping sample windows (or patches) taken from a texture appear increasingly similar as the size of the window gets larger. This is true irrespective of whether the texture is formed by the placement of a discernible repeating unit or not. We propose the following new characterisation for texture scale: “the smallest window size beyond within which texture appears consistently”. We perform two psychophysical studies and report data that demonstrates consensus across subjects and across methods of probing in the assessment of texture scale. We then present an empirical algorithm for the estimation of scale based on this characterisation. We demonstrate agreement between the algorithm and (subjective) human assessment with an RMS accuracy of 1.2 just-noticeable-differences, a significant improvement over previous published algorithms. We provide two ground-truth perceptual datasets, one for each of our psychophysical studies, for the texture scale of the entire Brodatz album, together with confidence levels for each of our estimates. Finally, we make available an online tool which researchers can use to obtain texture scale estimates by uploading images of textures

    Modelling appearance and geometry from images

    Get PDF
    Acquisition of realistic and relightable 3D models of large outdoor structures, such as buildings, requires the modelling of detailed geometry and visual appearance. Recovering these material characteristics can be very time consuming and needs specially dedicated equipment. Alternatively, surface detail can be conveyed by textures recovered from images, whose appearance is only valid under the originally photographed viewing and lighting conditions. Methods to easily capture locally detailed geometry, such as cracks in stone walls, and visual appearance require control of lighting conditions, which are usually restricted to small portions of surfaces captured at close range.This thesis investigates the acquisition of high-quality models from images, using simple photographic equipment and modest user intervention. The main focus of this investigation is on approximating detailed local depth information and visual appearance, obtained using a new image-based approach, and combining this with gross-scale 3D geometry. This is achieved by capturing these surface characteristics in small accessible regions and transferring them to the complete façade. This approach yields high-quality models, imparting the illusion of measured reflectance. In this thesis, we first present two novel algorithms for surface detail and visual appearance transfer, where these material properties are captured for small exemplars, using an image-based technique. Second, we develop an interactive solution to solve the problems of performing the transfer over both a large change in scale and to the different materials contained in a complete façade. Aiming to completely automate this process, a novel algorithm to differentiate between materials in the façade and associate them with the correct exemplars is introduced with promising results. Third, we present a new method for texture reconstruction from multiple images that optimises texture quality, by choosing the best view for every point and minimising seams. Material properties are transferred from the exemplars to the texture map, approximating reflectance and meso-structure. The combination of these techniques results in a complete working system capable of producing realistic relightable models of full building façades, containing high-resolution geometry and plausible visual appearance.EThOS - Electronic Theses Online ServiceGBUnited Kingdo

    Probabilistic Methods for Model Validation

    Get PDF
    This dissertation develops a probabilistic method for validation and verification (V&V) of uncertain nonlinear systems. Existing systems-control literature on model and controller V&V either deal with linear systems with norm-bounded uncertainties,or consider nonlinear systems in set-based and moment based framework. These existing methods deal with model invalidation or falsification, rather than assessing the quality of a model with respect to measured data. In this dissertation, an axiomatic framework for model validation is proposed in probabilistically relaxed sense, that instead of simply invalidating a model, seeks to quantify the "degree of validation". To develop this framework, novel algorithms for uncertainty propagation have been proposed for both deterministic and stochastic nonlinear systems in continuous time. For the deterministic flow, we compute the time-varying joint probability density functions over the state space, by solving the Liouville equation via method-of-characteristics. For the stochastic flow, we propose an approximation algorithm that combines the method-of-characteristics solution of Liouville equation with the Karhunen-Lo eve expansion of process noise, thus enabling an indirect solution of Fokker-Planck equation, governing the evolution of joint probability density functions. The efficacy of these algorithms are demonstrated for risk assessment in Mars entry-descent-landing, and for nonlinear estimation. Next, the V&V problem is formulated in terms of Monge-Kantorovich optimal transport, naturally giving rise to a metric, called Wasserstein metric, on the space of probability densities. It is shown that the resulting computation leads to solving a linear program at each time of measurement availability, and computational complexity results for the same are derived. Probabilistic guarantees in average and worst case sense, are given for the validation oracle resulting from the proposed method. The framework is demonstrated for nonlinear robustness veri cation of F-16 flight controllers, subject to probabilistic uncertainties. Frequency domain interpretations for the proposed framework are derived for linear systems, and its connections with existing nonlinear model validation methods are pointed out. In particular, we show that the asymptotic Wasserstein gap between two single-output linear time invariant systems excited by Gaussian white noise, is the difference between their average gains, up to a scaling by the strength of the input noise. A geometric interpretation of this result allows us to propose an intrinsic normalization of the Wasserstein gap, which in turn allows us to compare it with classical systems-theoretic metrics like v-gap. Next, it is shown that the optimal transport map can be used to automatically refine the model. This model refinement formulation leads to solving a non-smooth convex optimization problem. Examples are given to demonstrate how proximal operator splitting based computation enables numerically solving the same. This method is applied for nite-time feedback control of probability density functions, and for data driven modeling of dynamical systems

    The scale of a texture and its application to segmentation

    No full text

    Visual Words Dictionaries And Fusion Techniques For Searching People Through Textual And Visual Attributes

    No full text
    Using personal traits for searching people is paramount in several application areas and has attracted an ever-growing attention from the scientific community over the past years. Some practical applications in the realm of digital forensics and surveillance include locating a suspect or finding missing people in a public space. In this paper, we aim at assigning describable visual attributes (e.g., white chubby male wearing glasses and with bangs) as labels to images to describe their appearance and performing visual searches without relying on image annotations during testing. For that, we create mid-level image representations for face images based on visual dictionaries linking visual properties in the images to describable attributes. In addition, we take advantage of machine learning techniques for combining different attributes and performing a query. First, we propose three methods for building the visual dictionaries. Method #1 uses a sparse-sampling scheme to obtain low-level features with a clustering algorithm to build the visual dictionaries. Method #2 uses dense-sampling to obtain low-level features and random selection to build the visual dictionaries while Method #3 uses dense-sampling to obtain low-level features followed by a clustering algorithm to build the visual dictionaries. Thereafter, we train 2-class classifiers for the describable visual attributes of interest which assign to each image a decision score used to obtain its ranking. For more complex queries (2+ attributes), we use three state-of-the-art approaches for combining the rankings: (1) product of probabilities, (2) rank aggregation and (3) rank position. To date, we have considered fifteen attribute classifiers and, consequently, their direct counterparts theoretically allowing 2 15=32,768 different combined queries (the actual number is smaller since some attributes are contradictory or mutually exclusive). Notwithstanding, the method is easily extensible to include new attributes. Experimental results show that Method #3 greatly improves retrieval precision for some attributes in comparison with other methods in the literature. Finally, for combined attributes, product of probabilities, rank aggregation and rank position yield complementary results for rank fusion and the final decision making suggesting interesting possible combinations for further work. © 2013 Elsevier B.V. All rights reserved.39174842010/05647-4; Microsoft ResearchBay, H., Tuytelaars, T., Gool, L.V., Surf: Speeded up robust features (2006) European Conference on Computer Vision (ECCV), pp. 1-14Boureau, Y., Bach, F., Lecun, Y., Ponce, J., Learning mid-level features for recognition (2010) IEEE Intl. Conference on Computer Vision And, Pattern Recognition, pp. 2559-2566Carkacloglu, A., Yarman-Vural, F., SASI: A generic texture descriptor for image retrieval (2003) Pattern Recognition, 36 (11), pp. 2615-2633. , DOI 10.1016/S0031-3203(03)00171-7Cottrell, G.W., Metcalfe, J., Empath: Face, emotion, and gender recognition using holons (1990) Neural Information Processing Systems (NIPS), pp. 564-571Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C., Visual categorization with bags of keypoints (2004) European Conference on Computer Vision (ECCV), pp. 1-14Datta, A., Feris, R., Vaquero, D., Hierarchical ranking of facial attributes (2011) IEEE International Conference on Face and Gesture (F&G), pp. 36-42Do Valle Jr., E.A., Local-descriptor matching for image identification systems (2008) Ph.D. Thesis, , Université de Cergy-Pontoise École Doctorale Sciences et Ingénierie, Cergy-Pontoise, France (June)Fabian, J., Pires, R., Rocha, A., Searching for people through textual and visual attributes (2012) 25th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), 2012, pp. 276-282Fei-Fei, L., Perona, P., A bayesian hierarchical model for learning natural scene categories (2005) IEEE Intl. Conference on Computer Vision and Pattern Recognition (CVPR), pp. 524-531Ferrari, V., Zisserman, A., Learning visual attributes (2007) Neural Information Processing Systems (NIPS), pp. 1-8Golomb, B., Lawrence, D., Sejnowski, T., Sexnet: A neural network identifies sex from human faces (1990) Neural Information Processing Systems (NIPS), pp. 572-577Gonzalez, R., Woods, R., (2007) Digital Image Processing, , third ed. Prentice-HallHaralick, R.M., Shanmugam, K., Textural features for image classification (1973) IEEE Transactions on Systems, Man, and Cybernetics (SMC-3), 6 (1), pp. 610-621Heflin, B., Scheirer, W., Rocha, A., Boult, T.E., (2011) Pattern Recognition, Machine Intelligence and Biometrics: Expanding Frontiers, No. ISBN 978-3-642-22406-5 in 1, pp. 361-387. , Springer, Ch. A Look at Eye Detection for Unconstrained EnvironmentsHong, B.-W., Soatto, S., Ni, K., Chan, T., The scale of a texture and its application to segmentation (2008) IEEE Intl. Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1-8Huang, G., Ramesh, M., Berg, T., (2007) E. Learned-miller, Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments.Jelinek, H.F., Pires, R., Padilha, R., Goldenstein, S., Wainer, J., Bossomaier, T., Rocha, A., Data fusion for multi-lesion diabetic retinopathy detection (2012) IEEE International Symposium on Computer-based Medical System (CBMS), , Rome, Italy, (in press)Jurie, F., Triggs, B., Creating efficient codebooks for visual recognition (2005) Proceedings of the IEEE International Conference on Computer Vision, I, pp. 604-610. , DOI 10.1109/ICCV.2005.66, 1541309, Proceedings - 10th IEEE International Conference on Computer Vision, ICCV 2005Kemeny, J., Mathematics without numbers (1959) Daedalus, 88 (4), pp. 577-591Kumar, N., Belhumeur, P., Nayar, S., Facetracer: A search engine for collections of images with faces (2008) European Conference on Computer Vision (ECCV), pp. 340-353Kumar, N., Berg, A.C., Belhumeur, P., Nayar, S., Attribute and simile classifiers for face verification (2009) IEEE International Conference on Computer Vision (ICCV), pp. 365-372Kumar, N., Berg, A.C., Belhumeur, P.N., Nayar, S.K., Describable visual attributes for face verification and image search (2011) IEEE Transactions on Pattern Analysis and Machine Intelligence (T.PAMI), 33 (10), pp. 1962-1977Lam, L., Suen, C.Y., Optimal combinations of pattern classifiers (1995) PRL, 16 (9), pp. 945-954Lampert, C., Nickisch, H., Harmeling, S., Learning to detect unseen object classes by between-class attribute transfer (2009) IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 951-958Lowe, D., Distinctive image features from scale-invariant keypoints (2004) International Journal of Computer Vision (IJCV), 60 (2), pp. 91-110Nowak, E., Jurie, F., Triggs, B., Sampling strategies for bag-of-features image classification (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3954, pp. 490-503. , DOI 10.1007/11744085-38, Computer Vision - ECCV 2006, 9th European Conference on Computer Vision, ProceedingsNuray, R., Can, F., Automatic ranking of information retrieval systems using data fusion (2006) Information Processing and Management, 42 (3), pp. 595-614. , DOI 10.1016/j.ipm.2005.03.023, PII S0306457305000555Park, U., Liao, S., Klare, B., Voss, J., Jain, A.K., Face finder: Filtering a large face database using scars, marks and tattoos (2011) Tech. Rep. TR11, , Michigan State UnivPedronette, D., Da, R., Torres, S., Exploiting contextual information for image re-ranking and rank aggregation (2012) International Journal of Multimedia Information Retrieval (JMIR), 1 (1), pp. 115-128Pedronette, D., Da, R., Torres, S., Exploiting pairwise recommendation and clustering strategies for image re-ranking (2012) Information Sciences (IS), 207 (1), pp. 19-34Penatti, O.A.B., Valle, E., Torres, R.S., Comparative study of global color and texture descriptors for web image retrieval (2012) Journal of Visual Communication and Image Representation, 23 (2), pp. 359-380Pires, R., Wainer, J., Jelinek, H.F., Rocha, A., Retinal image quality analysis for automatic diabetic retinopathy detection (2012) 25th Conference on Graphics, Patterns and Images (SIBGRAPI), , Ouro Preto, Brazil (in press)Presti, L.L., Cascia, M.L., Entropy-based localization of textured regions (2011) Intl. Conference on Image Analysis and Processing (ICIAP), pp. 616-625Roberts, F., (1976) Discrete Mathematical Models with Applications to Social, Biological, and Environmental Problems, , Prentice HallRocha, A., Carvalho, T., Jelinek, H.F., Goldenstein, S., Wainer, J., Points of interest and visual dictionaries for automatic retinal lesion detection (2012) IEEE Transactions on Biomedical Engineering (T.BME), 59 (8), pp. 2244-2253Scheirer, W., Rocha, A., Michaels, R., Boult, T.E., Extreme value theory for recognition score normalization (2010) European Conference on Computer Vision (ECCV), pp. 481-495Scheirer, W., Kumar, N., Ricanek, K., Boult, T., Belhumeur, P., Fusing with context: A bayesian approach to combining descriptive attributes (2011) IEEE Intl. Joint Conference on Biometrics (IJCB), pp. 1-8Scheirer, W., Kumar, N., Belhumeur, P.N., Boult, T.E., Multi-attribute spaces: Calibration for attribute fusion and similarity search (2012) IEEE Intl. Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2933-2940Viola, P., Jones, M., Robust real-time face detection (2004) International Journal of Computer Vision (IJCV), 57, pp. 137-15
    corecore