4 research outputs found

    Probabilistic models for multi-view semi-supervised learning and coding

    Get PDF
    Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.Cataloged from PDF version of thesis.Includes bibliographical references (p. 146-160).This thesis investigates the problem of classification from multiple noisy sensors or modalities. Examples include speech and gesture interfaces and multi-camera distributed sensor networks. Reliable recognition in such settings hinges upon the ability to learn accurate classification models in the face of limited supervision and to cope with the relatively large amount of potentially redundant information transmitted by each sensor or modality (i.e., view). We investigate and develop novel multi view learning algorithms capable of learning from semi-supervised noisy sensor data, for automatically adapting to new users and working conditions, and for performing distributed feature selection on bandwidth limited sensor networks. We propose probabilistic models built upon multi-view Gaussian Processes (GPs) for solving this class of problems, and demonstrate our approaches for solving audio-visual speech and gesture, and multi-view object classification problems. Multi-modal tasks are good candidates for multi-view learning, since each modality provides a potentially redundant view to the learning algorithm. On audio-visual speech unit classification, and user agreement recognition using spoken utterances and head gestures, we demonstrate that multi-modal co-training can be used to learn from only a few labeled examples in one or both of the audio-visual modalities. We also propose a co-adaptation algorithm, which adapts existing audio-visual classifiers to a particular user or noise condition by leveraging the redundancy in the unlabeled data. Existing methods typically assume constant per-channel noise models.(cont.) In contrast we develop co-training algorithms that are able to learn from noisy sensor data corrupted by complex per-sample noise processes, e.g., occlusion common to multi sensor classification problems. We propose a probabilistic heteroscedastic approach to co-training that simultaneously discovers the amount of noise on a per-sample basis, while solving the classification task. This results in accurate performance in the presence of occlusion or other complex noise processes. We also investigate an extension of this idea for supervised multi-view learning where we develop a Bayesian multiple kernel learning algorithm that can learn a local weighting over each view of the input space. We additionally consider the problem of distributed object recognition or indexing from multiple cameras, where the computational power available at each camera sensor is limited and communication between cameras is prohibitively expensive. In this scenario, it is desirable to avoid sending redundant visual features from multiple views. Traditional supervised feature selection approaches are inapplicable as the class label is unknown at each camera. In this thesis, we propose an unsupervised multi-view feature selection algorithm based on a distributed coding approach. With our method, a Gaussian Process model of the joint view statistics is used at the receiver to obtain a joint encoding of the views without directly sharing information across encoders. We demonstrate our approach on recognition and indexing tasks with multi-view image databases and show that our method compares favorably to an independent encoding of the features from each camera.by C. Mario Christoudias.Ph.D

    Cartographic Abstraction : Mapping Practices in Contemporary Art

    Get PDF
    This thesis proposes a theory of cartographic abstraction as a framework for investigating cartographic viewing, and does so through engaging with a series of contemporary artworks concerned with cartographic ‘ways of seeing’ (Berger 1972). Cartographic abstraction is a material modality of thought and experience that is produced through cartographic techniques of depiction. It is the more-than-visual register that posits and produces the ‘cartographic world’, or what John Pickles has called the ‘geo-coded world’ (2006). By this I mean the naturalized apprehension of the earth as a homogeneous space that is naturally, even necessarily, understood as regular, consistent and objective. I argue for identifying cartographic techniques of depiction as themselves abstract, and cartographic abstraction as such as the modality of thought and experience that these techniques produce. Abstraction within capitalism comes to be socially real and material, taking place outside thought. I propose a series of viewpoints, that are posited by the relations of viewing enacted by the selected artworks themselves. I analyse these viewpoints in relation to modes of cartographic viewing offered by theorists. Through close readings of cartographic artworks, I expand the current possibilities for understanding cartographic abstraction and its effects, through proposing a range of viewpoints that are both deployed in, and themselves problematize, cartographic viewing. I connect cartographic abstraction to debates about abstraction in Marxist and materialist approaches to philosophy, arguing for interpreting cartographic viewing as an abstract practice through which subjects are positioned and structured in relation to the ‘viewed’. This study discerns ‘real abstraction’ functioning in a particular area of ‘the operations of capitalism’; that is, modes of visual, and epistemological, abstraction that we can identify by exploring artworks concerned with cartographic depiction and conceptualisation. This approach to abstraction explores how cartographic knowledge can be theorized through recognising cartographic abstraction as a material modality of thought and experience
    corecore