4 research outputs found
Probabilistic models for multi-view semi-supervised learning and coding
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.Cataloged from PDF version of thesis.Includes bibliographical references (p. 146-160).This thesis investigates the problem of classification from multiple noisy sensors or modalities. Examples include speech and gesture interfaces and multi-camera distributed sensor networks. Reliable recognition in such settings hinges upon the ability to learn accurate classification models in the face of limited supervision and to cope with the relatively large amount of potentially redundant information transmitted by each sensor or modality (i.e., view). We investigate and develop novel multi view learning algorithms capable of learning from semi-supervised noisy sensor data, for automatically adapting to new users and working conditions, and for performing distributed feature selection on bandwidth limited sensor networks. We propose probabilistic models built upon multi-view Gaussian Processes (GPs) for solving this class of problems, and demonstrate our approaches for solving audio-visual speech and gesture, and multi-view object classification problems. Multi-modal tasks are good candidates for multi-view learning, since each modality provides a potentially redundant view to the learning algorithm. On audio-visual speech unit classification, and user agreement recognition using spoken utterances and head gestures, we demonstrate that multi-modal co-training can be used to learn from only a few labeled examples in one or both of the audio-visual modalities. We also propose a co-adaptation algorithm, which adapts existing audio-visual classifiers to a particular user or noise condition by leveraging the redundancy in the unlabeled data. Existing methods typically assume constant per-channel noise models.(cont.) In contrast we develop co-training algorithms that are able to learn from noisy sensor data corrupted by complex per-sample noise processes, e.g., occlusion common to multi sensor classification problems. We propose a probabilistic heteroscedastic approach to co-training that simultaneously discovers the amount of noise on a per-sample basis, while solving the classification task. This results in accurate performance in the presence of occlusion or other complex noise processes. We also investigate an extension of this idea for supervised multi-view learning where we develop a Bayesian multiple kernel learning algorithm that can learn a local weighting over each view of the input space. We additionally consider the problem of distributed object recognition or indexing from multiple cameras, where the computational power available at each camera sensor is limited and communication between cameras is prohibitively expensive. In this scenario, it is desirable to avoid sending redundant visual features from multiple views. Traditional supervised feature selection approaches are inapplicable as the class label is unknown at each camera. In this thesis, we propose an unsupervised multi-view feature selection algorithm based on a distributed coding approach. With our method, a Gaussian Process model of the joint view statistics is used at the receiver to obtain a joint encoding of the views without directly sharing information across encoders. We demonstrate our approach on recognition and indexing tasks with multi-view image databases and show that our method compares favorably to an independent encoding of the features from each camera.by C. Mario Christoudias.Ph.D
Cartographic Abstraction : Mapping Practices in Contemporary Art
This thesis proposes a theory of cartographic abstraction as a framework
for investigating cartographic viewing, and does so through engaging with
a series of contemporary artworks concerned with cartographic âways of
seeingâ (Berger 1972). Cartographic abstraction is a material modality of
thought and experience that is produced through cartographic techniques
of depiction. It is the more-than-visual register that posits and produces
the âcartographic worldâ, or what John Pickles has called the âgeo-coded
worldâ (2006). By this I mean the naturalized apprehension of the earth as
a homogeneous space that is naturally, even necessarily, understood as
regular, consistent and objective. I argue for identifying cartographic
techniques of depiction as themselves abstract, and cartographic
abstraction as such as the modality of thought and experience that these
techniques produce. Abstraction within capitalism comes to be socially
real and material, taking place outside thought.
I propose a series of viewpoints, that are posited by the relations of viewing
enacted by the selected artworks themselves. I analyse these viewpoints
in relation to modes of cartographic viewing offered by theorists. Through
close readings of cartographic artworks, I expand the current possibilities
for understanding cartographic abstraction and its effects, through
proposing a range of viewpoints that are both deployed in, and themselves
problematize, cartographic viewing. I connect cartographic abstraction to
debates about abstraction in Marxist and materialist approaches to
philosophy, arguing for interpreting cartographic viewing as an abstract
practice through which subjects are positioned and structured in relation to
the âviewedâ. This study discerns âreal abstractionâ functioning in a
particular area of âthe operations of capitalismâ; that is, modes of visual,
and epistemological, abstraction that we can identify by exploring artworks
concerned with cartographic depiction and conceptualisation. This
approach to abstraction explores how cartographic knowledge can be
theorized through recognising cartographic abstraction as a material modality
of thought and experience