421,165 research outputs found

    What-and-Where to Match: Deep Spatially Multiplicative Integration Networks for Person Re-identification

    Full text link
    Matching pedestrians across disjoint camera views, known as person re-identification (re-id), is a challenging problem that is of importance to visual recognition and surveillance. Most existing methods exploit local regions within spatial manipulation to perform matching in local correspondence. However, they essentially extract \emph{fixed} representations from pre-divided regions for each image and perform matching based on the extracted representation subsequently. For models in this pipeline, local finer patterns that are crucial to distinguish positive pairs from negative ones cannot be captured, and thus making them underperformed. In this paper, we propose a novel deep multiplicative integration gating function, which answers the question of \emph{what-and-where to match} for effective person re-id. To address \emph{what} to match, our deep network emphasizes common local patterns by learning joint representations in a multiplicative way. The network comprises two Convolutional Neural Networks (CNNs) to extract convolutional activations, and generates relevant descriptors for pedestrian matching. This thus, leads to flexible representations for pair-wise images. To address \emph{where} to match, we combat the spatial misalignment by performing spatially recurrent pooling via a four-directional recurrent neural network to impose spatial dependency over all positions with respect to the entire image. The proposed network is designed to be end-to-end trainable to characterize local pairwise feature interactions in a spatially aligned manner. To demonstrate the superiority of our method, extensive experiments are conducted over three benchmark data sets: VIPeR, CUHK03 and Market-1501.Comment: Published at Pattern Recognition, Elsevie

    Understanding Learned Models by Identifying Important Features at the Right Resolution

    Full text link
    In many application domains, it is important to characterize how complex learned models make their decisions across the distribution of instances. One way to do this is to identify the features and interactions among them that contribute to a model's predictive accuracy. We present a model-agnostic approach to this task that makes the following specific contributions. Our approach (i) tests feature groups, in addition to base features, and tries to determine the level of resolution at which important features can be determined, (ii) uses hypothesis testing to rigorously assess the effect of each feature on the model's loss, (iii) employs a hierarchical approach to control the false discovery rate when testing feature groups and individual base features for importance, and (iv) uses hypothesis testing to identify important interactions among features and feature groups. We evaluate our approach by analyzing random forest and LSTM neural network models learned in two challenging biomedical applications.Comment: First two authors contributed equally to this work, Accepted for presentation at the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19

    Learning a local-variable model of aromatic and conjugated systems

    Get PDF
    A collection of new approaches to building and training neural networks, collectively referred to as deep learning, are attracting attention in theoretical chemistry. Several groups aim to replace computationally expensive <i>ab initio</i> quantum mechanics calculations with learned estimators. This raises questions about the representability of complex quantum chemical systems with neural networks. Can local-variable models efficiently approximate nonlocal quantum chemical features? Here, we find that convolutional architectures, those that only aggregate information locally, cannot efficiently represent aromaticity and conjugation in large systems. They cannot represent long-range nonlocality known to be important in quantum chemistry. This study uses aromatic and conjugated systems computed from molecule graphs, though reproducing quantum simulations is the ultimate goal. This task, by definition, is both computable and known to be important to chemistry. The failure of convolutional architectures on this focused task calls into question their use in modeling quantum mechanics. To remedy this heretofore unrecognized deficiency, we introduce a new architecture that propagates information back and forth in waves of nonlinear computation. This architecture is still a local-variable model, and it is both computationally and representationally efficient, processing molecules in sublinear time with far fewer parameters than convolutional networks. Wave-like propagation models aromatic and conjugated systems with high accuracy, and even models the impact of small structural changes on large molecules. This new architecture demonstrates that some nonlocal features of quantum chemistry can be efficiently represented in local variable models

    Do Deep Neural Networks Model Nonlinear Compositionality in the Neural Representation of Human-Object Interactions?

    Full text link
    Visual scene understanding often requires the processing of human-object interactions. Here we seek to explore if and how well Deep Neural Network (DNN) models capture features similar to the brain's representation of humans, objects, and their interactions. We investigate brain regions which process human-, object-, or interaction-specific information, and establish correspondences between them and DNN features. Our results suggest that we can infer the selectivity of these regions to particular visual stimuli using DNN representations. We also map features from the DNN to the regions, thus linking the DNN representations to those found in specific parts of the visual cortex. In particular, our results suggest that a typical DNN representation contains encoding of compositional information for human-object interactions which goes beyond a linear combination of the encodings for the two components, thus suggesting that DNNs may be able to model this important property of biological vision.Comment: 4 pages, 2 figures; presented at CCN 201

    Coupled surface to deep Earth processes: Perspectives from TOPO-EUROPE with an emphasis on climate- and energy-related societal challenges

    Get PDF
    Understanding the interactions between surface and deep Earth processes is important for research in many diverse scientific areas including climate, environment, energy, georesources and biosphere. The TOPO-EUROPE initiative of the International Lithosphere Program serves as a pan-European platform for integrated surface and deep Earth sciences, synergizing observational studies of the Earth structure and fluxes on all spatial and temporal scales with modelling of Earth processes. This review provides a survey of scientific developments in our quantitative understanding of coupled surface-deep Earth processes achieved through TOPO-EUROPE. The most notable innovations include (1) a process-based understanding of the connection of upper mantle dynamics and absolute plate motion frames; (2) integrated models for sediment source-to-sink dynamics, demonstrating the importance of mass transfer from mountains to basins and from basin to basin; (3) demonstration of the key role of polyphase evolution of sedimentary basins, the impact of pre-rift and pre-orogenic structures, and the evolution of subsequent lithosphere and landscape dynamics; (4) improved conceptual understanding of the temporal evolution from back-arc extension to tectonic inversion and onset of subduction; (5) models to explain the integrated strength of Europe's lithosphere; (6) concepts governing the interplay between thermal upper mantle processes and stress-induced intraplate deformation; (7) constraints on the record of vertical motions from high-resolution data sets obtained from geo-thermochronology for Europe's topographic evolution; (8) recognition and quantifications of the forcing by erosional and/or glacial-interglacial surface mass transfer on the regional magmatism, with major implications for our understanding of the carbon cycle on geological timescales and the emerging field of biogeodynamics; and (9) the transfer of insights obtained on the coupling of deep Earth and surface processes to the domain of geothermal energy exploration. Concerning the future research agenda of TOPO-EUROPE, we also discuss the rich potential for further advances, multidisciplinary research and community building across many scientific frontiers, including research on the biosphere, climate and energy. These will focus on obtaining a better insight into the initiation and evolution of subduction systems, the role of mantle plumes in continental rifting and (super)continent break-up, and the deformation and tectonic reactivation of cratons; the interaction between geodynamic, surface and climate processes, such as interactions between glaciation, sea level change and deep Earth processes; the sensitivity, tipping points, and spatio-temporal evolution of the interactions between climate and tectonics as well as the role of rock melting and outgassing in affecting such interactions; the emerging field of biogeodynamics, that is the impact of coupled deep Earth – surface processes on the evolution of life on Earth; and tightening the connection between societal challenges regarding renewable georesources, climate change, natural geohazards, and novel process-understanding of the Earth system

    Deep Memory Networks for Attitude Identification

    Full text link
    We consider the task of identifying attitudes towards a given set of entities from text. Conventionally, this task is decomposed into two separate subtasks: target detection that identifies whether each entity is mentioned in the text, either explicitly or implicitly, and polarity classification that classifies the exact sentiment towards an identified entity (the target) into positive, negative, or neutral. Instead, we show that attitude identification can be solved with an end-to-end machine learning architecture, in which the two subtasks are interleaved by a deep memory network. In this way, signals produced in target detection provide clues for polarity classification, and reversely, the predicted polarity provides feedback to the identification of targets. Moreover, the treatments for the set of targets also influence each other -- the learned representations may share the same semantics for some targets but vary for others. The proposed deep memory network, the AttNet, outperforms methods that do not consider the interactions between the subtasks or those among the targets, including conventional machine learning methods and the state-of-the-art deep learning models.Comment: Accepted to WSDM'1
    • …
    corecore