12 research outputs found

    Approaching human 3D shape perception with neurally mappable models

    Full text link
    Humans effortlessly infer the 3D shape of objects. What computations underlie this ability? Although various computational models have been proposed, none of them capture the human ability to match object shape across viewpoints. Here, we ask whether and how this gap might be closed. We begin with a relatively novel class of computational models, 3D neural fields, which encapsulate the basic principles of classic analysis-by-synthesis in a deep neural network (DNN). First, we find that a 3D Light Field Network (3D-LFN) supports 3D matching judgments well aligned to humans for within-category comparisons, adversarially-defined comparisons that accentuate the 3D failure cases of standard DNN models, and adversarially-defined comparisons for algorithmically generated shapes with no category structure. We then investigate the source of the 3D-LFN's ability to achieve human-aligned performance through a series of computational experiments. Exposure to multiple viewpoints of objects during training and a multi-view learning objective are the primary factors behind model-human alignment; even conventional DNN architectures come much closer to human behavior when trained with multi-view objectives. Finally, we find that while the models trained with multi-view learning objectives are able to partially generalize to new object categories, they fall short of human alignment. This work provides a foundation for understanding human shape inferences within neurally mappable computational architectures and highlights important questions for future work

    Unsupervised Segmentation in Real-World Images via Spelke Object Inference

    Full text link
    Self-supervised, category-agnostic segmentation of real-world images is a challenging open problem in computer vision. Here, we show how to learn static grouping priors from motion self-supervision by building on the cognitive science concept of a Spelke Object: a set of physical stuff that moves together. We introduce the Excitatory-Inhibitory Segment Extraction Network (EISEN), which learns to extract pairwise affinity graphs for static scenes from motion-based training signals. EISEN then produces segments from affinities using a novel graph propagation and competition network. During training, objects that undergo correlated motion (such as robot arms and the objects they move) are decoupled by a bootstrapping process: EISEN explains away the motion of objects it has already learned to segment. We show that EISEN achieves a substantial improvement in the state of the art for self-supervised image segmentation on challenging synthetic and real-world robotics datasets.Comment: 25 pages, 10 figure

    Identifying concept libraries from language about object structure

    Full text link
    Our understanding of the visual world goes beyond naming objects, encompassing our ability to parse objects into meaningful parts, attributes, and relations. In this work, we leverage natural language descriptions for a diverse set of 2K procedurally generated objects to identify the parts people use and the principles leading these parts to be favored over others. We formalize our problem as search over a space of program libraries that contain different part concepts, using tools from machine translation to evaluate how well programs expressed in each library align to human language. By combining naturalistic language at scale with structured program representations, we discover a fundamental information-theoretic tradeoff governing the part concepts people name: people favor a lexicon that allows concise descriptions of each object, while also minimizing the size of the lexicon itself.Comment: Appears in the conference proceedings of CogSci 202

    Mechanism of short-term ERK activation by electromagnetic fields at mobile phone frequencies

    No full text
    The exposure to non-thermal microwave electromagnetic fields generated by mobile phones affects the expression of many proteins. This effect on transcription and protein stability can be mediated by the MAPK (mitogen-activated protein kinase) cascades, which serve as central signalling pathways and govern essentially all stimulated cellular processes. Indeed, long-term exposure of cells to mobile phone irradiation results in the activation of p38 as well as the ERK (extracellular-signal-regulated kinase) MAPKs. In the present study, we have studied the immediate effect of irradiation on the MAPK cascades, and found that ERKs, but not stress-related MAPKs, are rapidly activated in response to various frequencies and intensities. Using signalling inhibitors, we delineated the mechanism that is involved in this activation. We found that the first step is mediated in the plasma membrane by NADH oxidase, which rapidly generates ROS (reactive oxygen species). These ROS then directly stimulate MMPs (matrix metalloproteinases) and allow them to cleave and release Hb-EGF [heparin-binding EGF (epidermal growth factor)]. This secreted factor activates the EGF receptor, which in turn further activates the ERK cascade. Thus this study demonstrates for the first time a detailed molecular mechanism by which electromagnetic irradiation from mobile phones induces the activation of the ERK cascade and thereby induces transcription and other cellular processes
    corecore