764 research outputs found

    Multimedia Semantic Integrity Assessment Using Joint Embedding Of Images And Text

    Full text link
    Real world multimedia data is often composed of multiple modalities such as an image or a video with associated text (e.g. captions, user comments, etc.) and metadata. Such multimodal data packages are prone to manipulations, where a subset of these modalities can be altered to misrepresent or repurpose data packages, with possible malicious intent. It is, therefore, important to develop methods to assess or verify the integrity of these multimedia packages. Using computer vision and natural language processing methods to directly compare the image (or video) and the associated caption to verify the integrity of a media package is only possible for a limited set of objects and scenes. In this paper, we present a novel deep learning-based approach for assessing the semantic integrity of multimedia packages containing images and captions, using a reference set of multimedia packages. We construct a joint embedding of images and captions with deep multimodal representation learning on the reference dataset in a framework that also provides image-caption consistency scores (ICCSs). The integrity of query media packages is assessed as the inlierness of the query ICCSs with respect to the reference dataset. We present the MultimodAl Information Manipulation dataset (MAIM), a new dataset of media packages from Flickr, which we make available to the research community. We use both the newly created dataset as well as Flickr30K and MS COCO datasets to quantitatively evaluate our proposed approach. The reference dataset does not contain unmanipulated versions of tampered query packages. Our method is able to achieve F1 scores of 0.75, 0.89 and 0.94 on MAIM, Flickr30K and MS COCO, respectively, for detecting semantically incoherent media packages.Comment: *Ayush Jaiswal and Ekraam Sabir contributed equally to the work in this pape

    Zero-Shot Learning by Convex Combination of Semantic Embeddings

    Full text link
    Several recent publications have proposed methods for mapping images into continuous semantic embedding spaces. In some cases the embedding space is trained jointly with the image transformation. In other cases the semantic embedding space is established by an independent natural language processing task, and then the image transformation into that space is learned in a second stage. Proponents of these image embedding systems have stressed their advantages over the traditional \nway{} classification framing of image understanding, particularly in terms of the promise for zero-shot learning -- the ability to correctly annotate images of previously unseen object categories. In this paper, we propose a simple method for constructing an image embedding system from any existing \nway{} image classifier and a semantic word embedding model, which contains the \n class labels in its vocabulary. Our method maps images into the semantic embedding space via convex combination of the class label embedding vectors, and requires no additional training. We show that this simple and direct method confers many of the advantages associated with more complex image embedding schemes, and indeed outperforms state of the art methods on the ImageNet zero-shot learning task

    Learning SO(3) Equivariant Representations with Spherical CNNs

    Full text link
    We address the problem of 3D rotation equivariance in convolutional neural networks. 3D rotations have been a challenging nuisance in 3D classification tasks requiring higher capacity and extended data augmentation in order to tackle it. We model 3D data with multi-valued spherical functions and we propose a novel spherical convolutional network that implements exact convolutions on the sphere by realizing them in the spherical harmonic domain. Resulting filters have local symmetry and are localized by enforcing smooth spectra. We apply a novel pooling on the spectral domain and our operations are independent of the underlying spherical resolution throughout the network. We show that networks with much lower capacity and without requiring data augmentation can exhibit performance comparable to the state of the art in standard retrieval and classification benchmarks.Comment: Camera-ready. Accepted to ECCV'18 as oral presentatio

    Directed Trans-Differentiation of Thymus Cells into Parathyroid-Like Cells Without Genetic Manipulation

    Full text link
    Replacement of a diseased organ with an autologously derived tissue is an ideal therapy for some medical problems. However, it is difficult to recreate many adult human tissues in vitro due to the functionally necessary architecture of most organs and the lack of understanding of methods to direct the development of the organ of interest. The parathyroid gland is ideal for in vitro organ development because this gland is relatively simple, is transplantable, and is commonly affected by a surgical complication rather than an autoimmune disease. We have investigated thymus as a source of autologous endoderm and parathyroid-like precursor cells. Human thymus cells were treated with a differentiation protocol we developed with human embryonic stem cells (The Bingham Protocol) that utilizes timed exposures to Activin A and soluble Sonic hedgehog (Shh). We incrementally changed the protocol to optimize the differentiation of the thymus cells into parathyroid-like cells. The final protocol used 50-ng/mL Activin A and 100-ng/mL Shh over 13 weeks. The differentiated cells expressed the parathyroid markers parathyroid hormone (PTH), calcium sensing receptor, chemokine receptor type-4 (CXCR4), and chorian-specific transcription factor (GCM2) as measured by reverse transcription-polymerase chain reaction and PTH enzyme-linked immunosorbent assay. Cultured thymus cells without Activin A or Shh exposure did not secrete PTH nor express similar markers. The differentiated cells released PTH, which was suppressed in response to increased calcium concentration. The chemically differentiated cells did not form tumors in immune-compromised mice. Our protocol recreated cells with markers of parathyroid tissue that responded as parathyroid cells to physiologic stimuli. This approach is a further step toward a strategy to restore parathyroid function using autologous cells that were directed to differentiate by nongenetic in vitro manipulation.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/90464/1/ten-2Etec-2E2011-2E0170.pd

    Detecting Sarcasm in Multimodal Social Platforms

    Full text link
    Sarcasm is a peculiar form of sentiment expression, where the surface sentiment differs from the implied sentiment. The detection of sarcasm in social media platforms has been applied in the past mainly to textual utterances where lexical indicators (such as interjections and intensifiers), linguistic markers, and contextual information (such as user profiles, or past conversations) were used to detect the sarcastic tone. However, modern social media platforms allow to create multimodal messages where audiovisual content is integrated with the text, making the analysis of a mode in isolation partial. In our work, we first study the relationship between the textual and visual aspects in multimodal posts from three major social media platforms, i.e., Instagram, Tumblr and Twitter, and we run a crowdsourcing task to quantify the extent to which images are perceived as necessary by human annotators. Moreover, we propose two different computational frameworks to detect sarcasm that integrate the textual and visual modalities. The first approach exploits visual semantics trained on an external dataset, and concatenates the semantics features with state-of-the-art textual features. The second method adapts a visual neural network initialized with parameters trained on ImageNet to multimodal sarcastic posts. Results show the positive effect of combining modalities for the detection of sarcasm across platforms and methods.Comment: 10 pages, 3 figures, final version published in the Proceedings of ACM Multimedia 201

    Zero-Shot Hashing via Transferring Supervised Knowledge

    Full text link
    Hashing has shown its efficiency and effectiveness in facilitating large-scale multimedia applications. Supervised knowledge e.g. semantic labels or pair-wise relationship) associated to data is capable of significantly improving the quality of hash codes and hash functions. However, confronted with the rapid growth of newly-emerging concepts and multimedia data on the Web, existing supervised hashing approaches may easily suffer from the scarcity and validity of supervised information due to the expensive cost of manual labelling. In this paper, we propose a novel hashing scheme, termed \emph{zero-shot hashing} (ZSH), which compresses images of "unseen" categories to binary codes with hash functions learned from limited training data of "seen" categories. Specifically, we project independent data labels i.e. 0/1-form label vectors) into semantic embedding space, where semantic relationships among all the labels can be precisely characterized and thus seen supervised knowledge can be transferred to unseen classes. Moreover, in order to cope with the semantic shift problem, we rotate the embedded space to more suitably align the embedded semantics with the low-level visual feature space, thereby alleviating the influence of semantic gap. In the meantime, to exert positive effects on learning high-quality hash functions, we further propose to preserve local structural property and discrete nature in binary codes. Besides, we develop an efficient alternating algorithm to solve the ZSH model. Extensive experiments conducted on various real-life datasets show the superior zero-shot image retrieval performance of ZSH as compared to several state-of-the-art hashing methods.Comment: 11 page

    In wildness is the preservation of the world

    Get PDF
    Passage of the Wilderness Act of 1964 and establishment of the National Wilderness Preservation System represent an historic achievement, an assertion of idealism in a materialist-powered society. The four federal land-management agencies, however, have failed to meet their legislative mandate. Thus Earth Day 1990 provides a singular opportunity to address the unfinished agenda with consideration of a new agency to be called the United States Wilderness Service

    D.C. Background on Predator Control Legislation

    Get PDF
    The tragic fiasco of federal predator control as we have known it is finished. The American people will no longer tolerate it. In this age of environmental concern, the people will not allow their tax dollars to be diverted for such a destructive and wasteful war against living wild creatures for the exclusive benefit of the sheep industry. There is now no turning back to old ways. Indiscriminate trapping, shooting and poisoning have reduced some of the rarest, most beautiful and superbly adapted species of our wildlife heritage to the brink of extinction, although they consitutue a resource that could be enjoyed by all and harvested by sportsmen under sound management principles. The war on predators has been waged with little scientific knowledge of their beneficial role in the biotic community, and without moral or ethical consideration for man\u27s responsibility in preserving natural life as an integral part of the environment. As I wrote in the January, 1971, issue of Field and Stream, the Division of Wildlife Services, an agency of the Interior Department, has had one prime goal at the root of its existence: to kill wildlife. It has for years gotten away with murder -- the murder of wolves, mountain lions, coyotes, bobcats, foxes, badgers -- as well as anything else that might be handy

    Career success: the role of teenage career aspirations, ambition value and gender in predicting adult social status and earnings

    Get PDF
    Links between family social background, teenage career aspirations, educational performance and adult social status attainment are well documented. Using a contextual developmental framework, this article extends previous research by examining the role of gender and teenage ambition value in shaping social status attainment and earnings in adulthood. Drawing on data from an 18-year British follow up study we tested a path model linking family background factors (such as family social status and parental aspirations) and individual agency factors in adolescence (in particular, career aspirations and ambition value) to social status attainment and earnings in adulthood. The findings suggest that ambition value is linked to adult earnings. That is, young people for whom it is important to get on in their job earn more money in adulthood than their less ambitious peers. The findings also confirm that teenage career aspirations are linked to adult social status attainment, and suggest that family background factors, teenage career aspirations and ambition value interact to influence social status attainment and earnings in adulthood. Gender differences are discusse
    corecore