625 research outputs found

    Bridge Correlational Neural Networks for Multilingual Multimodal Representation Learning

    Full text link
    Recently there has been a lot of interest in learning common representations for multiple views of data. Typically, such common representations are learned using a parallel corpus between the two views (say, 1M images and their English captions). In this work, we address a real-world scenario where no direct parallel data is available between two views of interest (say, V1V_1 and V2V_2) but parallel data is available between each of these views and a pivot view (V3V_3). We propose a model for learning a common representation for V1V_1, V2V_2 and V3V_3 using only the parallel data available between V1V3V_1V_3 and V2V3V_2V_3. The proposed model is generic and even works when there are nn views of interest and only one pivot view which acts as a bridge between them. There are two specific downstream applications that we focus on (i) transfer learning between languages L1L_1,L2L_2,...,LnL_n using a pivot language LL and (ii) cross modal access between images and a language L1L_1 using a pivot language L2L_2. Our model achieves state-of-the-art performance in multilingual document classification on the publicly available multilingual TED corpus and promising results in multilingual multimodal retrieval on a new dataset created and released as a part of this work.Comment: Published at NAACL-HLT 201

    The Cosmological Kibble Mechanism in the Laboratory: String Formation in Liquid Crystals

    Get PDF
    We have observed the production of strings (disclination lines and loops) via the Kibble mechanism of domain (bubble) formation in the isotropic to nematic phase transition of a sample of uniaxial nematic liquid crystal. The probablity of string formation per bubble is measured to be 0.33±0.010.33 \pm 0.01. This is in good agreement with the theoretical value 1/π1/ \pi expected in two dimensions for the order parameter space S2/Z2S^2/{\bf Z}_2 of a simple uniaxial nematic liquid crystal.Comment: 17 pages, in TEX, 2 figures (not included, available on request

    A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation

    Full text link
    Interlingua based Machine Translation (MT) aims to encode multiple languages into a common linguistic representation and then decode sentences in multiple target languages from this representation. In this work we explore this idea in the context of neural encoder decoder architectures, albeit on a smaller scale and without MT as the end goal. Specifically, we consider the case of three languages or modalities X, Z and Y wherein we are interested in generating sequences in Y starting from information available in X. However, there is no parallel training data available between X and Y but, training data is available between X & Z and Z & Y (as is often the case in many real world applications). Z thus acts as a pivot/bridge. An obvious solution, which is perhaps less elegant but works very well in practice is to train a two stage model which first converts from X to Z and then from Z to Y. Instead we explore an interlingua inspired solution which jointly learns to do the following (i) encode X and Z to a common representation and (ii) decode Y from this common representation. We evaluate our model on two tasks: (i) bridge transliteration and (ii) bridge captioning. We report promising results in both these applications and believe that this is a right step towards truly interlingua inspired encoder decoder architectures.Comment: 10 page

    Age and Mass for 920 LMC Clusters Derived from 100 Million Monte Carlo Simulations

    Full text link
    We present new age and mass estimates for 920 stellar clusters in the Large Magellanic Cloud (LMC) based on previously published broad-band photometry and the stellar cluster analysis package, MASSCLEANage. Expressed in the generic fitting formula, d^{2}N/dM dt ~ M^{\alpha} t^{\beta}, the distribution of observed clusters is described by \alpha = -1.5 to -1.6 and \beta = -2.1 to -2.2. For 288 of these clusters, ages have recently been determined based on stellar photometric color-magnitude diagrams, allowing us to gauge the confidence of our ages. The results look very promising, opening up the possibility that this sample of 920 clusters, with reliable and consistent age, mass and photometric measures, might be used to constrain important characteristics about the stellar cluster population in the LMC. We also investigate a traditional age determination method that uses a \chi^2 minimization routine to fit observed cluster colors to standard infinite mass limit simple stellar population models. This reveals serious defects in the derived cluster age distribution using this method. The traditional \chi^2 minimization method, due to the variation of U,B,V,R colors, will always produce an overdensity of younger and older clusters, with an underdensity of clusters in the log(age/yr)=[7.0,7.5] range. Finally, we present a unique simulation aimed at illustrating and constraining the fading limit in observed cluster distributions that includes the complex effects of stochastic variations in the observed properties of stellar clusters.Comment: Accepted for publication in The Astrophysical Journal, 37 pages, 18 figure

    Transfer Learning for Multi-language Twitter Election Classification

    Get PDF
    Both politicians and citizens are increasingly embracing social media as a means to disseminate information and comment on various topics, particularly during significant political events, such as elections. Such commentary during elections is also of interest to social scientists and pollsters. To facilitate the study of social media during elections, there is a need to automatically identify posts that are topically related to those elections. However, current studies have focused on elections within English-speaking regions, and hence the resultant election content classifiers are only applicable for elections in countries where the predominant language is English. On the other hand, as social media is becoming more prevalent worldwide, there is an increasing need for election classifiers that can be generalised across different languages, without building a training dataset for each election. In this paper, based upon transfer learning, we study the development of effective and reusable election classifiers for use on social media across multiple languages. We combine transfer learning with different classifiers such as Support Vector Machines (SVM) and state-of-the-art Convolutional Neural Networks (CNN), which make use of word embedding representations for each social media post. We generalise the learned classifier models for cross-language classification by using a linear translation approach to map the word embedding vectors from one language into another. Experiments conducted over two election datasets in different languages show that without using any training data from the target language, linear translations outperform a classical transfer learning approach, namely Transfer Component Analysis (TCA), by 80% in recall and 25% in F1 measure

    Dynamic instabilities of fracture under biaxial strain using a phase field model

    Full text link
    We present a phase field model of the propagation of fracture under plane strain. This model, based on simple physical considerations, is able to accurately reproduce the different behavior of cracks (the principle of local symmetry, the Griffith and Irwin criteria, and mode-I branching). In addition, we test our model against recent experimental findings showing the presence of oscillating cracks under bi-axial load. Our model again reproduces well observed supercritical Hopf bifurcation, and is therefore the first simulation which does so
    • …
    corecore