21,145 research outputs found

    Modelling string structure in vector spaces

    Get PDF
    Searching for similar strings is an important and frequent database task both in terms of human interactions and in absolute world-wide CPU utilisation. A wealth of metric functions for string comparison exist. However, with respect to the wide range of classification and other techniques known within vector spaces, such metrics allow only a very restricted range of techniques. To counter this restriction, various strategies have been used for mapping string spaces into vector spaces, approximating the string distances within the mapped space and therefore allowing vector space techniques to be used. In previous work we have developed a novel technique for mapping metric spaces into vector spaces, which can therefore be applied for this purpose. In this paper we evaluate this technique in the context of string spaces, and compare it to other published techniques for mapping strings to vectors. We use a publicly available English lexicon as our experimental data set, and test two different string metrics over it for each vector mapping. We find that our novel technique considerably outperforms previously used technique in preserving the actual distance.Publisher PD

    Lambek vs. Lambek: Functorial Vector Space Semantics and String Diagrams for Lambek Calculus

    Full text link
    The Distributional Compositional Categorical (DisCoCat) model is a mathematical framework that provides compositional semantics for meanings of natural language sentences. It consists of a computational procedure for constructing meanings of sentences, given their grammatical structure in terms of compositional type-logic, and given the empirically derived meanings of their words. For the particular case that the meaning of words is modelled within a distributional vector space model, its experimental predictions, derived from real large scale data, have outperformed other empirically validated methods that could build vectors for a full sentence. This success can be attributed to a conceptually motivated mathematical underpinning, by integrating qualitative compositional type-logic and quantitative modelling of meaning within a category-theoretic mathematical framework. The type-logic used in the DisCoCat model is Lambek's pregroup grammar. Pregroup types form a posetal compact closed category, which can be passed, in a functorial manner, on to the compact closed structure of vector spaces, linear maps and tensor product. The diagrammatic versions of the equational reasoning in compact closed categories can be interpreted as the flow of word meanings within sentences. Pregroups simplify Lambek's previous type-logic, the Lambek calculus, which has been extensively used to formalise and reason about various linguistic phenomena. The apparent reliance of the DisCoCat on pregroups has been seen as a shortcoming. This paper addresses this concern, by pointing out that one may as well realise a functorial passage from the original type-logic of Lambek, a monoidal bi-closed category, to vector spaces, or to any other model of meaning organised within a monoidal bi-closed category. The corresponding string diagram calculus, due to Baez and Stay, now depicts the flow of word meanings.Comment: 29 pages, pending publication in Annals of Pure and Applied Logi

    Finsler and Lagrange Geometries in Einstein and String Gravity

    Full text link
    We review the current status of Finsler-Lagrange geometry and generalizations. The goal is to aid non-experts on Finsler spaces, but physicists and geometers skilled in general relativity and particle theories, to understand the crucial importance of such geometric methods for applications in modern physics. We also would like to orient mathematicians working in generalized Finsler and Kahler geometry and geometric mechanics how they could perform their results in order to be accepted by the community of ''orthodox'' physicists. Although the bulk of former models of Finsler-Lagrange spaces where elaborated on tangent bundles, the surprising result advocated in our works is that such locally anisotropic structures can be modelled equivalently on Riemann-Cartan spaces, even as exact solutions in Einstein and/or string gravity, if nonholonomic distributions and moving frames of references are introduced into consideration. We also propose a canonical scheme when geometrical objects on a (pseudo) Riemannian space are nonholonomically deformed into generalized Lagrange, or Finsler, configurations on the same manifold. Such canonical transforms are defined by the coefficients of a prime metric and generate target spaces as Lagrange structures, their models of almost Hermitian/ Kahler, or nonholonomic Riemann spaces. Finally, we consider some classes of exact solutions in string and Einstein gravity modelling Lagrange-Finsler structures with solitonic pp-waves and speculate on their physical meaning.Comment: latex 2e, 11pt, 44 pages; accepted to IJGMMP (2008) as a short variant of arXiv:0707.1524v3, on 86 page

    Port controlled Hamiltonian representation of distributed parameter systems

    Get PDF
    A port controlled Hamiltonian formulation of the dynamics of distributed parameter systems is presented, which incorporates the energy flow through the boundary of the domain of the system, and which allows to represent the system as a boundary control Hamiltonian system. This port controlled Hamiltonian system is defined with respect to a Dirac structure associated with the exterior derivative and based on Stokes' theorem. The definition is illustrated on the examples of the telegrapher's equations, Maxwell's equations and the vibrating string. \u

    Locally Anisotropic Kinetic Processes and Thermodynamics in Curved Spaces

    Get PDF
    The kinetic theory is formulated with respect to anholonomic frames of reference on curved spacetimes. By using the concept of nonlinear connection we develop an approach to modelling locally anisotropic kinetic processes and, in corresponding limits, the relativistic non-equilibrium thermodynamics with local anisotropy. This lead to a unified formulation of the kinetic equations on (pseudo) Riemannian spaces and in various higher dimensional models of Kaluza-Klein type and/or generalized Lagrange and Finsler spaces. The transition rate considered for the locally anisotropic transport equations is related to the differential cross section and spacetime parameters of anisotropy. The equations of states for pressure and energy in locally anisotropic thermodynamics are derived. The obtained general expressions for heat conductivity, shear and volume viscosity coefficients are applied to determine the transport coefficients of cosmic fluids in spacetimes with generic local anisotropy. We emphasize that such locally anisotropic structures are induced also in general relativity if we are modelling physical processes with respect to frames with mixed sets of holonomic and anholonomic basis vectors which naturally admits an associated nonlinear connection structure.Comment: version 2, 46 pages, latex 209, minor changes, accepted to Annals of Physics (NY

    Mathematical Foundations for a Compositional Distributional Model of Meaning

    Full text link
    We propose a mathematical framework for a unification of the distributional theory of meaning in terms of vector space models, and a compositional theory for grammatical types, for which we rely on the algebra of Pregroups, introduced by Lambek. This mathematical framework enables us to compute the meaning of a well-typed sentence from the meanings of its constituents. Concretely, the type reductions of Pregroups are `lifted' to morphisms in a category, a procedure that transforms meanings of constituents into a meaning of the (well-typed) whole. Importantly, meanings of whole sentences live in a single space, independent of the grammatical structure of the sentence. Hence the inner-product can be used to compare meanings of arbitrary sentences, as it is for comparing the meanings of words in the distributional model. The mathematical structure we employ admits a purely diagrammatic calculus which exposes how the information flows between the words in a sentence in order to make up the meaning of the whole sentence. A variation of our `categorical model' which involves constraining the scalars of the vector spaces to the semiring of Booleans results in a Montague-style Boolean-valued semantics.Comment: to appea
    corecore