    AugCSE: contrastive sentence embedding with diverse augmentations

    Data augmentation techniques have proven useful in many NLP applications. Most augmentations, however, are task-specific and cannot be used as a general-purpose tool. In our work, we present AugCSE, a unified framework for exploiting diverse sets of data augmentations to achieve a better, general-purpose sentence embedding model. Building upon the latest sentence embedding models, our approach uses a simple antagonistic discriminator that differentiates between augmentation types. With a finetuning objective borrowed from domain adaptation, we show that diverse augmentations, which often produce conflicting contrastive signals, can be tamed to yield a better and more robust sentence representation. Our method achieves state-of-the-art results on downstream transfer tasks and performs competitively on semantic textual similarity tasks, using only unsupervised data. (University of California, Berkeley; first author draft; https://aclanthology.org/2022.aacl-main.30/)
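
    The abstract above describes two ingredients: a contrastive objective over augmented sentence pairs and an antagonistic discriminator, trained with a domain-adaptation-style objective, that tries to tell augmentation types apart. A minimal PyTorch sketch of how such a combined loss could be wired up is shown below; the gradient-reversal layer, the InfoNCE formulation, and the loss weighting are illustrative assumptions, not AugCSE's actual implementation.

```python
import torch
import torch.nn.functional as F


class GradReverse(torch.autograd.Function):
    """Gradient-reversal layer, the standard trick from domain adaptation."""

    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lambd * grad_output, None


def info_nce(anchor, positive, temperature=0.05):
    """Contrastive loss: each sentence's positive is its own augmentation."""
    sims = F.cosine_similarity(anchor.unsqueeze(1), positive.unsqueeze(0), dim=-1)
    labels = torch.arange(anchor.size(0), device=anchor.device)
    return F.cross_entropy(sims / temperature, labels)


def augcse_style_loss(discriminator, sent_emb, aug_emb, aug_type_ids, lambd=0.1):
    """Contrastive term plus an antagonistic augmentation-type discriminator.
    Gradient reversal pushes the encoder to make augmentation types
    indistinguishable while the discriminator tries to separate them."""
    contrastive = info_nce(sent_emb, aug_emb)
    disc_logits = discriminator(GradReverse.apply(aug_emb, lambd))
    disc_loss = F.cross_entropy(disc_logits, aug_type_ids)
    return contrastive + disc_loss
```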

    Software similarity measurements using UML diagrams: A systematic literature review

    Every piece of software uses a model to derive its operational, auxiliary, and functional procedures. The Unified Modeling Language (UML) is a standard modeling language for specifying, documenting, and constructing a software product. Several algorithms have been used by researchers to measure similarity between UML artifacts; however, no literature study has yet surveyed measurements of UML diagram similarity. This paper presents the results of a systematic literature review of similarity measurements between the UML diagrams of different software products. The study reviews and identifies similarity measurements of UML artifacts, with class diagrams, sequence diagrams, statechart diagrams, and use case diagrams being the UML diagrams most widely used as research objects for measuring similarity. Measuring similarity enables resolution of the problem domains of software reuse, similarity measurement, and clone detection. The instruments used to measure similarity are semantic and structural similarity. The findings indicate opportunities for future research: calculating similarity for other UML diagrams, compiling calculation information for each diagram, adapting semantic and structural similarity calculation methods, determining the best weight for each item in a diagram, testing newly proposed methods, and building or finding good datasets for use as testing material.
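
    The review's two instruments, semantic and structural similarity, can be illustrated with a small, hypothetical sketch for class diagrams: name similarity stands in for the semantic side, member overlap for the structural side, and a weighted sum combines them. The dictionary encoding, the weights, and the use of string similarity as a semantic proxy are assumptions for illustration, not a method taken from any reviewed paper.

```python
from difflib import SequenceMatcher


def name_similarity(a: str, b: str) -> float:
    """Lexical stand-in for semantic similarity between two identifiers."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def class_similarity(class_a: dict, class_b: dict, w_sem=0.5, w_struct=0.5) -> float:
    """Weighted combination of name (semantic) and member-overlap (structural) similarity."""
    sem = name_similarity(class_a["name"], class_b["name"])
    members_a = set(class_a["attributes"]) | set(class_a["operations"])
    members_b = set(class_b["attributes"]) | set(class_b["operations"])
    union = members_a | members_b
    struct = len(members_a & members_b) / len(union) if union else 1.0
    return w_sem * sem + w_struct * struct


# Two classes from different class diagrams, encoded as plain dicts.
order_a = {"name": "Order", "attributes": ["id", "date"], "operations": ["total()"]}
order_b = {"name": "Orders", "attributes": ["id", "created"], "operations": ["total()"]}
print(round(class_similarity(order_a, order_b), 3))
```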

    Combining Representation Learning with Logic for Language Processing

    The current state-of-the-art in many natural language processing and automated knowledge base completion tasks is held by representation learning methods which learn distributed vector representations of symbols via gradient-based optimization. They require little or no hand-crafted features, thus avoiding the need for most preprocessing steps and task-specific assumptions. However, in many cases representation learning requires a large amount of annotated training data to generalize well to unseen data. Such labeled training data is provided by human annotators who often use formal logic as the language for specifying annotations. This thesis investigates different combinations of representation learning methods with logic for reducing the need for annotated training data, and for improving generalization. Comment: PhD Thesis, University College London, Submitted and accepted in 201

    Four Lessons in Versatility or How Query Languages Adapt to the Web

    Exposing not only human-centered information but also machine-processable data on the Web is one of the commonalities of recent Web trends. It has enabled new kinds of applications and businesses where the data is used in ways not foreseen by the data providers. Yet this exposition has fractured the Web into islands of data, each in a different Web format: some providers choose XML, others RDF, again others JSON or OWL, for their data, even in similar domains. This fracturing stifles innovation, as application builders have to cope not with one Web stack (e.g., XML technology) but with several, each of considerable complexity. With Xcerpt we have developed a rule- and pattern-based query language that aims to shield application builders from much of this complexity: in a single query language, XML and RDF data can be accessed, processed, combined, and re-published. Though the need for combined access to XML and RDF data has been recognized in previous work (including the W3C’s GRDDL), our approach differs in four main aspects: (1) We provide a single language (rather than two separate or embedded languages), thus minimizing the conceptual overhead of dealing with disparate data formats. (2) Both the declarative (logic-based) and the operational semantics are unified in that they apply to querying XML and RDF in the same way. (3) We show that the resulting query language can be implemented reusing traditional database technology, if desirable. Nevertheless, we also give a unified evaluation approach based on interval labelings of graphs that is at least as fast as existing approaches for tree-shaped XML data, yet provides linear-time and linear-space querying also for many RDF graphs. We believe that Web query languages are the right tool for declarative data access in Web applications and that Xcerpt is a significant step towards more convenient, yet highly efficient, data access in a “Web of Data”.
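
    The "interval labelings of graphs" mentioned at the end refer to a classical indexing idea; the sketch below shows the tree-shaped (XML) special case, where a single pre/post-order traversal assigns each node an interval so that ancestor tests take constant time. The tree encoding and function names are illustrative and not Xcerpt's actual data structures.

```python
def interval_label(children, root):
    """Assign (pre, post) labels in one linear-time traversal so that u is an
    ancestor of v iff pre[u] < pre[v] and post[v] < post[u]."""
    pre, post, counter = {}, {}, [0]

    def visit(node):
        pre[node] = counter[0]
        counter[0] += 1
        for child in children.get(node, []):
            visit(child)
        post[node] = counter[0]
        counter[0] += 1

    visit(root)
    return pre, post


def is_ancestor(pre, post, u, v):
    """Constant-time ancestor check using the interval labels."""
    return pre[u] < pre[v] and post[v] < post[u]


# Example XML-like tree: book -> (title, chapter -> section)
children = {"book": ["title", "chapter"], "chapter": ["section"]}
pre, post = interval_label(children, "book")
print(is_ancestor(pre, post, "book", "section"))   # True
print(is_ancestor(pre, post, "title", "section"))  # False
```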

    Hyperbolic Deep Neural Networks: A Survey

    Recently, there has been a rising surge of momentum for deep representation learning in hyperbolic spaces, due to their high capacity for modeling data with hierarchical structure, such as knowledge graphs or synonym hierarchies. We refer to such models as hyperbolic deep neural networks in this paper. Such a hyperbolic neural architecture potentially leads to a drastically more compact model with much more physical interpretability than its counterpart in Euclidean space. To stimulate future research, this paper presents a coherent and comprehensive review of the literature on the neural components used in the construction of hyperbolic deep neural networks, as well as the generalization of the leading deep approaches to hyperbolic space. It also presents current applications across various machine learning tasks on several publicly available datasets, together with insightful observations, and identifies open questions and promising future directions.
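
    As a concrete piece of the machinery such networks build on, the geodesic distance in the Poincaré ball model (one of the standard models of hyperbolic space) is d(x, y) = arccosh(1 + 2·‖x − y‖² / ((1 − ‖x‖²)(1 − ‖y‖²))). The NumPy sketch below simply evaluates this formula; full hyperbolic layers additionally need exponential/logarithmic maps and Möbius operations, which the survey covers.

```python
import numpy as np


def poincare_distance(x, y, eps=1e-9):
    """Geodesic distance between two points inside the unit Poincare ball."""
    x, y = np.asarray(x, dtype=float), np.asarray(y, dtype=float)
    sq_diff = np.sum((x - y) ** 2)
    denom = (1.0 - np.sum(x ** 2)) * (1.0 - np.sum(y ** 2))
    return np.arccosh(1.0 + 2.0 * sq_diff / (denom + eps))


# Distances blow up near the boundary of the ball, which is what makes the
# space well suited to embedding trees and other hierarchies compactly.
print(poincare_distance([0.0, 0.0], [0.5, 0.0]))
print(poincare_distance([0.9, 0.0], [0.0, 0.9]))
```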

    Language and Logic in Wittgenstein's Tractatus Logico-Philosophicus

    This thesis discusses some central aspects of Wittgenstein’s conception of language and logic in his Tractatus Logico-Philosophicus and brings them into relation with the philosophies of Frege and Russell. The main contention is that a fruitful way of understanding the Tractatus is to see it as responding to tensions in Frege’s conception of logic and Russell’s theory of judgement. In the thesis the philosophy of the Tractatus is presented as developing from these two strands of criticism and thus as the culmination of the philosophy of logic and language developed in the early analytic period. Part one examines relevant features of Frege’s philosophy of logic. Besides shedding light on Frege’s philosophy in its own right, it aims at preparing the ground for a discussion of those aspects of the Tractatus’ conception of logic which derive from Wittgenstein’s critical response to Frege. Part two first presents Russell’s early view on truth and judgement, before considering several variants of the multiple relation theory of judgement, devised in opposition to it. Part three discusses the development of Wittgenstein’s conception of language and logic, beginning with Wittgenstein’s criticism of the multiple relation theory and his early theory of sense, seen as containing the seeds of the picture theory of propositions presented in the Tractatus. I then consider the relation between Wittgenstein’s pictorial conception of language and his conception of logic, arguing that Wittgenstein’s understanding of sense in terms of bipolarity grounds his view of logical complexity and of the essence of logic as a whole. This view, I show, is free from the internal tensions that affect Frege’s understanding of the nature of logic

    HIT and brain reward function: a case of mistaken identity (theory)

    This paper employs a case study from the history of neuroscience—brain reward function—to scrutinize the inductive argument for the so-called ‘Heuristic Identity Theory’ (HIT). The case fails to support HIT, illustrating why other case studies previously thought to provide empirical support for HIT also fold under scrutiny. After distinguishing two different ways of understanding the types of identity claims presupposed by HIT and considering other conceptual problems, we conclude that HIT is not an alternative to the traditional identity theory so much as a relabeling of previously discussed strategies for mechanistic discovery

    Weisfeiler and Leman go Hyperbolic: Learning Distance Preserving Node Representations

    In recent years, graph neural networks (GNNs) have emerged as a promising tool for solving machine learning problems on graphs. Most GNNs are members of the family of message passing neural networks (MPNNs). There is a close connection between these models and the Weisfeiler-Leman (WL) test of isomorphism, an algorithm that can successfully test isomorphism for a broad class of graphs. Recently, much research has focused on measuring the expressive power of GNNs. For instance, it has been shown that standard MPNNs are at most as powerful as WL in terms of distinguishing non-isomorphic graphs. However, these studies have largely ignored the distances between the representations of nodes/graphs which are of paramount importance for learning tasks. In this paper, we define a distance function between nodes which is based on the hierarchy produced by the WL algorithm, and propose a model that learns representations which preserve those distances between nodes. Since the emerging hierarchy corresponds to a tree, to learn these representations, we capitalize on recent advances in the field of hyperbolic neural networks. We empirically evaluate the proposed model on standard node and graph classification datasets where it achieves competitive performance with state-of-the-art models
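
    The hierarchy the paper builds its distance on comes from the Weisfeiler-Leman test. A minimal sketch of 1-WL color refinement, the routine that produces that hierarchy, is shown below; the relabeling scheme is a simplified stand-in, and the paper's own hierarchy and distance construction may differ in detail.

```python
def wl_color_refinement(adj, num_iterations=3):
    """1-WL: repeatedly relabel each node by its color plus the multiset of its
    neighbors' colors. Nodes whose colors diverge at some iteration are
    distinguishable by 1-WL; the iteration of divergence induces a tree-shaped
    hierarchy over the nodes."""
    colors = {v: 0 for v in adj}                      # uniform initial coloring
    history = [dict(colors)]
    for _ in range(num_iterations):
        signatures = {
            v: (colors[v], tuple(sorted(colors[u] for u in adj[v]))) for v in adj
        }
        relabel = {sig: i for i, sig in enumerate(sorted(set(signatures.values())))}
        colors = {v: relabel[signatures[v]] for v in adj}
        history.append(dict(colors))
    return history


# Example: a path a-b-c-d. 1-WL cannot tell a from d, nor b from c, so a
# standard MPNN would assign them identical representations as well.
adj = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"]}
for step, coloring in enumerate(wl_color_refinement(adj)):
    print(step, coloring)
```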