276 research outputs found
Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation
We present a large-scale collection of diverse natural language inference
(NLI) datasets that help provide insight into how well a sentence
representation captures distinct types of reasoning. The collection results
from recasting 13 existing datasets from 7 semantic phenomena into a common NLI
structure, resulting in over half a million labeled context-hypothesis pairs in
total. We refer to our collection as the DNC: Diverse Natural Language
Inference Collection. The DNC is available online at https://www.decomp.net,
and will grow over time as additional resources are recast and added from novel
sources.Comment: To be presented at EMNLP 2018. 15 page
ParaBank: Monolingual Bitext Generation and Sentential Paraphrasing via Lexically-constrained Neural Machine Translation
We present ParaBank, a large-scale English paraphrase dataset that surpasses
prior work in both quantity and quality. Following the approach of ParaNMT, we
train a Czech-English neural machine translation (NMT) system to generate novel
paraphrases of English reference sentences. By adding lexical constraints to
the NMT decoding procedure, however, we are able to produce multiple
high-quality sentential paraphrases per source sentence, yielding an English
paraphrase resource with more than 4 billion generated tokens and exhibiting
greater lexical diversity. Using human judgments, we also demonstrate that
ParaBank's paraphrases improve over ParaNMT on both semantic similarity and
fluency. Finally, we use ParaBank to train a monolingual NMT model with the
same support for lexically-constrained decoding for sentence rewriting tasks.Comment: To be presented at AAAI 2019. 8 page
Idiosyncratic behaviour of tRNA-like structures in translation of plant viral RNA genomes
Supramolecular & Biomaterials Chemistr
Colloidal particles at a nematic-isotropic interface: effects of confinement
When captured by a flat nematic-isotropic interface, colloidal particles can
be dragged by it. As a result spatially periodic structures may appear, with
the period depending on a particle mass, size, and interface
velocity~\cite{west.jl:2002}. If liquid crystal is sandwiched between two
substrates, the interface takes a wedge-like shape, accommodating the
interface-substrate contact angle and minimizing the director distortions on
its nematic side. Correspondingly, particles move along complex trajectories:
they are first captured by the interface and then `glide' towards its vertex
point. Our experiments quantify this scenario, and numerical minimization of
the Landau-de Gennes free energy allow for a qualitative description of the
interfacial structure and the drag force.Comment: 7 pages, 9 figure
Two-Qubit Gate Set Tomography with Fewer Circuits
Gate set tomography (GST) is a self-consistent and highly accurate method for
the tomographic reconstruction of a quantum information processor's quantum
logic operations, including gates, state preparations, and measurements.
However, GST's experimental cost grows exponentially with qubit number. For
characterizing even just two qubits, a standard GST experiment may have tens of
thousands of circuits, making it prohibitively expensive for platforms. We show
that, because GST experiments are massively overcomplete, many circuits can be
discarded. This dramatically reduces GST's experimental cost while still
maintaining GST's Heisenberg-like scaling in accuracy. We show how to exploit
the structure of GST circuits to determine which ones are superfluous. We
confirm the efficacy of the resulting experiment designs both through numerical
simulations and via the Fisher information for said designs. We also explore
the impact of these techniques on the prospects of three-qubit GST.Comment: 46 pages, 13 figures. V2: Minor edits to acknowledgment
- …