38 research outputs found
Multi-level Memory for Task Oriented Dialogs
Recent end-to-end task oriented dialog systems use memory architectures to
incorporate external knowledge in their dialogs. Current work makes simplifying
assumptions about the structure of the knowledge base, such as the use of
triples to represent knowledge, and combines dialog utterances (context) as
well as knowledge base (KB) results as part of the same memory. This causes an
explosion in the memory size, and makes the reasoning over memory harder. In
addition, such a memory design forces hierarchical properties of the data to be
fit into a triple structure of memory. This requires the memory reader to infer
relationships across otherwise connected attributes. In this paper we relax the
strong assumptions made by existing architectures and separate memories used
for modeling dialog context and KB results. Instead of using triples to store
KB results, we introduce a novel multi-level memory architecture consisting of
cells for each query and their corresponding results. The multi-level memory
first addresses queries, followed by results and finally each key-value pair
within a result. We conduct detailed experiments on three publicly available
task oriented dialog data sets and we find that our method conclusively
outperforms current state-of-the-art models. We report a 15-25% increase in
both entity F1 and BLEU scores.Comment: Accepted as full paper at NAACL 201
Edge Replacement Grammars: A Formal Language Approach for Generating Graphs
Graphs are increasingly becoming ubiquitous as models for structured data. A
generative model that closely mimics the structural properties of a given set
of graphs has utility in a variety of domains. Much of the existing work
require that a large number of parameters, in fact exponential in size of the
graphs, be estimated from the data. We take a slightly different approach to
this problem, leveraging the extensive prior work in the formal graph grammar
literature. In this paper, we propose a graph generation model based on
Probabilistic Edge Replacement Grammars (PERGs). We propose a variant of PERG
called Restricted PERG (RPERG), which is analogous to PCFGs in string grammar
literature. With this restriction, we are able to derive a learning algorithm
for estimating the parameters of the grammar from graph data. We empirically
demonstrate on real life datasets that RPERGs outperform existing methods for
graph generation. We improve on the performance of the state-of-the-art
Hyperedge Replacement Grammar based graph generative model. Despite being a
context free grammar, the proposed model is able to capture many of the
structural properties of real networks, such as degree distributions, power law
and spectral characteristics.Comment: To be presented at SIAM International Conference on Data Mining
(SDM19). arXiv admin note: text overlap with arXiv:1802.08068,
arXiv:1608.03192 by other author
Reduced basis method for Boltzmann equation
Thesis (S.M.)--Massachusetts Institute of Technology, Computation for Design and Optimization Program, 2006.Includes bibliographical references (p. 103-106).The main aim of the project is to solve the BGK model of the Knudsen parameterized Boltzmann equation which is 1-d with respect to both space and velocity. In order to solve the Boltzmann equation, we first transform the original differential equation by replacing the dependent variable with another variable, weighted with function t(y); next we obtain a Petrov Galerkin weak form of this new transformed equation. To obtain a stable and accurate solution of this weak form, we perform a transformation of the velocity variable y, such that the semi-infinite domain is mapped into a finite domain; we choose the weighting function t(y), to balance contributions at infinity. Once we obtain an accurate and well defined finite element solution of the problem. The next step is to perform the reduced basis analysis of the equation using these accurate finite element solutions. We conclude the project by verifying that the orthonormal reduced Basis method based on the greedy algorithm converges rapidly over the chosen test space.by Revanth Reddy Garlapati.S.M
Online Payment Module
The aim of this project is to deploy the online payment service in Moodle. All the major debit, credit and international card (transactions) can be accepted for payment. Online payment module prepares a web server that takes all types of transactions. This module can be enabled by the site administrator. If it is enabled, students can pay for their classes through online transactions. Administrator can set an individual price for a course if needed. It allows the user to create their own account and add optional account links. This project is important to resolve the issues for students and administrators to have an easy glance at the course registration like selection of their courses, Fee details. This project makes it easy for students to look for the courses and register, one can check the site as a guest and can create his/her own account and can enroll for subjects. One can see the fee details for each course
Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization
Open-domain Question Answering (OpenQA) aims at answering factual questions
with an external large-scale knowledge corpus. However, real-world knowledge is
not static; it updates and evolves continually. Such a dynamic characteristic
of knowledge poses a vital challenge for these models, as the trained models
need to constantly adapt to the latest information to make sure that the
answers remain accurate. In addition, it is still unclear how well an OpenQA
model can transfer to completely new knowledge domains. In this paper, we
investigate the generalization performance of a retrieval-augmented QA model in
two specific scenarios: 1) adapting to updated versions of the same knowledge
corpus; 2) switching to completely different knowledge domains. We observe that
the generalization challenges of OpenQA models stem from the reader's
over-reliance on memorizing the knowledge from the external corpus, which
hinders the model from generalizing to a new knowledge corpus. We introduce
Corpus-Invariant Tuning (CIT), a simple but effective training strategy, to
mitigate the knowledge over-memorization by controlling the likelihood of
retrieved contexts during training. Extensive experimental results on multiple
OpenQA benchmarks show that CIT achieves significantly better generalizability
without compromising the model's performance in its original corpus and domain.Comment: Accepted to NAACL 2024 Finding
Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations
Open-domain dialog involves generating search queries that help obtain
relevant knowledge for holding informative conversations. However, it can be
challenging to determine what information to retrieve when the user is passive
and does not express a clear need or request. To tackle this issue, we present
a novel approach that focuses on generating internet search queries that are
guided by social commonsense. Specifically, we leverage a commonsense dialog
system to establish connections related to the conversation topic, which
subsequently guides our query generation. Our proposed framework addresses
passive user interactions by integrating topic tracking, commonsense response
generation and instruction-driven query generation. Through extensive
evaluations, we show that our approach overcomes limitations of existing query
generation techniques that rely solely on explicit dialog information, and
produces search queries that are more relevant, specific, and compelling,
ultimately resulting in more engaging responses.Comment: Accepted in EMNLP 2023 Finding
PLC Multi-robot Integration via Ethernet for Human Operated Quality Sampling
In automation, quality control inspection is a critical requirement to ensure product standards. The goal of this work is to insure product quality without interrupting the production line flow. The multi-robot system presented, connects a programmable logic controller (PLC), as the main controller, to a conveyor belt and two FANUC industrial robotic arms via EtherNet/IP. Human interaction is implemented to pick a work piece from the moving conveyor and return it with a quality label. This label is used by the PLC to execute the correct robot action; either to return the inspected part to the conveyor or discard it into the rejection bin. The operator uses a custom control panel connected to the PLC, which controls the conveyor and robot actions. The results show the feasibility of the presented multi robot automation line controlled by a PLC that allows human machine interaction to enable manual quality inspection during production. This paper details a student project developed in the advanced programmable logic controllers class. It is part of the master program in mechatronics. Students work in groups in a creative setting, where they learn to integrate various automation technologies and learn to write scientific publications