38 research outputs found

    Multi-level Memory for Task Oriented Dialogs

    Full text link
    Recent end-to-end task oriented dialog systems use memory architectures to incorporate external knowledge in their dialogs. Current work makes simplifying assumptions about the structure of the knowledge base, such as the use of triples to represent knowledge, and combines dialog utterances (context) as well as knowledge base (KB) results as part of the same memory. This causes an explosion in the memory size, and makes the reasoning over memory harder. In addition, such a memory design forces hierarchical properties of the data to be fit into a triple structure of memory. This requires the memory reader to infer relationships across otherwise connected attributes. In this paper we relax the strong assumptions made by existing architectures and separate memories used for modeling dialog context and KB results. Instead of using triples to store KB results, we introduce a novel multi-level memory architecture consisting of cells for each query and their corresponding results. The multi-level memory first addresses queries, followed by results and finally each key-value pair within a result. We conduct detailed experiments on three publicly available task oriented dialog data sets and we find that our method conclusively outperforms current state-of-the-art models. We report a 15-25% increase in both entity F1 and BLEU scores.Comment: Accepted as full paper at NAACL 201

    Edge Replacement Grammars: A Formal Language Approach for Generating Graphs

    Full text link
    Graphs are increasingly becoming ubiquitous as models for structured data. A generative model that closely mimics the structural properties of a given set of graphs has utility in a variety of domains. Much of the existing work require that a large number of parameters, in fact exponential in size of the graphs, be estimated from the data. We take a slightly different approach to this problem, leveraging the extensive prior work in the formal graph grammar literature. In this paper, we propose a graph generation model based on Probabilistic Edge Replacement Grammars (PERGs). We propose a variant of PERG called Restricted PERG (RPERG), which is analogous to PCFGs in string grammar literature. With this restriction, we are able to derive a learning algorithm for estimating the parameters of the grammar from graph data. We empirically demonstrate on real life datasets that RPERGs outperform existing methods for graph generation. We improve on the performance of the state-of-the-art Hyperedge Replacement Grammar based graph generative model. Despite being a context free grammar, the proposed model is able to capture many of the structural properties of real networks, such as degree distributions, power law and spectral characteristics.Comment: To be presented at SIAM International Conference on Data Mining (SDM19). arXiv admin note: text overlap with arXiv:1802.08068, arXiv:1608.03192 by other author

    Reduced basis method for Boltzmann equation

    Get PDF
    Thesis (S.M.)--Massachusetts Institute of Technology, Computation for Design and Optimization Program, 2006.Includes bibliographical references (p. 103-106).The main aim of the project is to solve the BGK model of the Knudsen parameterized Boltzmann equation which is 1-d with respect to both space and velocity. In order to solve the Boltzmann equation, we first transform the original differential equation by replacing the dependent variable with another variable, weighted with function t(y); next we obtain a Petrov Galerkin weak form of this new transformed equation. To obtain a stable and accurate solution of this weak form, we perform a transformation of the velocity variable y, such that the semi-infinite domain is mapped into a finite domain; we choose the weighting function t(y), to balance contributions at infinity. Once we obtain an accurate and well defined finite element solution of the problem. The next step is to perform the reduced basis analysis of the equation using these accurate finite element solutions. We conclude the project by verifying that the orthonormal reduced Basis method based on the greedy algorithm converges rapidly over the chosen test space.by Revanth Reddy Garlapati.S.M

    Online Payment Module

    Get PDF
    The aim of this project is to deploy the online payment service in Moodle. All the major debit, credit and international card (transactions) can be accepted for payment. Online payment module prepares a web server that takes all types of transactions. This module can be enabled by the site administrator. If it is enabled, students can pay for their classes through online transactions. Administrator can set an individual price for a course if needed. It allows the user to create their own account and add optional account links. This project is important to resolve the issues for students and administrators to have an easy glance at the course registration like selection of their courses, Fee details. This project makes it easy for students to look for the courses and register, one can check the site as a guest and can create his/her own account and can enroll for subjects. One can see the fee details for each course

    Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization

    Full text link
    Open-domain Question Answering (OpenQA) aims at answering factual questions with an external large-scale knowledge corpus. However, real-world knowledge is not static; it updates and evolves continually. Such a dynamic characteristic of knowledge poses a vital challenge for these models, as the trained models need to constantly adapt to the latest information to make sure that the answers remain accurate. In addition, it is still unclear how well an OpenQA model can transfer to completely new knowledge domains. In this paper, we investigate the generalization performance of a retrieval-augmented QA model in two specific scenarios: 1) adapting to updated versions of the same knowledge corpus; 2) switching to completely different knowledge domains. We observe that the generalization challenges of OpenQA models stem from the reader's over-reliance on memorizing the knowledge from the external corpus, which hinders the model from generalizing to a new knowledge corpus. We introduce Corpus-Invariant Tuning (CIT), a simple but effective training strategy, to mitigate the knowledge over-memorization by controlling the likelihood of retrieved contexts during training. Extensive experimental results on multiple OpenQA benchmarks show that CIT achieves significantly better generalizability without compromising the model's performance in its original corpus and domain.Comment: Accepted to NAACL 2024 Finding

    Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations

    Full text link
    Open-domain dialog involves generating search queries that help obtain relevant knowledge for holding informative conversations. However, it can be challenging to determine what information to retrieve when the user is passive and does not express a clear need or request. To tackle this issue, we present a novel approach that focuses on generating internet search queries that are guided by social commonsense. Specifically, we leverage a commonsense dialog system to establish connections related to the conversation topic, which subsequently guides our query generation. Our proposed framework addresses passive user interactions by integrating topic tracking, commonsense response generation and instruction-driven query generation. Through extensive evaluations, we show that our approach overcomes limitations of existing query generation techniques that rely solely on explicit dialog information, and produces search queries that are more relevant, specific, and compelling, ultimately resulting in more engaging responses.Comment: Accepted in EMNLP 2023 Finding

    PLC Multi-robot Integration via Ethernet for Human Operated Quality Sampling

    Get PDF
    In automation, quality control inspection is a critical requirement to ensure product standards. The goal of this work is to insure product quality without interrupting the production line flow. The multi-robot system presented, connects a programmable logic controller (PLC), as the main controller, to a conveyor belt and two FANUC industrial robotic arms via EtherNet/IP. Human interaction is implemented to pick a work piece from the moving conveyor and return it with a quality label. This label is used by the PLC to execute the correct robot action; either to return the inspected part to the conveyor or discard it into the rejection bin. The operator uses a custom control panel connected to the PLC, which controls the conveyor and robot actions. The results show the feasibility of the presented multi robot automation line controlled by a PLC that allows human machine interaction to enable manual quality inspection during production. This paper details a student project developed in the advanced programmable logic controllers class. It is part of the master program in mechatronics. Students work in groups in a creative setting, where they learn to integrate various automation technologies and learn to write scientific publications
    corecore