744 research outputs found
Improving Intent Classication By Automatic Data Augmentation Using Word Sense Disambiguation
abstract: Virtual digital assistants are automated software systems which assist humans by understanding natural languages such as English, either in voice or textual form. In recent times, a lot of digital applications have shifted towards providing a user experience using natural language interface. The change is brought up by the degree of ease with which the virtual digital assistants such as Google Assistant and Amazon Alexa can be integrated into your application. These assistants make use of a Natural Language Understanding (NLU) system which acts as an interface to translate unstructured natural language data into a structured form. Such an NLU system uses an intent finding algorithm which gives a high-level idea or meaning of a user query, termed as intent classification. The intent classification step identifies the action(s) that a user wants the assistant to perform. The intent classification step is followed by an entity recognition step in which the entities in the utterance are identified on which the intended action is performed. This step can be viewed as a sequence labeling task which maps an input word sequence into a corresponding sequence of slot labels. This step is also termed as slot filling.
In this thesis, we improve the intent classification and slot filling in the virtual voice agents by automatic data augmentation. Spoken Language Understanding systems face the issue of data sparsity. The reason behind this is that it is hard for a human-created training sample to represent all the patterns in the language. Due to the lack of relevant data, deep learning methods are unable to generalize the Spoken Language Understanding model. This thesis expounds a way to overcome the issue of data sparsity in deep learning approaches on Spoken Language Understanding tasks. Here we have described the limitations in the current intent classifiers and how the proposed algorithm uses existing knowledge bases to overcome those limitations. The method helps in creating a more robust intent classifier and slot filling system.Dissertation/ThesisMasters Thesis Computer Science 201
Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms
This paper studies a new task of federated learning (FL) for semantic
parsing, where multiple clients collaboratively train one global model without
sharing their semantic parsing data. By leveraging data from multiple clients,
the FL paradigm can be especially beneficial for clients that have little
training data to develop a data-hungry neural semantic parser on their own. We
propose an evaluation setup to study this task, where we re-purpose widely-used
single-domain text-to-SQL datasets as clients to form a realistic heterogeneous
FL setting and collaboratively train a global model. As standard FL algorithms
suffer from the high client heterogeneity in our realistic setup, we further
propose a novel LOss Reduction Adjusted Re-weighting (Lorar) mechanism to
mitigate the performance degradation, which adjusts each client's contribution
to the global model update based on its training loss reduction during each
round. Our intuition is that the larger the loss reduction, the further away
the current global model is from the client's local optimum, and the larger
weight the client should get. By applying Lorar to three widely adopted FL
algorithms (FedAvg, FedOPT and FedProx), we observe that their performance can
be improved substantially on average (4%-20% absolute gain under MacroAvg) and
that clients with smaller datasets enjoy larger performance gains. In addition,
the global model converges faster for almost all the clients.Comment: ACL 2023 long pape
From Data to Actions in Intelligent Transportation Systems: A Prescription of Functional Requirements for Model Actionability
Advances in Data Science permeate every field of Transportation Science and Engineering,
resulting in developments in the transportation sector that are data-driven. Nowadays, Intelligent
Transportation Systems (ITS) could be arguably approached as a “story” intensively producing and
consuming large amounts of data. A diversity of sensing devices densely spread over the infrastructure,
vehicles or the travelers’ personal devices act as sources of data flows that are eventually
fed into software running on automatic devices, actuators or control systems producing, in turn,
complex information flows among users, traffic managers, data analysts, traffic modeling scientists,
etc. These information flows provide enormous opportunities to improve model development and
decision-making. This work aims to describe how data, coming from diverse ITS sources, can be used
to learn and adapt data-driven models for efficiently operating ITS assets, systems and processes;
in other words, for data-based models to fully become actionable. Grounded in this described data
modeling pipeline for ITS, we define the characteristics, engineering requisites and challenges intrinsic
to its three compounding stages, namely, data fusion, adaptive learning and model evaluation.
We deliberately generalize model learning to be adaptive, since, in the core of our paper is the firm
conviction that most learners will have to adapt to the ever-changing phenomenon scenario underlying
the majority of ITS applications. Finally, we provide a prospect of current research lines within
Data Science that can bring notable advances to data-based ITS modeling, which will eventually
bridge the gap towards the practicality and actionability of such models.This work was supported in part by the Basque Government for its funding support through the EMAITEK program (3KIA, ref. KK-2020/00049). It has also received funding support from the Consolidated Research Group MATHMODE (IT1294-19) granted by the Department of Education of the Basque Government
- …