83 research outputs found
Recommended from our members
The role of machine learning in personalised instructional sequencing for language learning
The origins of personalised instructional sequencing can be dated back to the times of the Ancient Greeks to the times of Alexander The Great's tutor, Aristotle. However, over the centuries the demand for education and growth of students has been disproportionately greater than the number of teachers in training. Therefore, there has been a longstanding interest in finding a way to scale education without negatively affecting learning outcomes. This interest was fuelled further with the advent of computers and artificial intelligence, where a plethora of systems and models were built to bring technology driven personalised instructional sequencing to the world. Unfortunately, results were far from groundbreaking and many challenges still remain.
In my thesis, I investigate three aspects of personalised instructional sequencing: the personalised instructional sequencing mechanism, the student knowledge representation, and human forgetting. While I do not cover the entirety of personalised instructional sequencing, I cover what I consider the foundational components. I link psychological theory to model selection and design in each of my systems and present experiments to illustrate their impact. I show how reinforcement learning can be used for vocabulary learning. I also present a model that uses neural collaborative filtering to learn student knowledge representations. Lastly, I present a state-of-the-art model to predict the probability of vocabulary word recall for students learning English as a second language. The system's novelty lies in the use of word complexity to adapt the forgetting curve as well as its incorporation of psychological theory to select an appropriate model
Personalised Dialogue Management for Users with Speech Disorders
Many electronic devices are beginning to include Voice User Interfaces (VUIs) as an alternative to conventional interfaces. VUIs are especially useful for users with restricted upper limb mobility, because they cannot use keyboards and mice. These users, however, often suffer from speech disorders (e.g. dysarthria), making Automatic Speech Recognition (ASR) challenging, thus degrading the performance of the VUI. Partially Observable Markov Decision Process (POMDP) based Dialogue Management (DM) has been shown to improve the interaction performance in challenging ASR environments, but most of the research in this area has focused on Spoken Dialogue Systems (SDSs) developed to provide information, where the users interact with the system only a few times. In contrast, most VUIs are likely to be used by a single speaker over a long period of time, but very little research has been carried out on adaptation of DM models to specific speakers.
This thesis explores methods to adapt DM models (in particular dialogue state tracking models and policy models) to a specific user during a longitudinal interaction. The main differences between personalised VUIs and typical SDSs are identified and studied. Then, state-of-the-art DM models are modified to be used in scenarios which are unique to long-term personalised VUIs, such as personalised models initialised with data from different speakers or scenarios where the dialogue environment (e.g. the ASR) changes over time. In addition, several speaker and environment related features are shown to be useful to improve the interaction performance. This study is done in the context of homeService, a VUI developed to help users with dysarthria to control their home devices. The study shows that personalisation of the POMDP-DM framework can greatly improve the performance of these interfaces
Architectural artificial intelligence: exploring and developing strategies, tools, and pedagogies toward the integration of deep learning in the architectural profession
The growing incessance for data collection is a trend born from the basic promise of data: “save
everything you can, and someday you’ll be able to figure out some use for it all” (Schneier 2016,
p. 40). However, this has manifested as a plague of information overload, where “it would simply
be impossible for humans to deal with all of this data” (Davenport 2014, p. 151). Especially within
the field of architecture, where designers are tasked with leveraging all available sources of
information to compose an informed solution. Too often, “the average designer scans whatever
information [they] happen on, […] and introduces this randomly selected information into forms
otherwise dreamt up in the artist’s studio of mind” (Alexander 1964, p. 4). As data accumulates—
less so the “oil”, and more the “exhaust of the information age” (Schneier 2016, p. 20)—we are
rapidly approaching a point where even the programmers enlisted to automate are inadequate.
Yet, as the size of data warehouses increases, so too does the available computational power and
the invention of clever algorithms to negotiate it. Deep learning is an exemplar. A subset of
artificial intelligence, deep learning is a collection of algorithms inspired by the brain, capable of
automated self-improvement, or “learning”, through observations of large quantities of data. In
recent years, the rise in computational power and the access to these immense databases have
fostered the proliferation of deep learning to almost all fields of endeavour. The application of
deep learning in architecture not only has the potential to resolve the issue of rising complexity,
but introduce a plethora of new tools at the architect’s disposal, such as computer vision, natural
language processing, and recommendation systems. Already, we are starting to see its impact on
the field of architecture. Which raises the following questions: what is the current state of deep
learning adoption in architecture, how can one better facilitate its integration, and what are the
implications for doing so? This research aims to answer those questions through an exploration
of strategies, tools, and pedagogies for the integration of deep learning in the architectural
profession
Recommended from our members
Discriminative methods for statistical spoken dialogue systems
Dialogue promises a natural and effective method for users to interact with and obtain information from computer systems. Statistical spoken dialogue systems are able to disambiguate in the presence of errors by maintaining probability distributions over what they believe to be the state of a dialogue. However, traditionally these distributions have been derived using generative models, which do not directly optimise for the criterion of interest and cannot easily exploit arbitrary information that may potentially be useful. This thesis presents how discriminative methods can overcome these problems in Spoken Language Understanding (SLU) and Dialogue State Tracking (DST).
A robust method for SLU is proposed, based on features extracted from the full posterior distribution of recognition hypotheses encoded in the form of word confusion networks. This method uses discriminative classifiers, trained on unaligned input/output pairs. Performance is evaluated on both an off-line corpus, and on-line in a live user trial. It is shown that a statistical discriminative approach to SLU operating on the full posterior ASR output distribution can substantially improve performance in terms of both accuracy and overall dialogue reward. Furthermore, additional gains can be obtained by incorporating features from the system's output.
For DST, a new word-based tracking method is presented that maps directly from the speech recognition results to the dialogue state without using an explicit semantic decoder. The method is based on a recurrent neural network structure that is capable of generalising to unseen dialogue state hypotheses, and requires very little feature engineering. The method is evaluated in the second and third Dialog State Tracking Challenges, as well as in a live user trial. The results demonstrate consistently high performance across all of the off-line metrics and a substantial increase in the quality of the dialogues in the live trial. The proposed method is shown to be readily applied to expanding dialogue domains, by exploiting robust features and a new method for online unsupervised adaptation. It is shown how the neural network structure can be adapted to output structured joint distributions, giving an improvement over estimating the dialogue state as a product of marginal distributions
Multi-agent stochastic simulation of occupants in buildings
One of the principle causes for deviations between predicted and simulated performance of buildings relates to the stochastic nature of their occupants: their presence, activities whilst present, activity dependent behaviours and the consequent implications for their perceived comfort. A growing research community is active in the development and validation of stochastic models addressing these issues; and considerable progress has been made. Specifically models in the areas of presence, activities while present, shading devices, window openings and lighting usage.
One key outstanding challenge relates to the integration of these prototype models with building simulation in a coherent and generalizable way; meaning that emerging models can be integrated with a range of building simulation software. This thesis describes our proof of concept platform that integrates stochastic occupancy models within a multi agent simulation platform, which communicates directly with building simulation software. The tool is called Nottingham Multi-Agent Stochastic Simulation (No-MASS).
No-MASS is tested with a building performance simulation solver to demonstrate the effectiveness of the integrated stochastic models on a residential building and a non-residential building. To account for diversity between occupants No-MASS makes use of archetypical behaviours within the stochastic models of windows, shades and activities. Thus providing designers with means to evaluate the performance of their designs in response to the range of expected behaviours and to evaluate the robustness of their design solutions; which is not possible using current simplistic deterministic representations.
A methodology for including rule based models is built into No-MASS, this allows for testing what-if scenarios with building performance simulation and provides a pragmatic basis for the modelling of the behaviours for which there is insufficient data to develop stochastic models. A Belief-Desire-Intention model is used to develop a set of goals and plans that an agent must follow to influence the environment based on their beliefs about current environmental conditions. Recommendations for the future development of stochastic models are presented based on the sensitivity analysis of the plans.
A social interactions framework is developed within No-MASS to resolve conflicts between competing agents.This framework resolves situations where each agent may have different desires, for example one may wish to have a window open and another closed based on the outputs of the stochastic models. A votes casting system determines the agent choice, the most votes becomes the action acted on.
No-MASS employs agent machine learning techniques that allow them to learn how to respond to the processes taking place within a building and agents can choose a strategy without the need for context specific rules.
Employing these complementary techniques to support the comprehensive simulation of occupants presence and behaviour, integrated within a single platform that can readily interface with a range of building (and urban) energy simulation programs is the key contribution to knowledge from this thesis. Nevertheless, there is significant scope to extend this work to further reduce the performance gap between simulated and real world buildings
Computational Argumentation for the Automatic Analysis of Argumentative Discourse and Human Persuasion
Tesis por compendio[ES] La argumentación computacional es el área de investigación que estudia y analiza el uso de distintas técnicas y algoritmos que aproximan el razonamiento argumentativo humano desde un punto de vista computacional. En esta tesis doctoral se estudia el uso de distintas técnicas propuestas bajo el marco de la argumentación computacional para realizar un análisis automático del discurso argumentativo, y para desarrollar técnicas de persuasión computacional basadas en argumentos. Con estos objetivos, en primer lugar se presenta una completa revisión del estado del arte y se propone una clasificación de los trabajos existentes en el área de la argumentación computacional. Esta revisión nos permite contextualizar y entender la investigación previa de forma más clara desde la perspectiva humana del razonamiento argumentativo, así como identificar las principales limitaciones y futuras tendencias de la investigación realizada en argumentación computacional. En segundo lugar, con el objetivo de solucionar algunas de estas limitaciones, se ha creado y descrito un nuevo conjunto de datos que permite abordar nuevos retos y investigar problemas previamente inabordables (e.g., evaluación automática de debates orales). Conjuntamente con estos datos, se propone un nuevo sistema para la extracción automática de argumentos y se realiza el análisis comparativo de distintas técnicas para esta misma tarea. Además, se propone un nuevo algoritmo para la evaluación automática de debates argumentativos y se prueba con debates humanos reales. Finalmente, en tercer lugar se presentan una serie de estudios y propuestas para mejorar la capacidad persuasiva de sistemas de argumentación computacionales en la interacción con usuarios humanos. De esta forma, en esta tesis se presentan avances en cada una de las partes principales del proceso de argumentación computacional (i.e., extracción automática de argumentos, representación del conocimiento y razonamiento basados en argumentos, e interacción humano-computador basada en argumentos), así como se proponen algunos de los cimientos esenciales para el análisis automático completo de discursos argumentativos en lenguaje natural.[CA] L'argumentació computacional és l'àrea de recerca que estudia i analitza l'ús de distintes tècniques i algoritmes que aproximen el raonament argumentatiu humà des d'un punt de vista computacional. En aquesta tesi doctoral s'estudia l'ús de distintes tècniques proposades sota el marc de l'argumentació computacional per a realitzar una anàlisi automàtic del discurs argumentatiu, i per a desenvolupar tècniques de persuasió computacional basades en arguments. Amb aquestos objectius, en primer lloc es presenta una completa revisió de l'estat de l'art i es proposa una classificació dels treballs existents en l'àrea de l'argumentació computacional. Aquesta revisió permet contextualitzar i entendre la investigació previa de forma més clara des de la perspectiva humana del raonament argumentatiu, així com identificar les principals limitacions i futures tendències de la investigació realitzada en argumentació computacional. En segon lloc, amb l'objectiu de sollucionar algunes d'aquestes limitacions, hem creat i descrit un nou conjunt de dades que ens permet abordar nous reptes i investigar problemes prèviament inabordables (e.g., avaluació automàtica de debats orals). Conjuntament amb aquestes dades, es proposa un nou sistema per a l'extracció d'arguments i es realitza l'anàlisi comparativa de distintes tècniques per a aquesta mateixa tasca. A més a més, es proposa un nou algoritme per a l'avaluació automàtica de debats argumentatius i es prova amb debats humans reals. Finalment, en tercer lloc es presenten una sèrie d'estudis i propostes per a millorar la capacitat persuasiva de sistemes d'argumentació computacionals en la interacció amb usuaris humans. D'aquesta forma, en aquesta tesi es presenten avanços en cada una de les parts principals del procés d'argumentació computacional (i.e., l'extracció automàtica d'arguments, la representació del coneixement i raonament basats en arguments, i la interacció humà-computador basada en arguments), així com es proposen alguns dels fonaments essencials per a l'anàlisi automàtica completa de discursos argumentatius en llenguatge natural.[EN] Computational argumentation is the area of research that studies and analyses the use of different techniques and algorithms that approximate human argumentative reasoning from a computational viewpoint. In this doctoral thesis we study the use of different techniques proposed under the framework of computational argumentation to perform an automatic analysis of argumentative discourse, and to develop argument-based computational persuasion techniques. With these objectives in mind, we first present a complete review of the state of the art and propose a classification of existing works in the area of computational argumentation. This review allows us to contextualise and understand the previous research more clearly from the human perspective of argumentative reasoning, and to identify the main limitations and future trends of the research done in computational argumentation. Secondly, to overcome some of these limitations, we create and describe a new corpus that allows us to address new challenges and investigate on previously unexplored problems (e.g., automatic evaluation of spoken debates). In conjunction with this data, a new system for argument mining is proposed and a comparative analysis of different techniques for this same task is carried out. In addition, we propose a new algorithm for the automatic evaluation of argumentative debates and we evaluate it with real human debates. Thirdly, a series of studies and proposals are presented to improve the persuasiveness of computational argumentation systems in the interaction with human users. In this way, this thesis presents advances in each of the main parts of the computational argumentation process (i.e., argument mining, argument-based knowledge representation and reasoning, and argument-based human-computer interaction), and proposes some of the essential foundations for the complete automatic analysis of natural language argumentative discourses.This thesis has been partially supported by the Generalitat Valenciana project PROME-
TEO/2018/002 and by the Spanish Government projects TIN2017-89156-R and PID2020-
113416RB-I00.Ruiz Dolz, R. (2023). Computational Argumentation for the Automatic Analysis of Argumentative Discourse and Human Persuasion [Tesis doctoral]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/194806Compendi
Deep Learning of the Order Flow for Modelling Price Formation
The objective of this thesis is to apply deep learning to order flow data in novel ways, in order to improve price prediction models, and thus improve on current deep price formation models. A survey of previous work in the deep modelling of price formation revealed the importance of utilising the order flow for the deep learning of price formation had previously been over looked. Previous work in the statistical modelling of the price formation process in contrast has always focused on order flow data. To demonstrate the advantage of utilising order flow data for learning deep price formation models, the thesis first benchmarks order flow trained Recurrent Neural Networks (RNNs), against methods used in previous work for predicting directional mid-price movements. To further improve the price modelling capability of the RNN, a novel deep mixture model extension to the model architecture is then proposed. This extension provides a more realistically uncertain prediction of the mid-price, and also jointly models the direction and size of the mid-price movements. Experiments conducted showed that this novel architecture resulted in an improved model compared to common benchmarks. Lastly, a novel application of Generative Adversarial Networks (GANs) was introduced for generative modelling of the order flow sequences that induce the mid-price movements. Experiments are presented that show the GAN model is able to generate more realistic sequences than a well-known benchmark model. Also, the mid-price time-series resulting from the SeqGAN generated order flow is able to better reproduce the statistical behaviour of the real mid-price time-series
Recommended from our members
Learning meaning representations for text generation with deep generative models
This thesis explores conditioning a language generation model with auxiliary variables. By doing so, we hope to be able to better control the output of the language generator. We explore several kinds of auxiliary variables in this thesis, from unstructured continuous, to discrete, to structured discrete auxiliary variables, and evaluate their advantages and disadvantages. We consider three primary axes of variation: how interpretable the auxiliary variables are, how much control they provide over the generated text, and whether the variables can be induced from unlabelled data. The latter consideration is particularly interesting: if we can show that induced latent variables correspond to the semantics of the generated utterance, then by manipulating the variables, we have fine-grained control over the meaning of the generated utterance, thereby learning simple meaning representations for text generation.
We investigate three language generation tasks: open domain conversational response generation, sentence generation from a semantic topic, and generating surface form realisations of meaning representations. We use a different type of auxiliary variable for each task, describe the reasons for choosing that type of variable, and critically discuss how much the task benefited from an auxiliary variable decomposition. All of the models that we use combine a high-level graphical model with a neural language model text generator. The graphical model lets us specify the structure of the text generating process, while the neural text generator can learn how to generate fluent text from a large corpus of examples. We aim to show the utility of such \textit{deep generative models} of text for text generation in the following work
- …