Search CORE

9,210 research outputs found

Towards mathematical AI via a model of the content and process of mathematical question and answer dialogues

Author: A Pease
A Peldszus
C Kaliszyk
D Ferrucci
D Walton
F Macagno
F Tanswell
F Wiedijk
G Stahl
J Harrison
JF Sowa
K Nuamah
M Cramer
M Ganesalingam
Á Carrera
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

This paper outlines a strategy for building semantically meaningful representations and carrying out effective reasoning in technical knowledge domains such as mathematics. Our central assertion is that the semi-structured Q and A format, as used on the popular Stack Exchange network of websites, exposes domain knowledge in a form that is already reasonably close to the structured knowledge formats that computers can reason about. The knowledge in question is not only facts - but discursive, dialectical, argument for purposes of proof and pedagogy. We therefore assert that modelling the Q and A process computationally provides a route to domain understanding that is compatible with the day-to-day practices of mathematicians and students. This position is supported by a small case study that analyses one question from Mathoverflow in detail, using concepts from argumentation theory. A programme of future work, including a rigorous evaluation strategy, is then advanced

Crossref

Edinburgh Research Explorer

Oxford University Research Archive

University of Dundee Online Publications

Using dialogue to learn math in the LeActiveMath project

Author: Callaway Charles
Dzikovska Myroslava
Matheson Colin
Moore Johanna
Zinn Claus
Publication venue
Publication date: 01/01/2006
Field of study

We describe a tutorial dialogue system under development that assists students in learning how to differentiate equations. The system uses deep natural language understanding and generation to both interpret students ’ utterances and automatically generate a response that is both mathematically correct and adapted pedagogically and linguistically to the local dialogue context. A domain reasoner provides the necessary knowledge about how students should approach math problems as well as their (in)correctness, while a dialogue manager directs pedagogical strategies and keeps track of what needs to be done to keep the dialogue moving along.

CiteSeerX

Edinburgh Research Explorer

Modelling the way mathematics is actually done

Author: Alexander Christopher
Austin John Langshaw
Barr Avron
Barthélémy Jean-Hugues
Budzynska Katarzyna
Budzynska Katarzyna
Carnap Rudolf
Corneli Joseph
Cramer Marcos
Cramer Marcos
Ganesalingam M.
Ginev Deyan
Guckelsberger Christian
Hermann Karl Moritz
Jamnik Mateja
Kaliszyk Cezary
Lamport Leslie
Nesin Gabriela Rino
Pease Alison
Peirce Charles S.
Pólya G.
Schank Roger C
Sowa John F
Sowa John F
Thomas Hales
Zhang Amy X
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

Crossref

University of Brighton Research Portal

Edinburgh Research Explorer

University of Dundee Online Publications

Argumentation Theory for Mathematical Argument

Author: Corneli Joseph
Martin Ursula
Murray-Rust Dave
Pease Alison
Rino Nesin Gabriela
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/07/2018
Field of study

To adequately model mathematical arguments the analyst must be able to represent the mathematical objects under discussion and the relationships between them, as well as inferences drawn about these objects and relationships as the discourse unfolds. We introduce a framework with these properties, which has been used to analyse mathematical dialogues and expository texts. The framework can recover salient elements of discourse at, and within, the sentence level, as well as the way mathematical content connects to form larger argumentative structures. We show how the framework might be used to support computational reasoning, and argue that it provides a more natural way to examine the process of proving theorems than do Lamport's structured proofs.Comment: 44 pages; to appear in Argumentatio

arXiv.org e-Print Archive

Crossref

University of Brighton Research Portal

Edinburgh Research Explorer

University of Dundee Online Publications

Using ChatGPT and other LLMs in Professional Environments

Author: Alaswad S.
Kalganova T.
S. Awad W.
Publication venue: Arab Journals Platform
Publication date: 15/09/2023
Field of study

Large language models like ChatGPT, Google’s Bard, and Microsoft’s new Bing, to name a few, are developing rapidly in recent years, becoming very popular in different environments, and supporting a wide range of tasks. A deep look into their outcomes reveals several limitations and challenges that can be further improved. The main challenge of these models is the possibility of generating biased or inaccurate results, since these models rely on large amounts of data with no access to unpublic information. Moreover, these language models need to be properly monitored and trained to prevent generating inappropriate or offensive content and to ensure that they are used ethically and safely. This study investigates the use of ChatGPT and other large language models such as Blender, and BERT in professional environments. It has been found that none of the large language models, including ChatGPT, have been used in unstructured dialogues. Moreover, involving the models in professional environments requires extensive training and monitoring by domain professionals or fine-tuning through API

Arab Journals Platform

Computational Poetry Workshop: Making Sense of Work in Progress

Author: Colton Simon
Corneli Joseph
Guckelsberger Christian
Jordanous Anna
Llano Maria Teresa
Misztal Joanna
Shepperd Rosie
Publication venue
Publication date: 01/01/2015
Field of study

Edinburgh Research Explorer

Kent Academic Repository

Datasets for Large Language Models: A Comprehensive Survey

Author: Cao Jiahuan
Ding Kai
Jin Lianwen
Liu Chongyu
Liu Yang
Publication venue
Publication date: 27/02/2024
Field of study

This paper embarks on an exploration into the Large Language Model (LLM) datasets, which play a crucial role in the remarkable advancements of LLMs. The datasets serve as the foundational infrastructure analogous to a root system that sustains and nurtures the development of LLMs. Consequently, examination of these datasets emerges as a critical topic in research. In order to address the current lack of a comprehensive overview and thorough analysis of LLM datasets, and to gain insights into their current status and future trends, this survey consolidates and categorizes the fundamental aspects of LLM datasets from five perspectives: (1) Pre-training Corpora; (2) Instruction Fine-tuning Datasets; (3) Preference Datasets; (4) Evaluation Datasets; (5) Traditional Natural Language Processing (NLP) Datasets. The survey sheds light on the prevailing challenges and points out potential avenues for future investigation. Additionally, a comprehensive review of the existing available dataset resources is also provided, including statistics from 444 datasets, covering 8 language categories and spanning 32 domains. Information from 20 dimensions is incorporated into the dataset statistics. The total data size surveyed surpasses 774.5 TB for pre-training corpora and 700M instances for other datasets. We aim to present the entire landscape of LLM text datasets, serving as a comprehensive reference for researchers in this field and contributing to future studies. Related resources are available at: https://github.com/lmmlzn/Awesome-LLMs-Datasets.Comment: 181 pages, 21 figure

arXiv.org e-Print Archive