Search CORE

8,867 research outputs found

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure

Author: Bin Yi
Han Mengqun
Ng See-Kiong
Shen Heng Tao
Shi Wenhao
Wang Lei
Yang Yang
Publication venue
Publication date: 28/10/2023
Field of study

Existing MWP solvers employ sequence or binary tree to present the solution expression and decode it from given problem description. However, such structures fail to handle the variants that can be derived via mathematical manipulation, e.g.,

(a_1+a_2) * a_3

and

a_1 * a_3+a_2 * a_3

can both be possible valid solutions for a same problem but formulated as different expression sequences or trees. The multiple solution variants depicting different possible solving procedures for the same input problem would raise two issues: 1) making it hard for the model to learn the mapping function between the input and output spaces effectively, and 2) wrongly indicating \textit{wrong} when evaluating a valid expression variant. To address these issues, we introduce a unified tree structure to present a solution expression, where the elements are permutable and identical for all the expression variants. We propose a novel non-autoregressive solver, named \textit{MWP-NAS}, to parse the problem and deduce the solution expression based on the unified tree. For evaluating the possible expression variants, we design a path-based metric to evaluate the partial accuracy of expressions of a unified tree. The results from extensive experiments conducted on Math23K and MAWPS demonstrate the effectiveness of our proposed MWP-NAS. The codes and checkpoints are available at: \url{https://github.com/mengqunhan/MWP-NAS}.Comment: Accepted at EMNLP202

arXiv.org e-Print Archive

A Survey in Mathematical Language Processing

Author: Freitas Andre
Meadows Jordan
Publication venue
Publication date: 30/05/2022
Field of study

The University of Manchester - Institutional Repository

A Theme-Rewriting Approach for Generating Algebra Word Problems

Author: Hajishirzi Hannaneh
Koncel-Kedziorski Rik
Konstas Ioannis
Zettlemoyer Luke
Publication venue
Publication date: 01/01/2016
Field of study

Texts present coherent stories that have a particular theme or overall setting, for example science fiction or western. In this paper, we present a text generation method called {\it rewriting} that edits existing human-authored narratives to change their theme without changing the underlying story. We apply the approach to math word problems, where it might help students stay more engaged by quickly transforming all of their homework assignments to the theme of their favorite movie without changing the math concepts that are being taught. Our rewriting method uses a two-stage decoding process, which proposes new words from the target theme and scores the resulting stories according to a number of factors defining aspects of syntactic, semantic, and thematic coherence. Experiments demonstrate that the final stories typically represent the new theme well while still testing the original math concepts, outperforming a number of baselines. We also release a new dataset of human-authored rewrites of math word problems in several themes.Comment: To appear EMNLP 201

arXiv.org e-Print Archive

Heriot Watt Pure

Crossref

A Survey of Deep Learning for Mathematical Reasoning

Author: Chang Kai-Wei
Lu Pan
Qiu Liang
Welleck Sean
Yu Wenhao
Publication venue
Publication date: 21/06/2023
Field of study

Mathematical reasoning is a fundamental aspect of human intelligence and is applicable in various fields, including science, engineering, finance, and everyday life. The development of artificial intelligence (AI) systems capable of solving math problems and proving theorems has garnered significant interest in the fields of machine learning and natural language processing. For example, mathematics serves as a testbed for aspects of reasoning that are challenging for powerful deep learning models, driving new algorithmic and modeling advances. On the other hand, recent advances in large-scale neural language models have opened up new benchmarks and opportunities to use deep learning for mathematical reasoning. In this survey paper, we review the key tasks, datasets, and methods at the intersection of mathematical reasoning and deep learning over the past decade. We also evaluate existing benchmarks and methods, and discuss future research directions in this domain.Comment: Accepted to ACL 2023. The repository is available at https://github.com/lupantech/dl4mat

arXiv.org e-Print Archive

Unbiased Math Word Problems Benchmark for Mitigating Solving Bias

Author: Chen Jiaqi
Liang Xiaodan
Qin Jinghui
Yang Zhicheng
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 17/05/2022
Field of study

In this paper, we revisit the solving bias when evaluating models on current Math Word Problem (MWP) benchmarks. However, current solvers exist solving bias which consists of data bias and learning bias due to biased dataset and improper training strategy. Our experiments verify MWP solvers are easy to be biased by the biased training datasets which do not cover diverse questions for each problem narrative of all MWPs, thus a solver can only learn shallow heuristics rather than deep semantics for understanding problems. Besides, an MWP can be naturally solved by multiple equivalent equations while current datasets take only one of the equivalent equations as ground truth, forcing the model to match the labeled ground truth and ignoring other equivalent equations. Here, we first introduce a novel MWP dataset named UnbiasedMWP which is constructed by varying the grounded expressions in our collected data and annotating them with corresponding multiple new questions manually. Then, to further mitigate learning bias, we propose a Dynamic Target Selection (DTS) Strategy to dynamically select more suitable target expressions according to the longest prefix match between the current model output and candidate equivalent equations which are obtained by applying commutative law during training. The results show that our UnbiasedMWP has significantly fewer biases than its original data and other datasets, posing a promising benchmark for fairly evaluating the solvers' reasoning skills rather than matching nearest neighbors. And the solvers trained with our DTS achieve higher accuracies on multiple MWP benchmarks. The source code is available at https://github.com/yangzhch6/UnbiasedMWP

arXiv.org e-Print Archive