ArMATH: a Dataset for Solving Arabic Math Word Problems

Reem Alghamdi, Zhenwen Liang, Xiangliang Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

This paper studies solving Arabic Math Word Problems by deep learning. A Math Word Problem (MWP) is a text description of a mathematical problem that can be solved by deriving a math equation to reach the answer. Effective models have been developed for solving MWPs in English and Chinese. However, Arabic MWPs are rarely studied. This paper contributes the first large-scale dataset for Arabic MWPs, which contains 6,000 samples of primary-school math problems, written in Modern Standard Arabic (MSA). Arabic MWP solvers are then built with deep learning models and evaluated on this dataset. In addition, a transfer learning model is built to let the high-resource Chinese MWP solver promote the performance of the low-resource Arabic MWP solver. This work is the first to use deep learning methods to solve Arabic MWP and the first to use transfer learning to solve MWP across different languages. The transfer learning enhanced solver has an accuracy of 74.15%, which is 3% higher than the solver without using transfer learning.
Original languageEnglish (US)
Title of host publication13th International Conference on Language Resources and Evaluation (LREC)
PublisherEuropean Language Resources Association
Pages351-362
Number of pages12
StatePublished - Jun 2022

Fingerprint

Dive into the research topics of 'ArMATH: a Dataset for Solving Arabic Math Word Problems'. Together they form a unique fingerprint.

Cite this