17th International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2023, Hammamet, Tunus, 20 - 23 Eylül 2023
Math Word Problem (MWP) is a challenging Natural Language Processing (NLP) task. Existing MWP solvers have shown that current models need to generalize better and obtain higher performances. In this study, we aim to enrich existing MWP datasets with high-quality data, which may improve MWP solvers' performances. We propose several data augmentation methods by applying minor modifications to the problem texts and equations of English MWPs datasets which contain equations with one unknown. Extensive experiments on two MWPs datasets have shown that data created by augmented methods have considerably improved performance. Moreover, further increasing the training samples by combining the samples generated by the proposed augmentation methods provides further performance improvements.