Mathbert: A pre-trained model for mathematical formula understanding

S Peng, K Yuan, L Gao, Z Tang - arxiv preprint arxiv:2105.00377, 2021 - arxiv.org
Large-scale pre-trained models like BERT, have obtained a great success in various Natural
Language Processing (NLP) tasks, while it is still a challenge to adapt them to the math …

Mathematical Information Retrieval: A Review

P Dadure, P Pakray, S Bandyopadhyay - ACM Computing Surveys, 2024 - dl.acm.org
Mathematical formulas are commonly used to demonstrate theories and basic fundamentals
in the Science, Technology, Engineering, and Mathematics (STEM) domain. The burgeoning …

Introduction to mathematical language processing: Informal proofs, word problems, and supporting tasks

J Meadows, A Freitas - Transactions of the Association for …, 2023 - direct.mit.edu
Automating discovery in mathematics and science will require sophisticated methods of
information extraction and abstract reasoning, including models that can convincingly …

Math word problem generation with mathematical consistency and problem context constraints

Z Wang, AS Lan, RG Baraniuk - arxiv preprint arxiv:2109.04546, 2021 - arxiv.org
We study the problem of generating arithmetic math word problems (MWPs) given a math
equation that specifies the mathematical computation and a context that specifies the …

Overview of ARQMath 2020: CLEF lab on answer retrieval for questions on math

R Zanibbi, DW Oard, A Agarwal, B Mansouri - Experimental IR Meets …, 2020 - Springer
Abstract The ARQMath Lab at CLEF considers finding answers to new mathematical
questions among posted answers on a community question answering site (Math Stack …

Overview of arqmath-3 (2022): Third clef lab on answer retrieval for questions on math

B Mansouri, V Novotný, A Agarwal, DW Oard… - … Conference of the Cross …, 2022 - Springer
This paper provides an overview of the third and final year of the Answer Retrieval for
Questions on Math (ARQMath-3) lab, run as part of CLEF 2022. ARQMath has aimed to …

Evaluating token-level and passage-level dense retrieval models for math information retrieval

W Zhong, JH Yang, Y **e, J Lin - arxiv preprint arxiv:2203.11163, 2022 - arxiv.org
With the recent success of dense retrieval methods based on bi-encoders, studies have
applied this approach to various interesting downstream retrieval tasks with good efficiency …

Continual pre-training of language models for math problem understanding with syntax-aware memory network

Z Gong, K Zhou, WX Zhao, J Sha… - Proceedings of the …, 2022 - aclanthology.org
In this paper, we study how to continually pre-train language models for improving the
understanding of math problems. Specifically, we focus on solving a fundamental challenge …

Tree-based representation and generation of natural and mathematical language

A Scarlatos, A Lan - arxiv preprint arxiv:2302.07974, 2023 - arxiv.org
Mathematical language in scientific communications and educational scenarios is important
yet relatively understudied compared to natural languages. Recent works on mathematical …

Contrastive graph representations for logical formulas embedding

Q Lin, J Liu, L Zhang, Y Pan, X Hu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Currently, the non-transparent computing process of deep learning has become a significant
reason hindering its further development. The Neural-Symbolic (NS) system formed by …