Mathematical Information Retrieval: A Review

P Dadure, P Pakray, S Bandyopadhyay - ACM Computing Surveys, 2024 - dl.acm.org
Mathematical formulas are commonly used to demonstrate theories and basic fundamentals
in the Science, Technology, Engineering, and Mathematics (STEM) domain. The burgeoning …

Tangent-CFT: An embedding model for mathematical formulas

B Mansouri, S Rohatgi, DW Oard, J Wu… - Proceedings of the …, 2019 - dl.acm.org
When searching for mathematical content, accurate measures of formula similarity can help
with tasks such as document ranking, query recommendation, and result set clustering …

Introduction to mathematical language processing: Informal proofs, word problems, and supporting tasks

J Meadows, A Freitas - Transactions of the Association for …, 2023 - direct.mit.edu
Automating discovery in mathematics and science will require sophisticated methods of
information extraction and abstract reasoning, including models that can convincingly …

Evaluating token-level and passage-level dense retrieval models for math information retrieval

W Zhong, JH Yang, Y **e, J Lin - arxiv preprint arxiv:2203.11163, 2022 - arxiv.org
With the recent success of dense retrieval methods based on bi-encoders, studies have
applied this approach to various interesting downstream retrieval tasks with good efficiency …

Layout and semantics: Combining representations for mathematical formula search

K Davila, R Zanibbi - Proceedings of the 40th International ACM SIGIR …, 2017 - dl.acm.org
Math-aware search engines need to support formulae in queries. Mathematical expressions
are typically represented as trees defining their operational semantics or visual layout. We …

Accelerating substructure similarity search for formula retrieval

W Zhong, S Rohatgi, J Wu, CL Giles… - Advances in Information …, 2020 - Springer
Formula retrieval systems using substructure matching are effective, but suffer from slow
retrieval times caused by the complexity of structure matching. We present a specialized …

Structural similarity search for formulas using leaf-root paths in operator subtrees

W Zhong, R Zanibbi - Advances in Information Retrieval: 41st European …, 2019 - Springer
We present a new search method for mathematical formulas based on Operator Trees
(OPTs) representing the application of operators to operands. Our method provides (1) a …

Mathematical Information Retrieval: Search and Question Answering

R Zanibbi, B Mansouri, A Agarwal - Foundations and Trends® …, 2025 - nowpublishers.com
Mathematical information is essential for technical work, but its creation, interpretation, and
search are challenging. To help address these challenges, researchers have developed …

Math-word embedding in math search and semantic extraction

A Greiner-Petter, A Youssef, T Ruas, BR Miller… - Scientometrics, 2020 - Springer
Word embedding, which represents individual words with semantically fixed-length vectors,
has made it possible to successfully apply deep learning to natural language processing …

One blade for one purpose: advancing math information retrieval using hybrid search

W Zhong, SC Lin, JH Yang, J Lin - … of the 46th International ACM SIGIR …, 2023 - dl.acm.org
Neural retrievers have been shown to be effective for math-aware search. Their ability to
cope with math symbol mismatches, to represent highly contextualized semantics, and to …