Mathematical Information Retrieval: A Review
Mathematical formulas are commonly used to demonstrate theories and basic fundamentals
in the Science, Technology, Engineering, and Mathematics (STEM) domain. The burgeoning …
Tangent-CFT: An embedding model for mathematical formulas
When searching for mathematical content, accurate measures of formula similarity can help
with tasks such as document ranking, query recommendation, and result set clustering …
Introduction to mathematical language processing: Informal proofs, word problems, and supporting tasks
Automating discovery in mathematics and science will require sophisticated methods of
information extraction and abstract reasoning, including models that can convincingly …
Evaluating token-level and passage-level dense retrieval models for math information retrieval
With the recent success of dense retrieval methods based on bi-encoders, studies have
applied this approach to various interesting downstream retrieval tasks with good efficiency …
Layout and semantics: Combining representations for mathematical formula search
Math-aware search engines need to support formulae in queries. Mathematical expressions
are typically represented as trees defining their operational semantics or visual layout. We …
Accelerating substructure similarity search for formula retrieval
Formula retrieval systems using substructure matching are effective, but suffer from slow
retrieval times caused by the complexity of structure matching. We present a specialized …
Structural similarity search for formulas using leaf-root paths in operator subtrees
We present a new search method for mathematical formulas based on Operator Trees
(OPTs) representing the application of operators to operands. Our method provides (1) a …
Mathematical Information Retrieval: Search and Question Answering
Mathematical information is essential for technical work, but its creation, interpretation, and
search are challenging. To help address these challenges, researchers have developed …
Math-word embedding in math search and semantic extraction
Word embedding, which represents individual words with semantically fixed-length vectors,
has made it possible to successfully apply deep learning to natural language processing …
One blade for one purpose: advancing math information retrieval using hybrid search
Neural retrievers have been shown to be effective for math-aware search. Their ability to
cope with math symbol mismatches, to represent highly contextualized semantics, and to …