A survey of document image word spotting techniques

AP Giotis, G Sfikas, B Gatos, C Nikou - Pattern recognition, 2017 - Elsevier
Vast collections of documents available in image format need to be indexed for information
retrieval purposes. In this framework, word spotting is an alternative solution to optical …

Visual search engine for handwritten and typeset math in lecture videos and latex notes

K Davila, R Zanibbi - 2018 16th International Conference on …, 2018 - ieeexplore.ieee.org
To fill a gap in online educational tools, we are working to support search in lecture videos
using formulas from lecture notes and vice versa. We use an existing system to convert …

A comparative study using contours and skeletons as shape representations for binary image matching

H Chatbri, K Kameyama, P Kwan - Pattern Recognition Letters, 2016 - Elsevier
Contours and skeletons are well-known shape representations that embody visual
information by using a limited set of object points. Both representations have been applied in …

Tangent-V: Math formula image search using line-of-sight graphs

K Davila, R Joshi, S Setlur, V Govindaraju… - Advances in Information …, 2019 - Springer
We present a visual search engine for graphics such as math, chemical diagrams, and
figures. Graphics are represented using Line-of-Sight (LOS) graphs, with symbols connected …

[책][B] Symbolic and visual retrieval of mathematical notation using formula graph symbol pair matching and structural alignment

KD Castellanos - 2017 - search.proquest.com
Large data collections containing millions of math formulae in different formats are available
on-line. Retrieving math expressions from these collections is challenging. We propose a …

Document image dataset indexing and compression using connected components clustering

H Chatbri, K Kameyama - 2015 14th IAPR International …, 2015 - ieeexplore.ieee.org
We present a method for document image dataset indexing and compression by clustering
of connected components. Our method extracts connected components from each dataset …

Generalized Haar-like filters for document analysis: application to word spotting and text extraction from comics

A Ghorbel - 2016 - theses.hal.science
The presented thesis follows two directions. The first one disposes a technique for text and
graphic separation in comics. The second one points out a learning free segmentation free …

A modular approach for query spotting in document images and its optimization using genetic algorithms

H Chatbri, P Kwan, K Kameyama - 2014 IEEE Congress on …, 2014 - ieeexplore.ieee.org
Query spotting in document images is a subclass of Content-Based Image Retrieval (CBIR)
algorithms concerned with detecting occurrences of a query in a document image. Due to …

Text-Line Extraction from Historical Kannada Document

P Ravi, C Naveena, YH Sharath Kumar… - Frontiers in Intelligent …, 2020 - Springer
In this work, we propose identification of text line from a historical Kannada document. The
proposed method consists of three stages: initially, preprocess the image by using Sauvola's …

[PDF][PDF] A two-stage approach for word searching in handwritten document images

P Mukherjee - 2019 - 20.198.91.3
Use of handwritten paper documents is still playing an importance role, despite growing use
of electronic documents in our day to day life. Current technologies allow convenient and …