- Academic Search

J Wan, S Song, W Yu, Y Liu, W Cheng… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recently visually-situated text parsing (VsTP) has experienced notable advancements
driven by the increasing demand for automated document understanding and the …

Spara Citera Citerat av 14 Relaterade artiklar Alla 7 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Odm: A text-image further alignment pre-training approach for scene text detection and spotting

C Duan, P Fu, S Guo, Q Jiang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

In recent years text-image joint pre-training techniques have shown promising results in
various tasks. However in Optical Character Recognition (OCR) tasks aligning text instances …

Spara Citera Citerat av 6 Relaterade artiklar Alla 7 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Lane2seq: towards unified lane detection via sequence generation

K Zhou - Proceedings of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

In this paper we present a novel sequence generation-based framework for lane detection
called Lane2Seq. It unifies various lane detection formats by casting lane detection as a …

Spara Citera Citerat av 5 Relaterade artiklar Alla 5 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Platypus: A generalized specialist model for reading text in various forms

P Wang, Z Li, J Tang, H Zhong, F Huang… - … on Computer Vision, 2024 - Springer

Reading text from images (either natural scenes or documents) has been a long-standing
research topic for decades, due to the high technical challenge and wide application range …

Spara Citera Citerat av 2 Relaterade artiklar Alla 7 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

DNTextSpotter: Arbitrary-shaped scene text spotting via improved denoising training

Q Qiao, Y **e, J Gao, T Wu, S Huang, J Fan… - Proceedings of the …, 2024 - dl.acm.org

More and more end-to-end text spotting methods based on Transformer architecture have
demonstrated superior performance. These methods utilize a bipartite graph matching …

Spara Citera Citerat av 4 Relaterade artiklar Alla 4 versionerna

Hyper-local deformable transformers for text spotting on historical maps

Y Lin, YY Chiang - Proceedings of the 30th ACM SIGKDD Conference …, 2024 - dl.acm.org

Text on historical maps contains valuable information providing georeferenced historical,
political, and cultural contexts. However, text extraction from historical maps has been …

Spara Citera Citerat av 4 Relaterade artiklar Alla 2 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Hierarchical text spotter for joint text spotting and layout analysis

S Long, S Qin, Y Fujii, A Bissacco… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract We propose Hierarchical Text Spotter (HTS), a novel method for the joint task of
word-level text spotting and geometric layout analysis. HTS can recognize text in an image …

Spara Citera Citerat av 5 Relaterade artiklar Alla 6 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Bridging the Gap Between End-to-End and Two-Step Text Spotting

M Huang, H Li, Y Liu, X Bai… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Modularity plays a crucial role in the development and maintenance of complex systems.
While end-to-end text spotting efficiently mitigates the issues of error accumulation and sub …

Spara Citera Relaterade artiklar Alla 7 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap

D Kim, Y Kim, DH Kim, Y Lim… - Proceedings of the …, 2023 - openaccess.thecvf.com

Inspired by the great success of language model (LM)-based pre-training, recent studies in
visual document understanding have explored LM-based pre-training methods for modeling …

Spara Citera Citerat av 2 Relaterade artiklar Alla 5 versionerna Se som HTML-version

A Mixed-Precision Transformer Accelerator With Vector Tiling Systolic Array for License Plate Recognition in Unconstrained Scenarios

J Li, D Yan, F He, Z Dong… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Power efficiency for license plate recognition (LPR) under unconstrained scenarios is a
crucial factor in many edge-based real-world applications, eg, autonomous vehicles whose …

Spara Citera Citerat av 1 Relaterade artiklar Alla 2 versionerna

Skapa alarm

Citera

Avancerad sökning

Har sparats i Mitt bibliotek

Towards unified scene text spotting based on sequence generation

Omniparser: A unified framework for text spotting key information extraction and table recognition

Odm: A text-image further alignment pre-training approach for scene text detection and spotting

Lane2seq: towards unified lane detection via sequence generation

Platypus: A generalized specialist model for reading text in various forms

DNTextSpotter: Arbitrary-shaped scene text spotting via improved denoising training

Hyper-local deformable transformers for text spotting on historical maps

Hierarchical text spotter for joint text spotting and layout analysis

Bridging the Gap Between End-to-End and Two-Step Text Spotting

SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap

A Mixed-Precision Transformer Accelerator With Vector Tiling Systolic Array for License Plate Recognition in Unconstrained Scenarios