Google Scholar

Knowledge graphs: A practical review of the research landscape

M Kejriwal - Information, 2022 - mdpi.com

Knowledge graphs (KGs) have rapidly emerged as an important area in AI over the last ten
years. Building on a storied tradition of graphs in the AI community, a KG may be simply …

Cited by 70

[PDF] arxiv.org

Webformer: The web-page transformer for structure information extraction

Q Wang, Y Fang, A Ravula, F Feng, X Quan… - Proceedings of the ACM …, 2022 - dl.acm.org

Structure information extraction refers to the task of extracting structured text fields from web
pages, such as extracting a product offer from a shopping page including product title …

Cited by 75

[PDF] arxiv.org

NAS-BERT: Task-agnostic and adaptive-size BERT compression with neural architecture search

J Xu, X Tan, R Luo, K Song, J Li, T Qin… - Proceedings of the 27th …, 2021 - dl.acm.org

While pre-trained language models (e.g., BERT) have achieved impressive results on
different natural language processing tasks, they have large numbers of parameters and …

Cited by 87

[PDF] arxiv.org

Spatial dependency parsing for semi-structured document information extraction

W Hwang, J Yim, S Park, S Yang, M Seo - arXiv preprint arXiv:2005.00642, 2020 - arxiv.org

Information Extraction (IE) for semi-structured document images is often approached as a
sequence tagging problem by classifying each recognized input token into one of the IOB …

Cited by 105

[PDF] arxiv.org

MarkupLM: Pre-training of text and markup language for visually-rich document understanding

J Li, Y Xu, L Cui, F Wei - arXiv preprint arXiv:2110.08518, 2021 - arxiv.org

Multimodal pre-training with text, layout, and image has made significant progress for
Visually Rich Document Understanding (VRDU), especially the fixed-layout documents such …

Cited by 62

[PDF] acm.org

Data extraction via semantic regular expression synthesis

Q Chen, A Banerjee, Ç Demiralp, G Durrett… - Proceedings of the ACM …, 2023 - dl.acm.org

Many data extraction tasks of practical relevance require not only syntactic pattern matching
but also semantic reasoning about the content of the underlying text. While regular …

Cited by 16

[PDF] arxiv.org

DOM-LM: Learning generalizable representations for HTML documents

X Deng, P Shiralkar, C Lockard, B Huang… - arXiv preprint arXiv …, 2022 - arxiv.org

HTML documents are an important medium for disseminating information on the Web for
human consumption. An HTML document presents information in multiple text formats …

Cited by 33

[PDF] arxiv.org

Simplified DOM trees for transferable attribute extraction from the web

Y Zhou, Y Sheng, N Vo, N Edmonds, S Tata - arXiv preprint arXiv …, 2021 - arxiv.org

There has been a steady need to precisely extract structured knowledge from the web (i.e.,
HTML documents). Given a web page, extracting a structured object along with various …

Cited by 42

[PDF] acm.org

Web question answering with neurosymbolic program synthesis

Q Chen, A Lamoreaux, X Wang, G Durrett… - Proceedings of the …, 2021 - dl.acm.org

In this paper, we propose a new technique based on program synthesis for extracting
information from webpages. Given a natural language query and a few labeled webpages …

Cited by 37

[PDF] aaai.org

WIERT: web information extraction via render tree

Z Li, B Shao, L Shou, M Gong, G Li… - Proceedings of the AAAI …, 2023 - ojs.aaai.org

Web information extraction (WIE) is a fundamental problem in web document understanding,
with a significant impact on various applications. Visual information plays a crucial role in …

Cited by 6
