Knowledge graphs: A practical review of the research landscape
M Kejriwal - Information, 2022 - mdpi.com
Knowledge graphs (KGs) have rapidly emerged as an important area in AI over the last ten
years. Building on a storied tradition of graphs in the AI community, a KG may be simply …
years. Building on a storied tradition of graphs in the AI community, a KG may be simply …
Webformer: The web-page transformer for structure information extraction
Structure information extraction refers to the task of extracting structured text fields from web
pages, such as extracting a product offer from a shop** page including product title …
pages, such as extracting a product offer from a shop** page including product title …
NAS-BERT: Task-agnostic and adaptive-size BERT compression with neural architecture search
While pre-trained language models (eg, BERT) have achieved impressive results on
different natural language processing tasks, they have large numbers of parameters and …
different natural language processing tasks, they have large numbers of parameters and …
Spatial dependency parsing for semi-structured document information extraction
Information Extraction (IE) for semi-structured document images is often approached as a
sequence tagging problem by classifying each recognized input token into one of the IOB …
sequence tagging problem by classifying each recognized input token into one of the IOB …
Markuplm: Pre-training of text and markup language for visually-rich document understanding
Multimodal pre-training with text, layout, and image has made significant progress for
Visually Rich Document Understanding (VRDU), especially the fixed-layout documents such …
Visually Rich Document Understanding (VRDU), especially the fixed-layout documents such …
Data extraction via semantic regular expression synthesis
Many data extraction tasks of practical relevance require not only syntactic pattern matching
but also semantic reasoning about the content of the underlying text. While regular …
but also semantic reasoning about the content of the underlying text. While regular …
Dom-lm: Learning generalizable representations for html documents
HTML documents are an important medium for disseminating information on the Web for
human consumption. An HTML document presents information in multiple text formats …
human consumption. An HTML document presents information in multiple text formats …
Simplified dom trees for transferable attribute extraction from the web
There has been a steady need to precisely extract structured knowledge from the web (ie
HTML documents). Given a web page, extracting a structured object along with various …
HTML documents). Given a web page, extracting a structured object along with various …
Web question answering with neurosymbolic program synthesis
In this paper, we propose a new technique based on program synthesis for extracting
information from webpages. Given a natural language query and a few labeled webpages …
information from webpages. Given a natural language query and a few labeled webpages …
WIERT: web information extraction via render tree
Web information extraction (WIE) is a fundamental problem in web document understanding,
with a significant impact on various applications. Visual information plays a crucial role in …
with a significant impact on various applications. Visual information plays a crucial role in …