Research directions for principles of data management (Dagstuhl Perspectives Workshop 16151)
The area of Principles of Data Management (PDM) has made crucial contributions to the
development of formal frameworks for understanding and managing data and knowledge …
Schema extraction and structural outlier detection for JSON-based NoSQL data stores
Although most NoSQL Data Stores are schema-less, information on the
structural properties of the persisted data is nevertheless essential during application …
Ensuring the correctness of regular expressions: A review
Regular expressions are widely used within and even outside of computer science due to
their expressiveness and flexibility. However, regular expressions have a quite compact and …
[BOOK][B] Web data management
The Internet and World Wide Web have revolutionized access to information. Users now
store information across multiple platforms from personal computers to smartphones and …
Learning deterministic regular expressions for the inference of schemas from XML data
Inferring an appropriate DTD or XML Schema Definition (XSD) for a given collection of XML
documents essentially reduces to learning deterministic regular expressions from sets of …
A universal approach for multi-model schema inference
P Koupil, S Hricko, I Holubová - Journal of Big Data, 2022 - Springer
The variety feature of Big Data, represented by multi-model data, has brought a new
dimension of complexity to all aspects of data management. The need to process a set of …
Complexity and Expressiveness of ShEx for RDF
We study the expressiveness and complexity of Shape Expression Schema (ShEx), a novel
schema formalism for RDF currently under development by W3C. A ShEx assigns types to …
Enabling information extraction by inference of regular expressions from sample entities
F Brauer, R Rieger, A Mocan… - Proceedings of the 20th …, 2011 - dl.acm.org
Regular expressions are the dominant technique to extract business relevant entities (e.g.,
invoice numbers or product names) from text data (e.g., invoices), since these entity types …
SemMT: a semantic-based testing approach for machine translation systems
Machine translation has wide applications in daily life. In mission-critical applications such
as translating official documents, incorrect translation can have unpleasant or sometimes …
InfeRE: Step-by-Step Regex Generation via Chain of Inference
Automatically generating regular expressions (abbrev. regexes) from natural language
description (NL2RE) has been an emerging research area. Prior studies treat regex as a …