An overview on XML similarity: Background, current trends and future directions
In recent years, XML has been established as a major means for information management,
and has been broadly utilized for complex data representation (e.g., multimedia objects) …
An overview on XML semantic disambiguation from unstructured text to semi-structured data: Background, applications, and ongoing challenges
J Tekli - IEEE Transactions on Knowledge and Data …, 2016 - ieeexplore.ieee.org
Since the last two decades, XML has gained momentum as the standard for web information
management and complex data representation. Also, collaboratively built semi-structured …
Detecting and characterizing bots that commit code
Background: Some developer activity traditionally performed manually, such as making
code commits, opening, managing, or closing issues is increasingly subject to automation in …
Keyword search over relational databases: a metadata approach
Keyword queries offer a convenient alternative to traditional SQL in querying relational
databases with large, often unknown, schemas and instances. The challenge in answering …
The pq-gram distance between ordered labeled trees
When integrating data from autonomous sources, exact matches of data items that represent
the same real-world object often fail due to a lack of common keys. Yet in many cases …
A novel XML document structure comparison framework based on sub-tree commonalities and label semantics
XML similarity evaluation has become a central issue in the database and information
communities, its applications ranging over document clustering, version control, data …
A Family of LZ78-based Universal Sequential Probability Assignments
We propose and study a family of universal sequential probability assignments on individual
sequences, based on the incremental parsing procedure of the Lempel-Ziv (LZ78) …
Clustering XML documents by patterns
Now that the use of XML is prevalent, methods for mining semi-structured documents have
become even more important. In particular, one of the areas that could greatly benefit from in …
XML clustering: a review of structural approaches
With its presence in data integration, chemistry, biological, and geographic systems,
eXtensible Markup Language (XML) has become an important standard not only in …
X-class: Associative classification of XML documents by structure
The supervised classification of XML documents by structure involves learning predictive
models in which certain structural regularities discriminate the individual document classes …