Accelerating SPARQL queries by exploiting hash-based locality and adaptive partitioning

R Harbi, I Abdelaziz, P Kalnis, N Mamoulis, Y Ebrahim… - The VLDB Journal, 2016 - Springer
State-of-the-art distributed RDF systems partition data across multiple computer nodes
(workers). Some systems perform cheap hash partitioning, which may result in expensive …

Ontology summarization: Graph-based methods and beyond

S Pouriyeh, M Allahyari, Q Liu, G Cheng… - … Journal of Semantic …, 2019 - World Scientific
Ontologies have been widely used in numerous and varied applications, eg to support data
modeling, information integration, and knowledge management. With the increasing size of …

PCSG: pattern-coverage snippet generation for RDF datasets

X Wang, G Cheng, T Lin, J Xu, JZ Pan… - The Semantic Web …, 2021 - Springer
For reusing an RDF dataset, understanding its content is a prerequisite. To support the
comprehension of its large and complex structure, existing methods mainly generate an …

GSP (Geo-Semantic-Parsing): geoparsing and geotagging with machine learning on top of linked data

M Avvenuti, S Cresci, L Nizzoli, M Tesconi - European Semantic Web …, 2018 - Springer
Recently, user-generated content in social media opened up new alluring possibilities for
understanding the geospatial aspects of many real-world phenomena. Yet, the vast majority …

A Survey on Extractive Knowledge Graph Summarization: Applications, Approaches, Evaluation, and Future Directions

X Wang, G Cheng - arxiv preprint arxiv:2402.12001, 2024 - arxiv.org
With the continuous growth of large Knowledge Graphs (KGs), extractive KG summarization
becomes a trending task. Aiming at distilling a compact subgraph with condensed …

BANDAR: benchmarking snippet generation algorithms for (RDF) dataset search

X Wang, G Cheng, JZ Pan… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The large volume of open data on the Web is expected to be reused and create value.
Finding the right data to reuse is a non-trivial task addressed by the recent dataset search …

WORQ: workload-driven RDF query processing

A Madkour, AM Aly, WG Aref - The Semantic Web–ISWC 2018: 17th …, 2018 - Springer
Cloud-based systems provide a rich platform for managing large-scale RDF data. However,
the distributed nature of these systems introduces several performance challenges, eg, disk …

Evaluating SPARQL queries on massive RDF datasets

R Al-Harbi, I Abdelaziz, P Kalnis, N Mamoulis - 2015 - repository.kaust.edu.sa
Distributed RDF systems partition data across multiple computer nodes. Partitioning is
typically based on heuristics that minimize inter-node communication and it is performed in …

Combining vertex-centric graph processing with SPARQL for large-scale RDF data analytics

I Abdelaziz, R Harbi, S Salihoglu… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Modern applications require sophisticated analytics on RDF graphs that combine structural
queries with generic graph computations. Existing systems support either declarative …

Scale-out processing of large RDF datasets

L Cheng, S Kotoulas - IEEE Transactions on Big Data, 2015 - ieeexplore.ieee.org
Distributed RDF data management systems become increasingly important with the growth
of the Semantic Web. Regardless, current methods meet performance bottlenecks either on …