Survey of vector database management systems

JJ Pan, J Wang, G Li - The VLDB Journal, 2024 - Springer
There are now over 20 commercial vector database management systems (VDBMSs), all
produced within the past five years. But embedding-based retrieval has been studied for …

Optimizing LSM-based indexes for disaggregated memory

R Wang, C Gao, J Wang, P Kadam, M TamerÖzsu… - The VLDB Journal, 2024 - Springer
The emerging trend of memory disaggregation where CPU and memory are physically
separated from each other and are connected via ultra-fast networking, eg, over Remote …

CaaS-LSM: compaction-as-a-service for LSM-based key-value stores in storage disaggregated infrastructure

Q Yu, C Guo, J Zhuang, V Thakkar, J Wang… - Proceedings of the ACM …, 2024 - dl.acm.org
Optimizing LSM-based Key-Value Stores (LSM-KVS) for disaggregated storage is essential
to achieve better resource utilization, performance, and flexibility. Most of the existing studies …

TDSQL: Tencent Distributed Database System

Y Chen, A Pan, H Lei, A Ye, S Han, Y Tang… - Proceedings of the …, 2024 - dl.acm.org
Distributed databases have become indispensable in contemporary computing and data
processing, owing to their pivotal role in ensuring high availability and scalability. They …

A CXL-powered database system: Opportunities and challenges

Y Guo, G Li - 2024 IEEE 40th International Conference on Data …, 2024 - ieeexplore.ieee.org
Compute Express Link (CXL) is emerging as a significant player in the landscape of modern
database man-agement systems (DBMS). CXL is an open industry-standard interconnect …

Share: Stackelberg-Nash based Data Markets

Y Bi, J Liu, C Zhao, J Zhao, K Ren… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
With the prevalence of data-driven intelligence, data markets with various data products are
gaining considerable interest as a promising paradigm for commoditizing data and …

Understanding the performance implications of the design principles in storage-disaggregated databases

X Pang, J Wang - Proceedings of the ACM on Management of Data, 2024 - dl.acm.org
Storage-compute disaggregation has recently emerged as a novel architecture in modern
data centers, particularly in the cloud. By decoupling compute from storage, this new …

Vector Database Management Techniques and Systems

JJ Pan, J Wang, G Li - Companion of the 2024 International Conference …, 2024 - dl.acm.org
Feature vectors are now mission-critical for many applications, including retrieval-based
large language models (LLMs). Traditional database management systems are not …

SELCC: Coherent Caching over Compute-Limited Disaggregated Memory

R Wang, J Wang, WG Aref - arxiv preprint arxiv:2409.02088, 2024 - arxiv.org
Disaggregating memory from compute offers the opportunity to better utilize stranded
memory in data centers. It is important to cache data in the compute nodes and maintain …

SIMDified Data Processing-Foundations, Abstraction, and Advanced Techniques

D Habich, J Pietrzyk - Companion of the 2024 International Conference …, 2024 - dl.acm.org
Query execution techniques in database systems are constantly adapting to novel hardware
features in order to improve query performance, in particular for analytical queries. In the last …