Multi-model databases: a new journey to handle the variety of data

J Lu, I Holubová - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
The variety of data is one of the most challenging issues for the research and practice in
data management systems. The data are naturally organized in different formats and …

Check out the big brain on BRAD: simplifying cloud data processing with learned automated data meshes

T Kraska, T Li, S Madden, M Markakis, A Ngom… - Proceedings of the …, 2023 - dl.acm.org
The last decade of database research has led to the prevalence of specialized systems for
different workloads. Consequently, organizations often rely on a combination of specialized …

Querying large language models with SQL

M Saeed, N De Cao, P Papotti - arxiv preprint arxiv:2304.00472, 2023 - arxiv.org
In many use-cases, information is stored in text but not available in structured data.
However, extracting data from natural language text to precisely fit a schema, and thus …

Handling iterations in distributed dataflow systems

GE Gévay, J Soto, V Markl - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Over the past decade, distributed dataflow systems (DDS) have become a standard
technology. In these systems, users write programs in restricted dataflow programming …

Babelfish: Efficient execution of polyglot queries

PM Grulich, S Zeuch, V Markl - Proceedings of the VLDB Endowment, 2021 - dl.acm.org
Today's users of data processing systems come from different domains, have different levels
of expertise, and prefer different programming languages. As a result, analytical workload …

The metaverse data deluge: What can we do about it?

BC Ooi, G Chen, MZ Shou, KL Tan… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
In the metaverse the physical space and the virtual space co-exist, and interact
simultaneously. While the physical space is virtually enhanced with information, the virtual …

On-demand state separation for cloud data warehousing

C Winter, J Giceva, T Neumann, A Kemper - Proceedings of the VLDB …, 2022 - dl.acm.org
Moving data analysis and processing to the cloud is no longer reserved for a few companies
with petabytes of data. Instead, the flexibility of on-demand resources is attracting an …

Expand your training limits! generating training data for ml-based data management

F Ventura, Z Kaoudi, JA Quiané-Ruiz… - Proceedings of the 2021 …, 2021 - dl.acm.org
Machine Learning (ML) is quickly becoming a prominent method in many data management
components, including query optimizers which have recently shown very promising results …

Agora: Bringing together datasets, algorithms, models and more in a unified ecosystem [vision]

J Traub, Z Kaoudi, JA Quiané-Ruiz, V Markl - ACM SIGMOD Record, 2021 - dl.acm.org
Data science and artificial intelligence are driven by a plethora of diverse data-related
assets, including datasets, data streams, algorithms, processing software, compute …

In-situ cross-database query processing

H Gavriilidis, K Beedkar… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
Today's organizations utilize a plethora of heterogeneous and autonomous DBMSes, many
of those being spread across different geo-locations. It is therefore crucial to have effective …