Cockroachdb: The resilient geo-distributed sql database
We live in an increasingly interconnected world, with many organizations operating across
countries or even continents. To serve their global user base, organizations are replacing …
countries or even continents. To serve their global user base, organizations are replacing …
A Survey on the Integration of Blockchains and Databases
The success of blockchain technology in cryptocurrencies reveals its potential in the data
management field. Recently, there is a trend in the database community to integrate …
management field. Recently, there is a trend in the database community to integrate …
Multi-tenant cloud data services: state-of-the-art, challenges and opportunities
Enterprises are moving their business-critical workloads to public clouds at an accelerating
pace. Multi-tenancy is a crucial tenet for cloud data service providers allowing them to …
pace. Multi-tenancy is a crucial tenet for cloud data service providers allowing them to …
Myrocks: Lsm-tree database storage engine serving facebook's social graph
Y Matsunobu, S Dong, H Lee - Proceedings of the VLDB Endowment, 2020 - dl.acm.org
Facebook uses MySQL to manage tens of petabytes of data in its main database named the
User Database (UDB). UDB serves social activities such as likes, comments, and shares. In …
User Database (UDB). UDB serves social activities such as likes, comments, and shares. In …
Profiling hyperscale big data processing
Computing demand continues to grow exponentially, largely driven by" big data" processing
on hyperscale data stores. At the same time, the slowdown in Moore's law is leading the …
on hyperscale data stores. At the same time, the slowdown in Moore's law is leading the …
Dremel: A decade of interactive SQL analysis at web scale
S Melnik, A Gubarev, JJ Long, G Romer… - Proceedings of the …, 2020 - dl.acm.org
Google's Dremel was one of the first systems that combined a set of architectural principles
that have become a common practice in today's cloud-native analytics tools, including …
that have become a common practice in today's cloud-native analytics tools, including …
Optimizing data-intensive systems in disaggregated data centers with teleport
Recent proposals for the disaggregation of compute, memory, storage, and accelerators in
data centers promise substantial operational benefits. Unfortunately, for resources like …
data centers promise substantial operational benefits. Unfortunately, for resources like …
What Goes Around Comes Around... And Around...
M Stonebraker, A Pavlo - ACM Sigmod Record, 2024 - dl.acm.org
Two decades ago, one of us co-authored a paper commenting on the previous 40 years of
data modelling research and development [188]. That paper demonstrated that the relational …
data modelling research and development [188]. That paper demonstrated that the relational …
OceanBase: a 707 million tpmC distributed relational database system
We have designed and developed OceanBase, a distributed relational database system
from the very basics for a decade. Being a scale-out multi-tenant system, OceanBase is …
from the very basics for a decade. Being a scale-out multi-tenant system, OceanBase is …
A survey on geographically distributed big-data processing using MapReduce
S Dolev, P Florissi, E Gudes… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Hadoop and Spark are widely used distributed processing frameworks for large-scale data
processing in an efficient and fault-tolerant manner on private or public clouds. These big …
processing in an efficient and fault-tolerant manner on private or public clouds. These big …