Distributed data management using MapReduce

F Li, BC Ooi, MT Özsu, S Wu - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
MapReduce is a framework for processing and managing large-scale datasets in a
distributed cluster, which has been used for applications such as generating search indexes …

Data storage management in cloud environments: Taxonomy, survey, and future directions

Y Mansouri, AN Toosi, R Buyya - ACM Computing Surveys (CSUR), 2017 - dl.acm.org
Storage as a Service (StaaS) is a vital component of cloud computing by offering the vision
of a virtually infinite pool of storage resources. It supports a variety of cloud-based data store …

In-memory big data management and processing: A survey

H Zhang, G Chen, BC Ooi, KL Tan… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Growing main memory capacity has fueled the development of in-memory big data
management and processing. By eliminating disk I/O bottleneck, it is now possible to support …

Logmine: Fast pattern recognition for log analytics

H Hamooni, B Debnath, J Xu, H Zhang… - Proceedings of the 25th …, 2016 - dl.acm.org
Modern engineering incorporates smart technologies in all aspects of our lives. Smart
technologies are generating terabytes of log messages every day to report their status. It is …

Optimizing Space Amplification in RocksDB

S Dong, M Callaghan, L Galanis, D Borthakur, T Savor… - CIDR, 2017 - lrita.github.io
RocksDB is an embedded, high-performance, persistent key-value storage engine
developed at Facebook. Much of our current focus in developing and configuring RocksDB …

LSM-trie: An LSM-tree-based Ultra-Large Key-Value Store for Small Data Items

X Wu, Y Xu, Z Shao, S Jiang - 2015 USENIX Annual Technical …, 2015 - usenix.org
Key-value (KV) stores have become a backbone of large-scale applications in today's data
centers. The data set of the store on a single server can grow to billions of KV items or many …

X-Engine: An optimized storage engine for large-scale E-commerce transaction processing

G Huang, X Cheng, J Wang, Y Wang, D He… - Proceedings of the …, 2019 - dl.acm.org
Alibaba runs the largest e-commerce platform in the world serving more than 600 million
customers, with a GMV (gross merchandise value) exceeding USD 768 billion in FY2018 …

Blockchains vs. distributed databases: Dichotomy and fusion

P Ruan, TTA Dinh, D Loghin, M Zhang… - Proceedings of the …, 2021 - dl.acm.org
Blockchain has come a long way: a system that was initially proposed specifically for
cryptocurrencies is now being adapted and adopted as a general-purpose transactional …

Application-Managed Flash

S Lee, M Liu, S Jun, S Xu, J Kim - 14th USENIX Conference on File …, 2016 - usenix.org
In flash storage, an FTL is a complex piece of code that resides completely inside the
storage device and is provided by the manufacturer. Its principal virtue is providing …

ElasticBF: Elastic Bloom Filter with Hotness Awareness for Boosting Read Performance in Large Key-Value Stores

Y Li, C Tian, F Guo, C Li, Y Xu - 2019 USENIX Annual Technical …, 2019 - usenix.org
LSM-tree based key-value (KV) stores suffer from severe read amplification because
searching a key requires to check multiple SSTables. To reduce extra I/Os, Bloom filters are …