A literature review and existing challenges on software logging practices: From the creation to the analysis of software logs

MA Batoun, M Sayagh, R Aghili, A Ouni, H Li - Empirical Software …, 2024 - Springer
Software logging is the practice of recording different events and activities that occur within a
software system, which are useful for different activities such as failure prediction and …

Loghub: A large collection of system log datasets for ai-driven log analytics

J Zhu, S He, P He, J Liu, MR Lyu - 2023 IEEE 34th International …, 2023 - ieeexplore.ieee.org
Logs have been widely adopted in software system development and maintenance because
of the rich runtime information they record. In recent years, the increase of software size and …

{μSlope}: High Compression and Fast Search on {Semi-Structured} Logs

R Wang, D Gibson, K Rodrigues, Y Luo… - … USENIX Symposium on …, 2024 - usenix.org
Internet-scale services can produce a large amount of logs. Such logs are increasingly
appearing in semi-structured formats such as JSON. At Uber, the amount of semi-structured …

Logshrink: Effective log compression by leveraging commonality and variability of log data

X Li, H Zhang, VH Le, P Chen - Proceedings of the 46th IEEE/ACM …, 2024 - dl.acm.org
Log data is a crucial resource for recording system events and states during system
execution. However, as systems grow in scale, log data generation has become increasingly …

Unlocking the Power of Numbers: Log Compression via Numeric Token Parsing

S Yu, Y Wu, Y Li, P He - Proceedings of the 39th IEEE/ACM International …, 2024 - dl.acm.org
Parser-based log compressors have been widely explored in recent years because the
explosive growth of log volumes makes the compression performance of general-purpose …

Partition, Don't Sort! Compression Boosters for Cloud Data Ingestion Pipelines

P Hansert, S Michel - Proceedings of the VLDB Endowment, 2024 - dl.acm.org
Data Lakes deployed in the cloud are a go-to solution for enterprise data storage. While the
pay-as-you-go cost model allows flexible resource allocation and billing, it mandates an …

Mint: Cost-Efficient Tracing with All Requests Collection via Commonality and Variability Analysis

H Huang, C Chen, K Chen, P Chen, G Yu, Z He… - arxiv preprint arxiv …, 2024 - arxiv.org
Distributed traces contain valuable information but are often massive in volume, posing a
core challenge in tracing framework design: balancing the tradeoff between preserving …

Exploiting Data-pattern-aware Vertical Partitioning to Achieve Fast and Low-cost Cloud Log Storage

J Wei, G Zhang, J Chen, Y Wang, W Zheng… - ACM Transactions on …, 2024 - dl.acm.org
Cloud logs can be categorized into on-line, off-line, and near-line logs based on the access
frequency. Among them, near-line logs are mainly used for debugging, which means they …

COPR--Efficient, large-scale log storage and retrieval

J Reichinger, T Krismayer, J Rellermeyer - arxiv preprint arxiv …, 2024 - arxiv.org
Modern, large scale monitoring systems have to process and store vast amounts of log data
in near real-time. At query time the systems have to find relevant logs based on the content …

ESTELLE: An Efficient and Cost-effective Cloud Log Engine

Y Zhang, G Cong, J Qu, R Xu, Y Fu, W Li, F Hu… - Companion of the 2024 …, 2024 - dl.acm.org
With the advancement of cloud computing, more and more enterprises are adopting cloud
services to build a variety of applications. Monitoring and observability are integral to the …