Tools and benchmarks for automated log parsing
Logs are imperative in the development and maintenance process of many software
systems. They record detailed runtime information that allows developers and support …
systems. They record detailed runtime information that allows developers and support …
Towards the use of the readily available tests from the release pipeline as performance tests: Are we there yet?
Performance is one of the important aspects of software quality. Performance issues exist
widely in software systems, and the process of fixing the performance issues is an essential …
widely in software systems, and the process of fixing the performance issues is an essential …
A comprehensive survey of logging in software: From logging statements automation to log mining and analysis
S Gholamian, PAS Ward - arxiv preprint arxiv:2110.12489, 2021 - arxiv.org
Logs are widely used to record runtime information of software systems, such as the
timestamp and the importance of an event, the unique ID of the source of the log, and a part …
timestamp and the importance of an event, the unique ID of the source of the log, and a part …
An empirical investigation of incident triage for online service systems
Online service systems have become increasingly popular. During operation of an online
service system, incidents (unplanned interruptions or outages of the service) are inevitable …
service system, incidents (unplanned interruptions or outages of the service) are inevitable …
How incidental are the incidents? characterizing and prioritizing incidents for large-scale online service systems
Although tremendous efforts have been devoted to the quality assurance of online service
systems, in reality, these systems still come across many incidents (ie, unplanned …
systems, in reality, these systems still come across many incidents (ie, unplanned …
Predicting node failures in an ultra-large-scale cloud computing platform: an aiops solution
Many software services today are hosted on cloud computing platforms, such as Amazon
EC2, due to many benefits like reduced operational costs. However, node failures in these …
EC2, due to many benefits like reduced operational costs. However, node failures in these …
Logzip: Extracting hidden structures via iterative clustering for log compression
System logs record detailed runtime information of software systems and are used as the
main data source for many tasks around software engineering. As modern software systems …
main data source for many tasks around software engineering. As modern software systems …
How to mitigate the incident? an effective troubleshooting guide recommendation technique for online service systems
In recent years, more and more traditional shrink-wrapped software is provided as 7x24
online services. Incidents (events that lead to service disruptions or outages) could affect …
online services. Incidents (events that lead to service disruptions or outages) could affect …
An exploratory study of performance regression introducing code changes
Performance is an important aspect of software quality. In fact, large software systems
failures are often due to performance issues rather than functional bugs. One of the most …
failures are often due to performance issues rather than functional bugs. One of the most …
An empirical study of the impact of data splitting decisions on the performance of AIOps solutions
AIOps (Artificial Intelligence for IT Operations) leverages machine learning models to help
practitioners handle the massive data produced during the operations of large-scale …
practitioners handle the massive data produced during the operations of large-scale …