Comparing the use of Bayesian networks and neural networks in response time modeling for service-oriented systems

R Zhang, AJ Bivens - Proceedings of the 2007 workshop on Service …, 2007 - dl.acm.org
The new paradigm of service-oriented computing facilitates easy construction of dynamic,
complex distributed systems. Recent research has shown that machine learning methods …

Подходы к диагностике согласованности данных в байесовских сетях доверия

АВ Торопова - Информатика и автоматизация, 2015 - mathnet.ru
Байесовские сети доверия предоставляют возможность объединения нескольких видов
информации, например полученной от экспертов или статистически, позволяют …

[BOOK][B] Automatic performance diagnosis and recovery in cloud microservices

L Wu - 2022 - search.proquest.com
Microservices have emerged as a popular pattern for develo** large-scale applications in
cloud environments for its benefts of fexibility, scalability, and agility. A microservices-based …

Job scheduler for distributed systems using pervasive state estimation with modeling of capabilities of compute nodes

S Gupta, C Fritz, J De Kleer - US Patent 9,934,071, 2018 - Google Patents
The following relates generally to computer system efficiency improvements. Broadly,
systems and methods are disclosed that improve efficiency in a cluster of nodes by efficient …

[PDF][PDF] Diagnosing heterogeneous hadoop clusters

S Gupta, C Fritz, J de Kleer… - Workshop on Principles of …, 2012 - cs.toronto.edu
We present a data-driven approach for diagnosing performance issues in heterogeneous
Hadoop clusters. Hadoop is a popular and extremely successful framework for horizontally …

Problem localization using probabilistic dependency analysis for automated system management in ubiquitous computing

S Piao, J Park, E Lee - Internet Research, 2009 - emerald.com
Purpose–This paper seeks to develop an approach to problem localization and an algorithm
to address the issue of determining the dependencies among system metrics for automated …

A state machine approach for problem detection in large-scale distributed system

K Sun, J Qiu, Y Li, Y Chen, W Ji - NOMS 2008-2008 IEEE …, 2008 - ieeexplore.ieee.org
Efficient problem detection methods play an important role in system management. In this
paper, a formal method is described for problem detection in large scale and distributed …

Modeling autonomic recovery in web services with multi-tier reboots

R Zhang - IEEE International Conference on Web Services …, 2007 - ieeexplore.ieee.org
In order to offer adequate guidance to the emerging reboot-based self-healing processes in
Web services, this paper presents probabilistic models to estimate the recovery time and …

[PDF][PDF] Performance diagnosis of services in scalable and dynamic networks

S Tati, P Novotny, BJ Ko, A Wolf, A Swami, T La Porta - System, 2011 - researchgate.net
We propose a novel algorithm to determine the availability or the states of services
considering the dynamics and scale of the networks. Our approach is based on network …

[PDF][PDF] A data mining based approach to reliable distributed systems

M Mock, D Wegener - Proc. of the Second International Workshop …, 2009 - cse.buffalo.edu
The purpose of this paper is to open a novel research perspective on reliable distributed
systems. The underlying hypothesis is that dynamic models of distributed systems can be …