Data-Centric Systems and Applications
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …
Web graph similarity for anomaly detection
Web graphs are approximate snapshots of the web, created by search engines. They are
essential to monitor the evolution of the web and to compute global properties like …
Distribution of centrality measures on undirected random networks via the cavity method
The Katz centrality of a node in a complex network is a measure of the node's importance as
far as the flow of information across the network is concerned. For ensembles of locally tree …
CausalRCA: Causal inference based precise fine-grained root cause localization for microservice applications
Effectively localizing root causes of performance anomalies is crucial to enabling the rapid
recovery and loss mitigation of microservice applications in the cloud. Depending on the …
[BOOK] A course on the web graph
A Bonato - 2008 - books.google.com
"A Course on the Web Graph provides a comprehensive introduction to state-of-the-art
research on the applications of graph theory to real-world networks such as the web graph. It …
Temporal analysis of the wikigraph
Wikipedia is an online encyclopedia, available in more than 100 languages and comprising
over 1 million articles in its English version. If we consider each Wikipedia article as a node …
What is in PageRank? A historical and conceptual investigation of a recursive status index
B Rieder - Computational Culture, 2012 - computationalculture.net
This paper proposes an analysis, based in a software studies mindset, of Google's
PageRank algorithm. It develops two lines of investigation: first, it situates this 'evaluative …
Power law distributions in information retrieval
Several properties of information retrieval (IR) data, such as query frequency or document
length, are widely considered to be approximately distributed as a power law. This common …
In-degree and PageRank: why do they follow similar power laws?
PageRank is a popularity measure designed by Google to rank Web pages. Experiments
confirm that PageRank values obey a power law with the same exponent as In-Degree …
PageRank asymptotics on directed preferential attachment networks
We characterize the tail behavior of the distribution of the PageRank of a uniformly chosen
vertex in a directed preferential attachment graph and show that it decays as a power law …