Not all mementos are created equal: Measuring the impact of missing resources
Web archives do not always capture every resource on every page that they attempt to
archive. This results in archived pages missing a portion of their embedded resources …
The Internet Archive and the socio-technical construction of historical facts
This article analyses the socio-technical epistemic processes behind the construction of
historical facts by the Internet Archive Wayback Machine (IAWM). Grounded in theoretical …
Know(ing) Infrastructure: The Wayback Machine as object and instrument of digital research
From documenting human rights abuses to studying online advertising, web archives are
increasingly positioned as critical resources for a broad range of scholarly Internet research …
Hardware Heritage—Briefcase-Sized Computers
K Król - Heritage, 2021 - mdpi.com
The computer industry was a vivid place in the 1980s. IT systems and technologies thrived,
and the market offered ever better, smaller, and more useful machines. Innovative technical …
Profiling web archive coverage for top-level domain and content language
The Memento Aggregator currently polls every known public web archive when
serving a request for an archived web page, even though some web archives focus on only …
The Wayback Machine: notes on a re-enchantment
S Bowyer - Archival Science, 2021 - Springer
The Internet Archive's Wayback Machine holds over 424 billion webpages, making
it the largest publicly accessible archive in the world. Thus far, much of the research on the …
"Why do you need 400 photographs of 400 different Lockheed Constellation?": Value Expressions by Contributors and Users of Wikimedia Commons
Understanding the values that collaborators bring to a collaboration is important for the
design of new systems. In collaborative systems understanding differing values could help …
Web archive profiling through CDX summarization
With the proliferation of public web archives, it is becoming more important to better profile
their contents, both to understand their immense holdings as well as to support routing of …
Robots still outnumber humans in web archives, but less than before
To identify robots and humans and analyze their respective access patterns, we used the
Internet Archive's (IA) Wayback Machine access logs from 2012 and 2019, as well as …
Reproducible web corpora: interactive archiving with automatic quality assessment
The evolution of web pages from static HTML pages toward dynamic pieces of software has
rendered archiving them increasingly difficult. Nevertheless, an accurate, reproducible web …