2024-01-16: Paper summary: Archival HTTP Redirection Retrieval Policies
Figure 1: URI-R and URI-M HTTP redirection relationship cases. Figure 2 from AlSum et al. Redirection of web pages refers to the process of forwarding a user from one URI (Uniform Resource Identifier) to another. This can happen for various reasons, and it is a common practice on the web. Redirection is implemented using HTTP status codes, particularly those in the 3xx range . For example, status code 301 is a permanent redirect, and status code 302 is a temporary redirect. In these scenarios the clients (web browsers) automatically direct users to the location as stated in the location response header. In this blog post I summarize the paper titled " Archival HTTP redirection retrieval policies " by AlSum et al. . In web archives, mementos (archived web pages) can have archived redirects (i.e., URI-M (URI of the memento) with a 3xx HTTP status code) where the URI-R (URI of the original resource) returns a redirection status code at crawl time. Figure 2 shows the calend...