2010-11-05: Memento-Datetime is not Last-Modified
data:image/s3,"s3://crabby-images/59bb7/59bb73ab8c29dedc03aad22dbff62c101432ce60" alt=""
Rather than formally explain what we mean by "sticky, intended datetime", it is easier to explain how it is neither the value in the HTTP response header Last-Modified, nor is it the creation date of the resource (which has no corresponding HTTP header, for reasons that will become clear). For the examples below, we'll define the following abbreviations:
- CD (Creation-Datetime) = the datetime the resource was created
- MD (Memento-Datetime) = the datetime the representation was observed on the web
- LM (Last-Modified) = the datetime the resource last changed state
data:image/s3,"s3://crabby-images/fa82d/fa82d64eaad24805876d29b927f47abe634148c0" alt=""
http://wayback.archive-it.org/927/*/http://www.nyu.edu/fas/projects/vcb/case_911_FLASHcontent.html
The index page has a link to a single Memento. For simplicity, we'll assume Archive-It.org created this index page and the Memento it references at the moment of the crawl, thus the various datetimes of the Memento would all be equal:
Creation-Datetime: Wed, 05 Mar 2008 20:16:49 GMT
Memento-Datetime: Wed, 05 Mar 2008 20:16:49 GMT
Last-Modified: Wed, 05 Mar 2008 20:16:49 GMT
Case 2: CD == MD < LM
data:image/s3,"s3://crabby-images/4348e/4348ea39e71b27225e5480ea5552a40bee9bd9ee" alt=""
Now pretend the wording of the banner needs to be changed (for example, to address a new legal requirement). The CD and MD of the Memento are unchanged, but the LM must reflect when the wording of the banner changed:
Creation-Datetime: Wed, 05 Mar 2008 20:16:49 GMT
Memento-Datetime: Wed, 05 Mar 2008 20:16:49 GMT
Last-Modified: Fri, 05 Nov 2010 23:25:19 GMT
Both your lawyer and your HTTP cache consider this an important change, so you have to update LM. But it also clear that the essence of March 2008 observation of the Memento by Archive-It.org is unchanged by the wording change of the archive banner, so MD is not updated. And certainly the CD is unchanged by this modification.
Case 3: MD < CD <= LM
data:image/s3,"s3://crabby-images/5848e/5848e10fa711bea09c93c873a0b3d5b2f45acf96" alt=""
http://wayback.archive-it.org/927/20080305201649/http://www.nyu.edu/fas/projects/vcb/case_911_FLASHcontent.html
to:
http://archive.example.org/20101105232519/http://www.nyu.edu/fas/projects/vcb/case_911_FLASHcontent.html
The presence of the Memento-Datetime header from Archive-It.org indicates that the resource is an encapsulation of the state of another resource, at the MD datetime value. The link between the Memento and the original resource is indicated with an HTTP Link response header:
Link: rel="original"; <http://www.nyu.edu/fas/projects/vcb/case_911_flashcontent.html>
Thus, MD is sticky in that the new Memento at example.org retains the MD value it observed from Archive-It.org. However, the CD and LM values reflect the datetime relative to example.org:
Creation-Datetime: Fri, 05 Nov 2010 23:25:19 GMT
Memento-Datetime: Wed, 05 Mar 2008 20:16:49 GMT
Last-Modified: Fri, 05 Nov 2010 23:25:19 GMT
The MD and LM datetimes can also vary for the example.org Memento as described in Case 2. (In the unlikely case that the intent of example.org was to create an archive of how resources were archived, the MD could be reset to 05 Nov 2010 and the Link header would point to the Archive-It.org resource as the original resource instead of the nyu.edu resource; however, this is not the point of this discussion.)
Case 4: CD < MD <= LM
data:image/s3,"s3://crabby-images/2d4e6/2d4e6bdbd15a773ad1c715ceef869a31d3028d9d" alt=""
For example, a transactional archive might record as CD the first datetime in which a resource returns a 200 response, but might choose to delay archiving Mementos until the resource's state is something other than "Welcome to Apache". In this scenario, you could have:
Creation-Datetime: Wed, 05 Mar 2008 20:16:49 GMT
Memento-Datetime: Fri, 05 Nov 2010 23:25:19 GMT
Last-Modified: Fri, 05 Nov 2010 23:25:19 GMT
The MD and LM datetimes could also vary as described in Case 2.
Creation Datetime Is Often Unavailable
To illustrate the differences between the various datetime concepts, the above examples have discussed Creation Datetime as if it is a commonly available value. However, this is most often not the case -- in fact, there is no defined HTTP response header that corresponds to Creation Datetime. This is due to the historical limitation of Unix inodes (i.e., metadata for files), which track three notions of time: atime (access time of the file), mtime (modification time of the file), and ctime (modification time of the inode). Modern content management systems might keep track of Creation Datetime, but it is not formally defined at the HTTP level.
Summary
The above examples should provide illustrations of how the three notions of datetime, although obviously related, have slightly different semantics. It should be clear that a Memento's Memento-Datetime is also not just Creation-Datetime or Last-Modified inherited from the original resource for which it is a Memento. Rather than overload an existing HTTP response header (such as Last-Modified), we have introduced the Memento-Datetime (nee Content-Datetime) response header. Additional information about Memento headers, Link rel types, and HTTP interactions can be found at mementoweb.org.
-- Michael
* Datetime = neologism of "date" & "time": the former is often understood to have a granularity of days, and the latter a granularity of seconds.
Hola Michael gracias por este blog. Necesito ordenar registros por fecha pero el unico formato que me permite hacerlo es YYYY-MM-DD ya que los demás al no tener el cero delante de los dÃgitos únicos generan un error. Quedando por ejemplo 1-3-23, 11- 4-23, 2-3-23, 22-2-23 y asà sucesivamente. Cuál podrÃa ser la solución dado que necesito ver delante los dÃas (01--06-23, o mejor 3-5-23)
ReplyDeleteHello Michael, thanks for this blog. I need to order records by date but the only format that allows me to do so is YYYY-MM-DD since the others, since they do not have a zero in front of the unique digits, generate an error. Remaining for example 1-3-23, 11-4-23, 2-3-23, 22-2-23 and so on. What could be the solution since I need to see in front the days (01-06-23, or better 3-5-23)?
ReplyDeleteHi Cristian: I'm not sure I have enough information to answer this, but surely there is a library in whatever language you're developing in to transform the dates into YYYY-MM-DD format (ISO 8601). Don't try to write it yourself -- there's surely a library.
Delete