2023-05-24: IIPC Web Archiving Conference (WAC) Trip Report
This year's International Internet Preservation Consortium (IIPC) Web Archiving Conference (WAC) took place in Hilversum, The Netherlands at The Netherlands Institute of Sound and Vision. It was the first in-person event since 2019 and the 20th anniversary of IIPC! The program offered between two and three tracks for attendees to choose from, so this trip report will give a summary of the sessions I was able to attend. For more information on the other sessions, check out the full conference schedule and the official hashtag (#IIPCWAC23).
Day One
✨ Super excited to follow the #IIPCWAC23 @NetPreserve organised by our colleagues @BeeldenGeluid @benglabs @KB_Nederland!👏🎉
— Camille Françoise @CMFrancoise@mastodon.social (@CMFrancoise) May 11, 2023
💡Starting now! 🎉 Our director, Eppo asking AI how to start his introductory speech! Don't forget to quote @johanoomen 🎉 pic.twitter.com/wufmNjrWUb
Keynote
KEYNOTE: Eliot Higgins, Bellingcat - 'Open Source Investigation Comes Of Age', Introduced and chaired by Johan Oomen, Sound & Vision.https://t.co/iHCyos1acl#IIPCWAC23 #IIPC20Years #WebArchiving pic.twitter.com/gdsWcv2ajx
— UK Web Archive (@UKWebArchive) May 11, 2023
Session #1: Research & Access
“If you can’t access data, you’re not going to use it”
— Emily Escamilla (@EmilyEscamilla_) May 11, 2023
To kick off Session 1, @SamVFritz is presenting @unleasharchives work to make archival data accessible to researchers pic.twitter.com/jd12zBfFbi
Good morning #Hilversum! Presenting @ #IIPCWAC23 this morning to talk about @unleasharchives Cohort Program & opportunities to support #acess #use #engagment w/ #webarchives
— Samantha Fritz (@SamVFritz) May 11, 2023
Slides: https://t.co/zdraeELc0E
C1 Projects: https://t.co/ns8VijWFQW
C2 Projects: https://t.co/3bMM3g9Vki pic.twitter.com/vxOEVXghAT
Leontien Talboom (@makethecatwise) and Mark Simon Haydn presented "Research-Ready Collections: Challenges and Opportunities in Making Web Archive Material Accessible", their work with the Archive of Tomorrow Project. The project worked to curate a collection of 10k targets relating to health in the UK Web Archive (@UKWebArchive) and to explore ethical collection from the Web and responsible republishing. Legal limitations remain a significant barrier, but the project was about to achieve an increase from 1% to 8% of archives sites being publicly accessible!
Leontien Talboom (@theUL) & Mark Haydn (@natlibscot) gave an overview of the Archive of Tomorrow project - 'Research-Ready' Collections: Challenges & Opportunities in Making Web Archive Material Accessible' at #IIPCWAC23 https://t.co/nDyB3OC8oc#IIPC20YEARS #UKLegalDeposit pic.twitter.com/FJRCwwtHCB
— UK Web Archive (@UKWebArchive) May 11, 2023
Jennifer Morival (University of Lille), Sara Aubry (@saraaubry, BnF), and Dorothée Benhamou-Suesser (BnF) presented "Developing New Academic Uses of Web Archives Collections: Challenges and Sessions Learned from the Experimental Service Deployed at the University of Lille During the ResPaDon Project". BnF has worked with the University of Lille to allow full access to BnF holdings at the University of Lille. They also shared their experiences and lessons learned in helping researchers leverage the BnF's holdings through tools, datasets, and trained mediators.
How can archived enable researchers to access and use their holdings through tools and documentation?
— Emily Escamilla (@EmilyEscamilla_) May 11, 2023
Dorothée Benhamou-Suesser, Jennifer Morival, and @saraaubry from @DLwebBnF closed Session 1 with a discussion of their work on @Respadon_Projet to do just that#IIPCWAC23 pic.twitter.com/s8lf90JmDD
Session #3: Panel: Supporting Digital Scholarship
In a few minutes, @sp_meta @talya_cooper @mart1nkle1n and I are presenting a panel on Institutional Web Archiving Initiatives
— Emily Escamilla (@EmilyEscamilla_) May 11, 2023
to Support Digital Scholarship #IIPCWAC2023
Slides: https://t.co/nHjjIWYgRe
Honored to moderate the panel on institutional web archiving initiatives to support digital scholarship w/ @EmilyEscamilla_ @talya_cooper, and @sp_meta. Missing @VickyRampin #iipcwac23 pic.twitter.com/toz18PrpXa
— Martin Klein (@mart1nkle1n) May 11, 2023
Session #6: Social Media & Playback: Collaborative Approaches
And here it starts! 🎉@LN_ist from @meemoo_be and @KatrienWeyns #KADOC are introducing their work around archiving social media in Flemish Cultural or Private Archives 👏✨ pic.twitter.com/9kClKhICv0
— Camille Françoise @CMFrancoise@mastodon.social (@CMFrancoise) May 11, 2023
It is no secret that archiving social media presents unique and complex challenges. Zefi Kavvadia (@ZKavvadia), Katrien Weyns (@KatrienWeyns), Mirjam Schaap (@mrjmschaap), and Sophie Ham (@Sophies_posts) presented "Searching for a Little Help from My Friends: Reporting on the Efforts to Create an (Inter)national Distributed Collaborative Social Media Archiving Structure". They called for better collaboration between archives, institutions, and nations to tackle the complex challenges of archiving social media and the need for an improved legal policy to facilitate archiving social media as cultural heritage. They presented the results of a survey they conducted to gauge interest and challenges from potential collaborators.
@ZKavvadia @KatrienWeyns Mirjam Schaap & Sophie Ham on Searching for a little Help from My Friends : reporting on the efforts to create an (Inter)national Distributed collaborative Social media Archiving Structure. #iipcwzc23 pic.twitter.com/N9gLPV9GER
— Camille Françoise @CMFrancoise@mastodon.social (@CMFrancoise) May 11, 2023
Clare Stanton (@clare__stanton) from Harvard's Library Innovation Lab (@harvardlil) and Perma.cc (@permacc) presented "Collaborating on the Cutting Edge: Client Side Playback". They created WACZ-Exhibotor, a wrapper for web recorder's replay tool that shifts the burden of upkeep to a browser and away from the institution's servers. Clare presented the process of creating a working prototype for the #MeToo Project with Schlesinger Library and creating tools to make the process easy to replicate for others.
In Session 6, @clare__stanton from @permacc presented their work creating a Client-Side Playback. This tool enables non-software developers to integrate replay from WACZ directly into your website!
— Emily Escamilla (@EmilyEscamilla_) May 11, 2023
Check out their working prototype https://t.co/nXey7DmpZ9 pic.twitter.com/DX4uS60ggU
Session #7: Collaborations & Outreach
The "word of the day" award goes to @IngeRudomino for "HAWathon" - an effort to engage high schoolers in #webarchiving with HAW:https://t.co/2G7iOHPlOQ #IIPCWAC2023 pic.twitter.com/CPsmarZs4b
— Martin Klein (@mart1nkle1n) May 11, 2023
Session #10: Lightning & Drop-In Talks
“How do we preserve the past in a violent present for an uncertain future?”
— Emily Escamilla (@EmilyEscamilla_) May 11, 2023
We had lightning talks to close Day 1 of #IIPCWAC23. @helveticade and Benjamin Royer from @ndc_org talked about Memory in Uncertainty
Report: https://t.co/Rbbdf31Knv
Slides: https://t.co/dOcu8pyGu1 pic.twitter.com/wGRTx5HmVK
Day Two
Workshop #4
To kick off Day 2 of #iipcWAC23, @IlyaKreymer @AndersKlindt and @anjacks0n ran a workshop on Browsertrix Based Crawling. Participants were able to start their own crawls and ask questions. It can even run behind login pages
— Emily Escamilla (@EmilyEscamilla_) May 12, 2023
Collectively we ran 30 crawls. Really cool workshop! pic.twitter.com/oJeNh9yEfK
Session #12: Domain Crawls
Thoroughly investigating -good & bad- link rot @LosAlamosNatLab by @mart1nkle1n #IIPCWAC23 #netpreserve pic.twitter.com/Gp2aQUyd6j
— KB NL research (@KBNLresearch) May 12, 2023
Session #13: Crawling, Playback, Sustainability
Session #15: Data Considerations
What if GitHub disappeared tomorrow? How can we use existing digital libraries to find repositories? What percentage of scholarly code repositories would disappear forever?
— Emily Escamilla (@EmilyEscamilla_) May 17, 2023
I presented the answers to these questions at #IIPCWAC23: https://t.co/Sbk6VTuTd0#WebArchiveWednesday
Eld Zierau (@EldZierau) from the Royal Danish Library presented "Web Archives and FAIR Data: Exploring the Challenges for Research Data Management (RDM)", an overview of the WARCnet project. They presented the results of their semi-structured interviews on the Research Data Management (RDM) practices of those who engage in the Web Archiving Lifecycle (WAL). They specifically focused on FAIR principles (findable, accessible, interoperable, and reusable).
In Session 15, @EldZierau from the Royal Danish Library presented their work with WARCnet and research data management for Web archive studies
— Emily Escamilla (@EmilyEscamilla_) May 12, 2023
She referenced lots of great work from @WebSciDL @maturban1 @shawnmjones !#iipcWAC23 pic.twitter.com/7lzWxmMbin
Mark Phillips (@vphill) from the University of North Texas presented "Lessons Learned in Hosting the End of Term Web Archive in the Cloud". The End of Term Web Archive (@eotarchive) is to document the transition in the Executive Branch of the United States by archiving federal government Web pages before and after each election cycle. They have captured the 2008, 2012, 2016, and 2020 transitions with the help of multiple institutions include the University of North Texas and the Internet Archive. They recently moved the collections to Amazon S3 to allow for greater access and computational consumption of the collection.
To wrap up the session, @vphill from @UNT_Libraries presented their work with @ibnesayeed from @internetarchive and the @eotarchive
— Emily Escamilla (@EmilyEscamilla_) May 12, 2023
They recently moved the dataset to AWS3 to allow for greater access to the EOT datasets for reuse and research #iipcWAC23 pic.twitter.com/wqesUJ46Dr
Session #16: Preservation and Complex Digital Publications
Ian Cooke & Giulia Carla Rossi (@britishlibrary) gave an overview of 'Collecting & Presenting Complex Digital Publications'.
— UK Web Archive (@UKWebArchive) May 12, 2023
You can view some of the collection at the forthcoming #DigitalStorytelling exhibition https://t.co/EzUVK8kRWU#IIPCWAC23 #IIPC20Years #UKLegalDeposit pic.twitter.com/r51F3rg4QH
Next, Daniel Steinmeier and Susanne van den Eijkel (@SvandenEijkel) from KB Nationale Bibliotheek presented "What Can Web Archiving History Tell Us about Preservation Risks?" File format obsolescence is a problem for Web archiving. In migrating to a new format, archivists typically agree on significant properties with the producer. However, it can be difficult to identify significant properties when there is no clear producer and no way to know the original intent. They concluded by saying that, while obsolescence is a problem, completeness should be a preservation priority more urgent than solving obsolescence.
And again my colleagues are presenting. @SvandenEijkel and Daniel Steinmeier told about preservatieve risks #iipcWAC23 pic.twitter.com/Em5wKVmlAt
— Trienka Rohrbach (@trienka) May 12, 2023
Keynote
@marleenstikker offering the #iipcwac23 closing keynote titled "Public values in the digital domain" pic.twitter.com/0hcyTLas53
— Martin Klein (@mart1nkle1n) May 12, 2023
“If you can’t open it, you don’t own it”
— Emily Escamilla (@EmilyEscamilla_) May 12, 2023
“The internet is free, therefore, the platform owns you”
Some thought provoking quotes from Marleen Stikker’s keynote “Public values in the digital domain” to close out #IIPCWAC23 pic.twitter.com/p3KBdDY0jg
Comments
Post a Comment