
#webarchiving


lab.depositar.io/news/250418_1

The depositar team were at the Web Archiving Conference 2025 in Oslo, Norway, and reported on our progress in converting the web archives of a recently orphaned newspaper into accessible article collections in the IPTC (International Press Telecommunications Council) standard format for long-term preservation and data exploration.

The slideset and the script of the presentation can be found at the depositar:

pid.depositar.io/ark:37281/k5p


Gil: What they learned (from the British Library's point of view): they affect systems that are technical in nature, the audiences, and processes. The other teams cannot go back to their work because there were bits of technology they relied on in that chain. #iipcWAC25 #webarchiving

Archiving everything on the web isn't an option. So we need better tools and methods for making decisions about what is in need of #WebArchiving.

Here is a short case study in evaluating how much the US Census FTP site has been archived by the @internetarchive and the End of Term Web Archive--both are critical pieces of memory infrastructure.

inkdroid.org/2025/04/08/census

Thanks to @andrewjbtw for doing the heavy lift of collecting the data. All potential errors are my own.

I've had the opportunity to hear Mark Graham speak on several occasions, and he is always such a good spokesperson for @internetarchive. But he really was in top form in this two-part interview with Amy Goodman of Democracy Now! He speaks on a wide variety of topics of interest to #webarchiving and #archives more generally, in a very circumscribed and personal way.

1. democracynow.org/2025/2/28/int
2. democracynow.org/2025/2/28/ext

Democracy Now! · Wayback Machine Saves Thousands of Federal Webpages Amid Purge of Government Data Under Trump · By Democracy Now!

We're working on an update to the DIY #webarchiving #zine and are looking for recommendations for paid, secure, non-Google #Email. Any suggestions? ProtonMail doesn't seem like a good option given the recent behavior of their CEO. (zinebakery.com/homemade-zines/)

Zine Bakery · DIY Web Archiving | Zine Bakery · Zine Bakery Bakeshop #2, by Quinn Dombrowski, Tessa Walsh, Anna Kijas, Ilya Kreymer, and Amanda Wyatt Visconti

Love this! A 22-page DIY web archiving zine that "shows you why everyone should participate in preserving the things on the web they care about, and how anyone can do so (no special expertise required!)".

zinebakery.com/homemade-zines/

#internet #TheWeb #archiving #WebArchiving

Made by @quinnanya, @Literature_Geek, and a bunch of other awesome folks, found via @lavaeolus fedihum.org/@lavaeolus/1138737


Update #4: I went through even more of my bookmarks and hyperfixations today and added 54 more links to tech resources on moule.world/links.html.

"Technology and Web" will be the largest of my link categories as I've got a whole heap of stuff about the Fediverse, cybersecurity, and IndieWeb I've been bookmarking a lot recently.

For now, if you're interested in:

#OldWeb
#QuantumComputing
#Retrocomputing
#WebAccessibility
#WebArchiving
#WebBrowsing
#WorldBuilding

Check it out!

I'm finding that browsertrix crawler's ability to screenshot pages for QA is also a handy way to identify crawler traps for a live #WebArchiving crawl, where an infinite number of URLs get generated for the same page content. Yeah, there are sites like this...

crawler.docs.browsertrix.com/u

At the moment I've just been using a standard hash, but perhaps some type of image hash could be useful too, for cases where the content is nearly identical, like a calendar trap.

crawler.docs.browsertrix.com · Quality Assurance - Browsertrix Crawler Docs
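
A rough sketch of the idea in the post above, not the poster's actual script. It assumes the screenshots have already been pulled out of the crawl output into a hypothetical `screenshots` mapping of URL to image bytes; how to extract them from the crawler's output is not shown here. The exact-hash grouping flags URLs that render byte-identical pages. The "image hash" part is illustrated with a simple average hash over a small grayscale pixel grid, which is assumed to come from an image already decoded and downscaled by whatever image library is at hand.

```python
import hashlib
from collections import defaultdict


def group_by_exact_hash(screenshots):
    """Group URLs whose screenshot bytes are byte-for-byte identical.

    `screenshots` is a hypothetical dict of URL -> raw image bytes.
    Returns only groups with more than one URL (trap candidates).
    """
    groups = defaultdict(list)
    for url, data in screenshots.items():
        groups[hashlib.sha256(data).hexdigest()].append(url)
    return [sorted(urls) for urls in groups.values() if len(urls) > 1]


def average_hash(pixels):
    """Average hash of a small grayscale grid (rows of 0-255 integers).

    Each bit is 1 if the pixel is brighter than the grid's mean, else 0.
    """
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)
    return [1 if value > mean else 0 for value in flat]


def hamming(a, b):
    """Number of positions where two equal-length bit lists differ."""
    return sum(x != y for x, y in zip(a, b))
```

Two screenshots whose average hashes sit within a small Hamming distance of each other (the threshold is a judgment call) would be candidates for a near-identical page, such as a calendar trap where only the date changes.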