@PassingThrough

PassingThrough@lemm.ee · 2 hours ago

As I understand it, their data does in fact enter into the Wayback Machine. They are just also available in the direct WARC archive files(which IMO sounds beneficial to the idea of exporting in bulk to another backup host). At least that’s how their FAQ reads.

And given that they focus on web crawling, and not other arbitrary data formats that IA accepts, 2.8% of over 100 petabytes is still a respectable amount of data.

That said, help is help. If another archival project team wants me to run a worker node so they can distribute load and dodge crawler blocks, let me know, I’ve got space.

PassingThrough@lemm.ee · 2 days ago

Now try working with all three big platforms daily, throw in some Command keys, and I recently realized I’m losing my grip on shortcuts, and my sanity.

PassingThrough@lemm.ee · 2 months ago

The good ol’ Red Scare.