Internet Archive breached again through stolen access tokens

Dju@lemmy.world · 2 years ago

Internet Archive breached again through stolen access tokens

grue@lemmy.world · 2 years ago

Okay, enough is enough. The Internet Archive is both essential infrastructure and an irreplaceable historical record; it cannot be allowed to fall. Rather than just hoping the Archive can defend itself, I say it’s time to hunt down and counterattack the scum perpetrating this!

dovahking@lemmy.world · 2 years ago

Where are the Anonymous group and the 4chan autists? They should attack these assholes. Attacking the Internet Archive is like kicking a kitten. Everyone will hate you for it.

notfromhere@lemmy.ml · 2 years ago

We need full IA mirrors. This is too critical to leave to this one organization.

psycotica0@lemmy.ca · 2 years ago

Knowing the folks at IA, I’m sure they would love a backup. They would love a community. I’m sure they don’t want to be the only ones doing this. But dang, they’ve got like 99 petabytes of data. I don’t know about you, but my NAS doesn’t have that lying around…

el_abuelo@programming.dev · edit-2 2 years ago

I wonder if someone can come up with some kind of distributed storage that isn’t insanely slow. Kinda like a CDN but on personal devices. I’m thinking of something like what SETI@home did with distributed compute.

Edit: this is kinda like torrents but where the contents are changing frequently.

psycotica0@lemmy.ca · 2 years ago

You should look up IPFS! It’s trying to be kinda like that.

It’ll always be slower than a CDN, though, partly because CDNs pay big money to be that fast, but also anything p2p is always going to have some overhead while the swarm tries to find something. It’s just a more complicated problem that necessarily has more layers.

But that doesn’t mean it’s not possible for it to be “fast enough”.
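
If it helps to see where that lookup overhead comes from, here is a rough toy sketch of content addressing. It is not IPFS's actual API or identifier format: `ToyPeer` and `fetch` are made-up names, and asking peers one at a time stands in for the distributed hash table that real IPFS uses to locate content. The idea it shows is that a block's identifier is the hash of its bytes, so you can fetch from any stranger and still verify what you got.

```python
import hashlib


class ToyPeer:
    """Hypothetical peer that stores blocks keyed by their SHA-256 digest."""

    def __init__(self):
        self.blocks = {}

    def put(self, data: bytes) -> str:
        # The key is derived from the content itself, not chosen by the peer.
        key = hashlib.sha256(data).hexdigest()
        self.blocks[key] = data
        return key

    def get(self, key: str):
        return self.blocks.get(key)


def fetch(key, peers):
    """Ask peers in order; accept the first block whose hash matches the key.

    Returns (data, peers_asked). A block that fails the hash check is
    ignored, so a peer cannot hand back altered content unnoticed.
    """
    for asked, peer in enumerate(peers, start=1):
        data = peer.get(key)
        if data is not None and hashlib.sha256(data).hexdigest() == key:
            return data, asked
    return None, len(peers)
```

The `peers_asked` count is the toy version of the "swarm tries to find something" cost: a CDN already knows where the file lives, while here you have to go looking first.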

el_abuelo@programming.dev · 2 years ago

Interesting, thanks

notfromhere@lemmy.ml · 2 years ago

That is an insane amount of storage. How much does it grow every year and is it stable growth or accelerating?

zlatiah@lemmy.world · 2 years ago

This again??

This time, once archive.org is back online again… is it possible to get torrents of some of their popular collections? For example, I wouldn’t imagine their catalog of books with expired copyright to be very big. Would love a community way to keep the data alive if something even worse happens in the future (and their track record isn’t looking good now).

Exeous@lemmy.world · 2 years ago

Like this idea

njordomir@lemmy.world · 2 years ago

Yep, that seems like the ideal decentralized solution. If all the info can be distributed via torrent, anyone with spare disk space can help back up the data and anyone with spare bandwidth can help serve it.

rottingleaf@lemmy.world · 2 years ago

There’s an issue with torrents: only the most popular ones get replicated, and the process is manual/social.

Something like Freenet is needed, which automatically “spreads” data over machines contributing storage. But Freenet is unreliable storage, basically a cache where older and unwanted stuff gets erased.

So it should be something like Freenet, but possibly with some “clusters” or “communities” with a central (cryptography-enabled) authority of each being able to determine the state of some collection of data as a whole, and pick priorities. My layman’s understanding is that this would be similar to something between Freenet and Ceph, LOL. More like a cluster filesystem spread over many nodes, not like cache.
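
To make the "authority picks priorities" part concrete, here is a minimal sketch under my own assumptions. It is not how Freenet or Ceph place data; `plan_replicas`, the `(name, size, priority)` tuples, and the rule of "base copies plus priority" are all invented for illustration. Signing the resulting plan, which is what the cryptography would be for, is left out.

```python
def plan_replicas(items, nodes, base_copies=2):
    """Decide which volunteer nodes should hold which items.

    items: list of (name, size, priority) tuples; higher priority means
           more copies and first pick of the available space.
    nodes: dict mapping node name -> free capacity, same unit as size.
    Returns a dict mapping item name -> list of node names.
    """
    free = dict(nodes)  # work on a copy so the caller's dict is untouched
    plan = {}
    # Place high-priority items first, before space runs out.
    for name, size, priority in sorted(items, key=lambda it: -it[2]):
        wanted = base_copies + priority
        # Prefer the emptiest nodes that can still fit the item.
        candidates = sorted(
            (n for n in free if free[n] >= size),
            key=lambda n: (-free[n], n),
        )
        chosen = candidates[:wanted]
        for n in chosen:
            free[n] -= size
        plan[name] = chosen
    return plan
```

Unlike a cache, nothing here gets evicted just because it is old or unpopular: an item only ends up with fewer copies than wanted when the nodes genuinely run out of room, and the plan makes that shortfall visible.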

njordomir@lemmy.world · edit-2 2 years ago

You have more knowledge on this than I do. I enjoyed reading about Freenet and Ceph. I have dealt with cloud stuff, but not as much on a technical-underpinnings level. My first Freenet impression from reading some articles gives me 90s internet vibes based on the common use cases they listed.

I remember Ceph because I ended up building it from the AUR once on my weak little personal laptop, because it got dropped from some repository or whatever but was still flagged to stay installed. I could have saved myself an hours-long build if I had read the release notes.

rottingleaf@lemmy.world · 2 years ago

“My first Freenet impression from reading some articles gives me 90s internet vibes based on the common use cases they listed.”

That’s correct, I meant the way it works.

IndustryStandard@lemmy.world · 2 years ago

Hope they had a backup
