how big is the database?
Books can’t be that big, but I’m guessing the selection is simply huge?
The selection is literally all books that can be found on the internet.
According to their total dataset size excluding duplicates, over 900 TB
Sure, that’s a bit more than $65,000 per year with Backblaze.
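For anyone who wants to check that number, here is a rough sketch of the arithmetic. The 913.1 TB figure is the archive's own total quoted in this thread; the $6 per TB per month rate is my assumption for Backblaze's object storage list price, and it ignores egress, API calls, and any volume discount. The function name is made up for illustration.

```python
# Rough yearly storage bill for the full dataset.
DATASET_TB = 913.1         # total excluding duplicates, as quoted in the thread
PRICE_PER_TB_MONTH = 6.00  # ASSUMPTION: flat list price in USD per TB per month


def annual_storage_cost(size_tb: float, price_per_tb_month: float) -> float:
    """Return the yearly cost in USD for storing size_tb at a flat monthly rate."""
    return size_tb * price_per_tb_month * 12


annual_cost = annual_storage_cost(DATASET_TB, PRICE_PER_TB_MONTH)
print(f"${annual_cost:,.0f} per year")
```

With those inputs it comes out just under $66,000, which lines up with the "a bit more than $65,000" estimate. Swap in a different rate to compare other providers.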
You run a petabyte Synology at home?
I’m guessing you’re talking GBs?
That’s awesome. How many drives do you have, and what sizes? Also, why Synology instead of a higher-end, enterprise-grade solution at this point?
They put a link in with the total…
Total (excluding duplicates): 133,708,037 files, 913.1 TB
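Those two totals also answer the "books can’t be that big" question. A quick sketch of the average file size, assuming decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes), which the source does not specify; the function name is just for illustration.

```python
# Average file size implied by the quoted totals.
TOTAL_FILES = 133_708_037  # files, excluding duplicates (from the thread)
TOTAL_TB = 913.1           # dataset size, excluding duplicates (from the thread)
BYTES_PER_TB = 10**12      # ASSUMPTION: decimal terabytes
BYTES_PER_MB = 10**6       # ASSUMPTION: decimal megabytes


def average_file_size_mb(total_tb: float, file_count: int) -> float:
    """Return the mean file size in MB for a dataset of total_tb across file_count files."""
    return total_tb * BYTES_PER_TB / file_count / BYTES_PER_MB


avg_mb = average_file_size_mb(TOTAL_TB, TOTAL_FILES)
print(f"{avg_mb:.2f} MB per file")
```

So each file averages somewhere under 7 MB. Individual books are small; it is the sheer count that gets you to 900+ TB.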
wait what? how expensive is it to buy and run? is it practical at all, what are the common snags? always wanted to get into doing some archiving.
You should write a will instructing your family to send those disks to the Internet Archive for preservation if something happens to you.
Correct me if I’m wrong, but they only index shadow libraries and do not host any files themselves (unless you count the torrents). So, you don’t need 900+ TB of storage to create a mirror.
I imagine a couple of terabytes at the very least, though I could be underestimating how many books have been de-DRMed so far.
Girl, what? No wonder they’re having trouble hosting their archive. Does Anna’s Archive host copyrighted content as well or is all that copyleft?
They host academic papers and books, most of which are copyrighted. They recently got in trouble for scraping a book metadata service to generate a list of books that haven’t been archived yet: https://torrentfreak.com/lawsuit-accuses-annas-archive-of-hacking-worldcat-stealing-2-2-tb-data-240207/
Is hosting all that stuff even legal? I mean, they’re not making any money off of it, but they’re still a “piracy” hub. How have they survived this long?
It’s very illegal. IIRC, it was created by a group called “Pirate Library Mirror” after the guy who runs Z-Library got arrested, so I assume they’re taking anonymity seriously to avoid arrest.
No, it’s not.
They’ve survived by making themselves hard to identify and shut down. And as we can see here, by creating redundancies.
They index, not host, no? (Unless you count the torrents, which are distributed)
The archive includes copyrighted works. Often multiple copies of each work, across different formats.
I guess more than 5?
Bigger than zlib or Project Gutenberg?
It is huge! They claimed to have preserved about 5% of the world’s books.
Oh, I actually thought it was way more! There wasn’t a single book I wanted (or even thought to look up) that I didn’t find in there.