/AAD/ - Archiving And Donating computer resources general - /g/ (#106155146) [Archived: 255 hours ago]

Anonymous
8/5/2025, 11:05:20 PM No.106155146
1728763748356954
md5: fe73c677f7370e80b8b888ba50355fc8
Useful archiving efforts and other projects to help out with, for people who are new to archiving and interested in it:

HIGH priority (If you don't help archive these automatically, the data will probably be lost forever):

1. http://warrior.archiveteam.org/
Help automatically archive sites that are being shut down right now by running the ArchiveTeam Warrior program (or specific project containers) in the background
Requirements: A few GB of disk space, some bandwidth and a small amount of CPU power. More info: https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior

If you learn that a site or any online data is in danger of shutting down, read through this page and contact ArchiveTeam on their IRC if required in order to have it archived: https://wiki.archiveteam.org/index.php/Projects

2. Use a browser extension to automatically submit URLs you browse that are not yet archived on https://archive.org, so they get archived there
https://github.com/internetarchive/wayback-machine-webextension
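
If you want to check by hand or from a script whether a page already has a snapshot, here is a minimal sketch. It assumes the Wayback Machine's public availability endpoint (https://archive.org/wayback/available) and the Save Page Now URL form (https://web.archive.org/save/ followed by the page URL). The helper names are made up for illustration, and the response is assumed to carry an `archived_snapshots.closest` entry.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional

AVAILABILITY_API = "https://archive.org/wayback/available"
SAVE_PREFIX = "https://web.archive.org/save/"


def availability_url(page_url: str) -> str:
    """Build the availability query URL for page_url."""
    return AVAILABILITY_API + "?" + urllib.parse.urlencode({"url": page_url})


def closest_snapshot(response: dict) -> Optional[str]:
    """Return the closest snapshot URL from a parsed response, or None."""
    closest = response.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def check(page_url: str) -> Optional[str]:
    """Ask the API over the network; only runs when you call it."""
    with urllib.request.urlopen(availability_url(page_url), timeout=30) as resp:
        return closest_snapshot(json.load(resp))


def save_url(page_url: str) -> str:
    """URL that asks Save Page Now to capture page_url when opened."""
    return SAVE_PREFIX + page_url
```

Calling `check("example.com")` returns a snapshot URL or `None`. If it is `None`, opening `save_url(...)` in a browser is one way to request a capture. The extension does roughly this for you as you browse.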
Anonymous
8/5/2025, 11:05:26 PM No.106155148
MEDIUM priority (Important overall)

3. Seed torrents for as long as possible, and rare data forever. Make sure to look up a guide for your router to PORT FORWARD your torrent client's port, which substantially increases your upload (and download) speed. In low-population torrent swarms, if no one has a forwarded port, peers might not be able to connect to each other at all and exchange any data despite having it.
Requirements: As much or as little bandwidth as you want (you can set limits if you need to)
https://github.com/qbittorrent/qBittorrent (Recommended client, especially to replace uTorrent)

4. Archive web pages you want to have a local copy of with a "Web Extension for saving a faithful copy of a complete web page in a single HTML file with a single click"
https://github.com/gildas-lormeau/SingleFile

5. Archive videos with "GUI front-end for youtube-dl, yt-dlp and other compatible video downloaders"
https://github.com/axcore/tartube

6. "Capture or record any area of your screen and share it with a single press of a key"
https://github.com/ShareX/ShareX

7. Archive entire websites you want to have a local copy of
https://www.httrack.com/
Replies: >>106170629
Anonymous
8/5/2025, 11:06:26 PM No.106155158
8. Publish the data you have archived that isn't easily (or at all) available online. You can create torrents yourself in your torrent client and then share the magnet link anywhere online for anyone to access. As long as DHT (Distributed Hash Table, a decentralized way to find peers without the need for any specific tracker) is enabled in settings (on by default), your torrents can be discovered by DHT crawlers, local or online (for example https://btdig.com/, where you can also search for FILE NAMES within all DHT torrents).
(archive.org also creates torrents for all uploads automatically but their torrents shouldn't be relied on because of an error-prone implementation and since they can also break when more files are uploaded or if the item's metadata changes, which includes even getting a new comment on the item)
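
Your torrent client will normally hand you the magnet link directly, but for reference, this is a minimal sketch of what a BitTorrent v1 magnet link is made of. The function name is made up, and the info-hash and tracker in the usage note are placeholders, not real values.

```python
from typing import Sequence
from urllib.parse import quote


def magnet_link(info_hash: str, name: str = "", trackers: Sequence[str] = ()) -> str:
    """Build a BitTorrent v1 magnet URI from a 40-character hex info-hash."""
    info_hash = info_hash.strip().lower()
    if len(info_hash) != 40 or any(c not in "0123456789abcdef" for c in info_hash):
        raise ValueError("expected a 40-character hex SHA-1 info-hash")
    parts = ["xt=urn:btih:" + info_hash]
    if name:
        # dn is only a display name shown before metadata is fetched.
        parts.append("dn=" + quote(name, safe=""))
    # Trackers are optional: with DHT enabled, peers can be found from the hash alone.
    parts.extend("tr=" + quote(t, safe="") for t in trackers)
    return "magnet:?" + "&".join(parts)
```

For example, `magnet_link("a" * 40, "my archive", ["udp://tracker.example.org:1337/announce"])` gives a link with `xt`, `dn` and `tr` parts. The `xt` part alone is enough for DHT-capable clients to find the torrent.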


OTHER useful things:

- In your torrent client settings add the best trackers to be automatically added for all of your newly added torrents (helps more easily connect to peers, especially in obscure torrents)
https://github.com/ngosang/trackerslist

- Look into running a node for I2P (anonymous private network within the global internet)
Requirements: Mostly bandwidth, more info: https://geti2p.net/en/faq
https://geti2p.net/

- Look into running Tor/Hyphanet(Freenet)/IPFS/YaCy/SearXNG nodes

- Easily capture and digitize all data and metadata from optical media (CDs, DVDs, Blu-rays...) with Media Preservation Frontend (MPF)
https://github.com/SabreTools/MPF

- "A self-hosted BitTorrent indexer, DHT crawler, content classifier and torrent search engine with web UI"
https://github.com/bitmagnet-io/bitmagnet

- "ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline"
https://github.com/ArchiveBox/ArchiveBox
Anonymous
8/5/2025, 11:07:27 PM No.106155169
- Look into donating your PC resources to be used more intensively in projects:
BOINC (Berkeley Open Infrastructure for Network Computing): https://boinc.berkeley.edu/projects.php
GIMPS (Great Internet Mersenne Prime Search): https://www.mersenne.org/


- Additional archiving tools: https://github.com/iipc/awesome-web-archiving

- Additional links to archiving and similar communities:
https://wiki.archiveteam.org/index.php/Archiveteam:IRC
https://www.reddit.com/r/Archiveteam
https://www.reddit.com/r/DataHoarder
https://www.reddit.com/r/DataHoarder/wiki/index/ - Hardware and software for data hoarding FAQ
https://www.reddit.com/r/lostmedia
https://www.reddit.com/r/GamePreservationists
https://www.reddit.com/r/torrents
https://www.reddit.com/r/qBittorrent
https://annas-archive.se/torrents
>>>/t/

What are you archiving or want to archive?
Do you have or know anyone who has some rare interesting data or media not available online?
Replies: >>106158273
Anonymous
8/5/2025, 11:08:37 PM No.106155179
1727695184464763
md5: cc1c9da0edd32fefb4689ab3880d8066
Updated the OP to include
>- Easily capture and digitize all data and metadata from optical media (CDs, DVDs, Blu-rays...) with Media Preservation Frontend (MPF)
https://github.com/SabreTools/MPF
Replies: >>106158273
Anonymous
8/5/2025, 11:16:09 PM No.106155262
buy an ad
Replies: >>106156862 >>106158273
Anonymous
8/6/2025, 1:44:48 AM No.106156862
>>106155262
Not selling anything
Anonymous
8/6/2025, 4:55:26 AM No.106158273
>>106155169
>What are you archiving or want to archive?
Audiobook collections, misc films as they come to me, series, boxsets.

I'd like to be able to download the entire https://www.songsterr.com/ and https://www.all-guitar-chords.com/website and run the tabs locally with local audio/video.

>>106155179
This looks really cool.

>>106155262
fuck off
Anonymous
8/6/2025, 4:56:27 AM No.106158284
That's https://www.all-guitar-chords.com without the /website.
Anonymous
8/6/2025, 4:57:41 AM No.106158296
There's so much shit on youtube to download, music videos, tutorials, courses, I don't have the time nor health to back it all up.
Replies: >>106161108
Anonymous
8/6/2025, 5:41:17 AM No.106158607
1754018983111126
md5: 40b5393e12ebcb7864ea7b52eb5ad7ef
So some anons (a shit ton) and I were playing Dandy's World when suddenly we started discussing archiving vidya that may be lost due to censoring. Some anon recommended LTO tapes. What is the cheapest brand of drive (reader) to buy? I see the actual tapes are cheap. Any info would be appreciated.
Replies: >>106158985
Anonymous
8/6/2025, 6:51:20 AM No.106158985
>>106158607
Regardless of brand, the newer-generation LTO tape drives are ludicrously expensive ($5-6k just for an LTO-9 drive, plus $150 per cartridge holding 18TB uncompressed / 45TB theoretical maximum compressed), and the more affordable older generations you might find cheap on eBay have been beaten on cost per GB by newer helium hard drives. Tape is also linear (meaning it'll take you ages to retrieve data if it's on the other end of the tape) and needs to be stored in a cool, dry place unless you want flaking and/or mould in the next few decades.

Don't bother with tape unless you really have the capital to invest in it and somewhere to keep your tapes.
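
To put rough numbers on that, here is a small sketch of the break-even arithmetic. The drive price (midpoint of $5-6k) and the $150 per 18TB cartridge come from the figures above. The $15 per TB hard drive price is a made-up placeholder, so plug in whatever you actually pay.

```python
import math


def tape_total_cost(capacity_tb: float, drive_cost: float = 5500.0,
                    tape_cost: float = 150.0, tape_tb: float = 18.0) -> float:
    """One drive plus enough whole cartridges for capacity_tb of uncompressed data."""
    tapes = math.ceil(capacity_tb / tape_tb)
    return drive_cost + tapes * tape_cost


def hdd_total_cost(capacity_tb: float, hdd_cost_per_tb: float = 15.0) -> float:
    """Hard drive cost at a flat (hypothetical) price per TB."""
    return capacity_tb * hdd_cost_per_tb


def break_even_tb(drive_cost: float = 5500.0, tape_cost: float = 150.0,
                  tape_tb: float = 18.0, hdd_cost_per_tb: float = 15.0) -> float:
    """Capacity at which tape becomes cheaper than hard drives (inf if never)."""
    tape_per_tb = tape_cost / tape_tb
    if hdd_cost_per_tb <= tape_per_tb:
        return math.inf
    return drive_cost / (hdd_cost_per_tb - tape_per_tb)
```

With those placeholder prices, `break_even_tb()` lands in the hundreds of TB. That is the point of the advice: the drive cost only pays off at very large scale.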
Replies: >>106159004
Anonymous
8/6/2025, 6:54:48 AM No.106159004
>>106158985
Is there any medium that would be similar in concept, then? I know it may sound strange, but I have an appreciation for being able to utilize a tape cartridge. I see older machines going for $200 to $300 on eBay. Are any of those good? I don't know how to explain it, but I have an extreme fascination when I am able to utilize anything with tape.
Replies: >>106159113
Anonymous
8/6/2025, 7:17:45 AM No.106159113
>>106159004
HDDs are cheap per TB, so you can do
RAID (1, 5, 6, the ZFS equivalents, or any combination thereof; 5/6 is better for sequential data and storage economy, 4 exists but is niche, and 10 is a stripe across mirrored pairs)
And a backup
Ideally RAIDed or at least with parity (par files) AND offsite
(BTRFS has a redundancy profile called "dup" that keeps two copies on a single drive, a kind of "RAID 1" that helps against bad sectors but not against the whole drive dying)
BTRFS RAID 5/6 is not advised

ZFS is the ideal file system, but license shenanigans (CDDL vs. GPL) keep it out of the mainline Linux KERNEL, so it has to be installed as a separate module
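
For comparing the levels, here is a quick sketch of usable capacity with equal-size drives. The function name and level labels are made up for illustration, filesystem overhead is ignored, and ZFS raidz1/raidz2 are treated as rough equivalents of RAID 5/6.

```python
MIN_DRIVES = {"raid1": 2, "raid5": 3, "raid6": 4, "raid10": 4}


def usable_tb(level: str, drives: int, size_tb: float) -> float:
    """Usable capacity in TB for `drives` equal-size disks."""
    if level not in MIN_DRIVES:
        raise ValueError(f"unknown level: {level}")
    if drives < MIN_DRIVES[level]:
        raise ValueError(f"{level} needs at least {MIN_DRIVES[level]} drives")
    if level == "raid1":
        return size_tb                   # every drive holds a full copy
    if level == "raid5":
        return (drives - 1) * size_tb    # one drive's worth of parity
    if level == "raid6":
        return (drives - 2) * size_tb    # two drives' worth of parity
    if drives % 2:
        raise ValueError("raid10 needs an even number of drives")
    return drives // 2 * size_tb         # stripe across two-way mirrors
```

With four 8TB drives, RAID 5 leaves three drives' worth of space while RAID 6 and RAID 10 leave two. RAID 6 survives any two failures, whereas RAID 10 only survives two if they hit different mirror pairs.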
Replies: >>106159142 >>106159208
Anonymous
8/6/2025, 7:22:43 AM No.106159142
>>106159113
Should also mention: unless you are buying 10+ drives with the expectation that 2 are going to die around the same time, buy from different manufacturers. Production defects can hit a whole batch at once. It may be a small risk, but it's nothing to scoff at (especially if running a 2-stripe RAID 10 or RAID 5)
Replies: >>106159208
Anonymous
8/6/2025, 7:33:03 AM No.106159208
>>106159113
>>106159142
Thanks
Anonymous
8/6/2025, 1:03:13 PM No.106161052
Bump
Anonymous
8/6/2025, 1:13:58 PM No.106161108
>>106158296
You can give a YouTube search result URL to yt-dlp so it goes through all the search results it can and downloads everything there. This tends to pick up a lot of irrelevant content over time, though, so it's good to put your unique keywords in double quotes ("").
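
A minimal sketch of that idea using yt-dlp as a Python library. The helper names are made up. Besides the search result URL, it also shows yt-dlp's `ytsearchN:` prefix, a different route that caps the number of results. The output template is just one possible choice.

```python
from urllib.parse import urlencode


def quoted_query(*keywords: str) -> str:
    """Wrap each keyword or phrase in double quotes to cut down on irrelevant hits."""
    cleaned = (k.strip().replace('"', "") for k in keywords)
    return " ".join('"' + k + '"' for k in cleaned if k)


def search_url(*keywords: str) -> str:
    """YouTube search result URL for the quoted keywords."""
    return "https://www.youtube.com/results?" + urlencode({"search_query": quoted_query(*keywords)})


def search_spec(limit: int, *keywords: str) -> str:
    """yt-dlp search prefix form: fetch at most `limit` results."""
    return f"ytsearch{limit}:{quoted_query(*keywords)}"


def download(target: str, archive_file: str = "downloaded.txt") -> None:
    """Download everything yt-dlp finds for target (a URL or search spec)."""
    from yt_dlp import YoutubeDL  # third-party package, imported only when needed

    opts = {
        "download_archive": archive_file,  # skip videos fetched on earlier runs
        "ignoreerrors": True,              # keep going past unavailable videos
        "outtmpl": "%(uploader)s/%(title)s [%(id)s].%(ext)s",
    }
    with YoutubeDL(opts) as ydl:
        ydl.download([target])
```

Something like `download(search_spec(25, "music theory", "counterpoint"))` grabs a bounded batch. The archive file lets you re-run the same search later without downloading duplicates.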
Replies: >>106162709
Anonymous
8/6/2025, 4:30:03 PM No.106162709
>>106161108
Solid advice, thanks.
Anonymous
8/6/2025, 4:35:31 PM No.106162770
Bumperino.
Anonymous
8/6/2025, 6:52:31 PM No.106164326
1745916134951044
md5: 2e435d89506effd8c23e40c64a13aa59
music
Replies: >>106164658 >>106165676 >>106166117 >>106171100
Anonymous
8/6/2025, 7:23:51 PM No.106164658
>>106164326
BASED
Anonymous
8/6/2025, 8:51:42 PM No.106165676
>>106164326
based. I deleted mine a couple months ago and have been self-hosting my own music server since
Replies: >>106166117
Anonymous
8/6/2025, 9:27:38 PM No.106166117
>>106165676
Need to do this, honestly. zased.
>>106164326
I also need to do this. I requested a copy of my music data from Spotify in the hopes of turning the JSON into searches on YouTube or other sources for those songs, though idk what to do about discoverability, since Spotify's system was pretty good.
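
A rough sketch of the JSON-to-search-terms step. It ASSUMES the export has a library file with a top-level "tracks" list whose entries carry "artist" and "track" fields, so check your own export and adjust the keys if it looks different. The function name is made up.

```python
import json
from typing import List


def search_queries(library_json: str) -> List[str]:
    """Turn an exported library (JSON text) into de-duplicated 'artist - track' strings."""
    data = json.loads(library_json)
    seen = set()
    queries = []
    for item in data.get("tracks", []):  # assumed key, verify against your export
        artist = (item.get("artist") or "").strip()
        track = (item.get("track") or "").strip()
        if not track:
            continue
        query = f"{artist} - {track}" if artist else track
        if query.lower() not in seen:
            seen.add(query.lower())
            queries.append(query)
    return queries
```

Each resulting string can then be pasted into whatever search or downloader you use. Keeping the list in a text file also gives you a plain record of your library that doesn't depend on any service.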
Anonymous
8/6/2025, 11:11:44 PM No.106167212
Thanks for the guide. I'm downloading all my favorite childhood cartoon parodies. I'm not going to give my ID to youtube on Aug 15th and I fucking will walk away forever for a different outlet.
Replies: >>106167756
Anonymous
8/7/2025, 12:08:41 AM No.106167756
1745695661025067
md5: 4c565712f1c27584a37918cc26e1fd0d
>>106167212
Is it going to be a blanket ID requirement for any and all videos, or just for the R-rated age restricted stuff? Can you watch a music theory video or some DIY repair tutorial without an ID? FFS.
Replies: >>106169027
Anonymous
8/7/2025, 2:07:25 AM No.106169027
>>106167756
Might as well backup everything you care about
Anonymous
8/7/2025, 2:30:24 AM No.106169261
1741153618996485
md5: a7862143d91e2c9f79dc2b6e17c4dcf7
all of my hard drives are full and now I have to buy more.
Anonymous
8/7/2025, 3:26:39 AM No.106169822
Screenshot 2024-11-22 081305
md5: c9257796096fba3af94437dd5a38236e
Internet Archive is that site run by a Jew that no one elected to store all our data, right?
Anonymous
8/7/2025, 4:54:35 AM No.106170629
>>106155148
>https://www.httrack.com/
How do I use this properly? It downloads forever, and websites that can't be more than 100 MB use like 5 GB before I cancel the download.
Replies: >>106170667
Anonymous
8/7/2025, 4:58:52 AM No.106170667
>>106170629
Look at the download folder to see what is taking up the space, then lower the depth to which the site's links are crawled
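
For the first step, here is a small sketch that lists what is eating the space in a mirror folder, largest first. The function name is made up, and you pass in whatever download folder you chose.

```python
import os
from typing import List, Tuple


def dir_sizes(root: str) -> List[Tuple[str, int]]:
    """Return (name, bytes) for each top-level entry under root, largest first."""
    totals = []
    for entry in os.scandir(root):
        if entry.is_dir(follow_symlinks=False):
            size = 0
            for dirpath, _dirs, files in os.walk(entry.path):
                for name in files:
                    path = os.path.join(dirpath, name)
                    if not os.path.islink(path):
                        size += os.path.getsize(path)
        else:
            size = entry.stat(follow_symlinks=False).st_size
        totals.append((entry.name, size))
    return sorted(totals, key=lambda item: item[1], reverse=True)
```

If the biggest entries turn out to be other domains or endless generated pages, the crawl is wandering too far. As far as I know the HTTrack command line takes the depth as `-rN` (for example `-r3`), and the GUI has the same setting under its limits options.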
Anonymous
8/7/2025, 5:52:55 AM No.106171100
>>106164326
based. i never got into streaming services and still have a massive digital music library that has slowly been growing since i was in high school.
it's really nice to never have to worry about certain albums suddenly vanishing because a license expired or whatever. of course I always buy physical copies of albums i really like to support the artist (unless the album is 5+ yrs old, then i just get a secondhand copy for $2 on discogs)