So with the ever increasing news that GameFAQs is getting full fandom’d, I figured I would grab some of the more useful guides for when I want to play a “retro” game or just 100% a LAD. From quick research, it looks like the txt guides are more than covered but the HTML ones are still kind of in a void.
So before I write my own set of scripts, I figured I would check if I am just not aware of something that already does this.
(also check with !datahoarder@lemmy.ml )
For simple HTML websites using
wget -p -k example.com
should work.For more complex sites maybe try ArchiveBox
What does getting full fandom’d mean?
A few years back, CBS sold CNET (CNET, Gamespot, GameFAQs, Giant Bomb, and probably one or two others) to Red Ventures who then sold basically everything but CNET proper to Fandom. If people aren’t aware of what Fandom is, just go to basically any video game wiki and see how many pop ups you need to close just to see some misinformation.
Anywho, Fandom have a decent record of killing every property they buy in the interest of monetization. And they can do that because they buy EVERYTHING. And as of a few days ago, one of the long standing admins at GameFAQs announced they were stepping down. Which… suggests Fandom realized they own GameFAQs and are likely about to start gutting it to add as many ads and autoplay twitch pages as possible.
Thank you for your response! I might as well archive as much as I can do, then.
This extension should work https://addons.mozilla.org/en-US/firefox/addon/single-file/
Looked at that. Seems like it would have worked back back when
?single=1
didn’t break all images. But since the guides are broken up into multiple pages, the automated scraping tends to lose its mind because it will try to get the entire site. Rather than a subset of pages.Thanks though
I only know of HTTrack. Fairly old and it needs some tweaks to work. Let me know if you find anything else