New ask Hacker News story: Tell HN: We should snapshot a mostly AI output free version of the web

Tell HN: We should snapshot a mostly AI output free version of the web
20 by jacquesm | 17 comments on Hacker News.
While we can, and if it isn't too late already. The web is overrun with AI generated drivel, I've been searching for information on some widely varying subjects and I keep landing in recently auto-generated junk. Unfortunately most search engines associate 'recency' with 'quality' or 'relevance' and that is very much no longer true. While there is still a chance I think we should snapshot a version of the web and make it publicly available. That can serve as something to calibrate various information sources against to get an idea of whether or not they are to be used or rather not. I'm pretty sure Google, OpenAI and Facebook all have such snapshots stashed away that they train their AIs on, and such data will rapidly become as precious as 'low background steel'. https://ift.tt/QtqO3yK

Comments