Slopinator: Attack AI training with poisoned GitHub repositories

atomic128 11 points 9 comments May 19, 2026
codeberg.org · View on Hacker News

Discussion Highlights (5 comments)

atomic128

Poison Fountain: https://news.ycombinator.com/item?id=46577464 Poison Fountain on Reddit: https://www.reddit.com/r/PoisonFountain/ Miasma Poison Tar Pit: https://news.ycombinator.com/item?id=47561819

hansmayer

Finally an AI project with a sense of purpose!

verdverm

I doubt things like this work against any serious Ai lab. They know data curation is paramount. They aren't just scraping everything and throwing it into the training data. You don't need to train on all of the internet, that actually hurts.

supern0va

I think these sort of efforts are mostly self-soothing at this point. It is almost certainly the case that the labs are at a minimum running inference over the information they're pulling and ensuring that it's useful/suitable for pre-training. The models are at least good enough to know whether they're looking at utter nonsense.

josefritzishere

I fully support this effort.

Semantic search powered by Rivestack pgvector
8,303 stories · 78,303 chunks indexed