A tool that removes censorship from open-weight LLMs
mvdwoord
144 points
62 comments
March 06, 2026
Related Discussions
Found 5 related stories in 37.5ms across 3,471 title embeddings via pgvector HNSW
- Taming LLMs: Using Executable Oracles to Prevent Bad Code mad44 · 32 pts · March 26, 2026 · 48% similar
- Wikipedia RFC on banning LLM contributions hackerBanana · 39 pts · March 20, 2026 · 47% similar
- LLMs can unmask pseudonymous users at scale with surprising accuracy Gagarin1917 · 42 pts · March 04, 2026 · 46% similar
- r/programming bans all discussion of LLM programming cryptoz · 27 pts · April 02, 2026 · 45% similar
- Arch Linux considers criticism of Age Verification to be a violation SockThief · 11 pts · March 24, 2026 · 45% similar
Discussion Highlights (10 comments)
greenpizza13
Never stopped to ask if they should...
Alifatisk
This is for local models right? I can't use it on, say my glm-5 subscription connected to opencode?
ComputerGuru
Reviews of the tool on twitter indicate that it completely nerfs the models in the process. It won't refuse, but it generates absolutely stupid responses instead.
littlestymaar
Don't use this 2 days old vibe coded bullshit please. p-e-w's Heretic ( https://news.ycombinator.com/item?id=45945587 ) is what you're looking for if you're looking for an automatic de-censoring solution.
a2128
You're not just using a tool — you're co-authoring the science. This README is an absolute headache that is filled with AI writing, terminology that doesn't exist or is being used improperly, and unsound ideas. For example, it focuses a lot on doing "ablation studies", by which it means removing random layers of an already-trained model, to find the source of the refusals(?), which is an absolute fool's errand because such behavior is trained into the model as a whole and would not be found in any particular layer. I can only assume somebody vibe-coded this and spent way too much time being told "You're absolutely right!" bouncing back the worst ideas
measurablefunc
This is another instance of avant-garde "art".
PeterStuer
Already censored for sharing on FB Messenger?
ftkftk
Didn't make it past the first paragraph of AI slop in the README. Have some respect for your readers and put actual information in it, ideally human generated. At least the first paragraph! Otherwise you may as well name it IGNOREME.
SilverElfin
Does anyone offer a live (paid) LLM chatbot / video generation / etc that is completely uncensored? Like not requiring doing any work except just paying for it?
g947o
Went through the README but still have no idea how well this works, in terms of removing the censorship while minimally degrading the quality of responses. Well to be honest I can't tell if this works at all or is just an idea.