The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A" (2023)
Anon84
25 points
44 comments
June 23, 2026
Related Discussions
Found 5 related stories in 122.6ms across 11,417 title embeddings via pgvector HNSW
- Constraint Decay: The Fragility of LLM Agents in Back End Code Generation wek · 210 pts · May 24, 2026 · 49% similar
- LLMorphism: When humans come to see themselves as language models okey · 75 pts · May 10, 2026 · 48% similar
- The case for zero-error horizons in trustworthy LLMs daigoba66 · 71 pts · April 02, 2026 · 48% similar
- A sleep-like consolidation mechanism for LLMs juxtapose · 195 pts · May 26, 2026 · 47% similar
- Can LLMs Beat Classical Hyperparameter Optimization Algorithms? galsapir · 109 pts · June 09, 2026 · 47% similar
Discussion Highlights (9 comments)
eurekin
I remember seeing this popping up in discussions the first time, but never noticed any resolution (other than to train both sides). Has SOTA advanced?
gipp
As I and several other people pointed out last time this was posted, "A is B," in natural language, does not imply "B is A." "Is" can denote any of many different shades of relationship weaker than logical identity.
zmgsabst
“A is B” doesn’t generally entail “B is A”. “A square is a rectangle” does not entail “a rectangle is a square”. Similarly, “Socrates is alive” doesn’t entail “alive is Socrates”. Notably, they mention when context is included, LLM performance rises — ie, exactly when we include extra information that allows it to recognize what kind of information is being conveyed. But the LLM is correct not to generalize that pattern when it doesn’t generalize — even if researchers have salient example, but ignore contrary ones (eg, square-rectangle or Socrates-alive).
ralferoo
Not only is the inverse not generally true (as others have pointed out), their examples requires several mental leaps. "Who is Tom Cruise's mother? [A: Mary Lee Pfeiffer]" and the reverse "Who is Mary Lee Pfeiffer's son?" The word "mother" has no relationship to "son" in terms of the model, and so while the model might be able to infer a proximity relationship between "Tom Cruise" and "Mary Lee Pfeiffer" just because they appear in the same sentence, expecting the AI to guess that the inverse of mother is son is a bit of a stretch, especially when they're both lossy mappings, because the relationship is {mother,father} <=> {son,daughter}. If we're going to train models to make that mental leap, we'd have to put up with false results like "Tom Cruise is the daughter of Mary Lee Pfeiffer" unless the model is also supposed to infer that Tom means he can only be a son.
whilenot-dev
Here's a short example from A. Karpathy in a 2024 video: https://www.youtube.com/watch?v=zjkBMFhNj_g&t=750s
josefritzishere
The premise here is false. AI does not learn. It is a word guessing machine. I know that some of this is the semantics of how we describe these analogs but pretending that an LLM can learn does not advance the topic.
bigstrat2003
Yes, because they can't reason. This is well known, and should be completely unsurprising. LLMs don't "learn" anything except that some token is statistically likely to be followed by some other token.
dang
Discussed at the time: LLMs trained on “A is B” fail to learn “B is A” - https://news.ycombinator.com/item?id=37621999 - Sept 2023 (158 comments)
jeremysalwen
I am surprised nobody links this blog post demonstrating that the paper's conclusion is not true (even for gpt3.5): https://andrewmayne.com/2023/11/14/is-the-reversal-curse-rea... It seems like restrictions on the model talking about non famous people might have been responsible for the appearance of the models being unable to do this.