Why Weibo's tiny VibeThinker-3B has the AI world arguing over benchmarks again

gmays 19 points 2 comments June 18, 2026

Discussion Highlights (1 comments)

embedding-shape

> The model, called VibeThinker-3B, scored 94.3 on AIME 2026 — the American Invitational Mathematics Examination, one of the most demanding standardized math competitions in the world. That figure places it alongside DeepSeek V3.2, a model with 671 billion parameters Overfitting, no need to argue about anything I think? The rest of the article seems to echoing people's misunderstanding of pretty elementary stuff.

Why Weibo's tiny VibeThinker-3B has the AI world arguing over benchmarks again

Discussion Highlights (1 comments)

Related Discussions