Qwen-3.6-Plus is the first model to break 1T tokens processed in a day
Alifatisk
49 points
16 comments
April 05, 2026
Related Discussions
Found 5 related stories in 83.7ms across 8,303 title embeddings via pgvector HNSW
- Orthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distribution FranckDernoncou · 34 pts · May 15, 2026 · 63% similar
- The Qwen 3.5 Small Model Series armcat · 11 pts · March 02, 2026 · 62% similar
- Qwen 3.7 Preview theanonymousone · 228 pts · May 18, 2026 · 61% similar
- Qwen3.5-Omni meetpateltech · 18 pts · March 30, 2026 · 59% similar
- We got 207 tok/s with Qwen3.5-27B on an RTX 3090 GreenGames · 162 pts · April 20, 2026 · 58% similar
Discussion Highlights (5 comments)
Alifatisk
https://xcancel.com/openrouter/status/2040239467865489874
roxolotl
I’m very curious if we’re going to ever get another “deepseek moment. Qwen is starting to feel like it could be one. But for it to be people would have to decide to care. It took about a month, I think mid December-mid January, from the deepseek paper for the “moment” so it doesn’t necessarily have to be right away.
dcre
Anybody want to give an anecdotal take on how good it is?
gertlabs
Qwen 3.6 Plus is a decent model in our benchmarks (which found it to perform lower than its model card) at gertlabs.com, but not ground-breaking. The reason for the insane popularity is because it's pretty good AND free. It's a no-brainer to switch to this for anything usage-based that isn't frontier coding while the free limits are available. It's probably running a model ~100B parameters under the hood, which won't be so heavily subsidized for long. EDIT: our tool usage benchmark is still running, but so far, its performance with tools is dramatically better than its one shot performance. I'm treating Qwen 3.6 Plus as a near-SOTA model now.
neonstatic
If it overthinks everything the way Qwen 3.5 running locally does, then I am not surprised! :)