Qwen3.7-Max: The Agent Frontier

kevinsimper 650 points 255 comments May 20, 2026
qwen.ai · View on Hacker News

Discussion Highlights (20 comments)

goyozi

These are very good numbers. I still don’t get why they don’t compare against latest competitor versions in these posts, it’s not like we’re all not going to notice.

bratao

It is super strange that all last (3?) releases they keep comparing older models such as Opus-4.6.

tarruda

Looking forward to more open weight releases from Qwen, especially 122B and 397B.

bsenftner

Any reports from people using their coding agent(s)?

tekacs

As they start to release more proprietary models, I so wish that they partnered with one of the major US hyperscalers to allow using these models through something US-domiciled. Totally understand why it may not be reasonable or in their best interest (and that the US is _absolutely_ not doing the same reflexively). But it would be lovely to be able to try these out on production workloads in earnest.

dfansteel

Can anyone check its knowledge base for me? I’m honestly not able to run it and the Qwen models I can run censor information critical towards the Chinese government. Tiananmen Square is the first place to start.

howmayiannoyyou

I can't bring myself to use any model that trains or sends telemetry back to my country's primary competitor/adversary. I don't care how much money is saved.

XCSme

Any info on pricing and latency?

esafak

Does anyone have experience with the Alibaba Cloud Model Studio that serves these qwen models?

goldenarm

The non-hallucination rate in AA-omniscience is SOTA, better than Opus 4.7, Gemini 3.1 Pro and GPT5.5! Congrats to the team

eddyaipt

The pattern I trust most is adding a small verification artifact after every external action. Agents usually fail from silent state drift faster than from lack of reasoning depth.

ndom91

Is this one of those ones where they'll drop the huggingface release a week later? Or do we know for sure that this is staying proprietary?

jdw64

QWEN really hits the sweet spot it's cheap, fast, and actually good.

hmaddipatla

The tokenomics and value for capability, context and latency look like they could deliver super competitive offer - what would it take for you to switch??

briga

I was getting dangerously close to my weekly Claude Code limit last night so I had Claude set up Qwen3.6 with llama.cpp and OpenCode. Honestly it's a great (free!) alternative to Claude Code--certainly more than good enough for a lot of smaller less complex tasks. I'm excited to try this new version. The fact that open-source models are so close to the frontier is very impressive.

flakiness

I'm using pi agent and love to try qwen models (hosted). What are the good options? The official provider doesn't include Alibaba. Is OpenRouter etc. fast enough? (As a reference, DeepSeek v4 is severely throttled on these proxy services.)

indigodaddy

Is it multimodal/vision?

xiaoluolyg

congrats to qwen teams, remarkable

aliljet

Where can a user reasonably host this in an affordable way to access the local LLM revolution?

cft

Downloading this and cancelling Google Antigravity Pro at the same time: I had a Google Pro account that I inherited from buying a Pixel 9 XL - it's free for a year after a flagship Pixel phone purchase. After a year they started charging for it, and i tolerated it, because Flash was usable in Antigravity for dumb auxiliary tasks that I did not want to waste GPT/Opus on. It had a separate generous quota from Gemini 3.1 Pro. Now with Flash 3.5 they combined the quotas with Pro, such that on a Google pro account you can work 4-5 hours per week in Flash. And by the way, 3.1 Pro is useless for programming, compared to Codex/Opus

Semantic search powered by Rivestack pgvector
8,303 stories · 78,303 chunks indexed