GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
gmays
130 points
27 comments
May 05, 2026
Related Discussions
Found 5 related stories in 93.5ms across 8,303 title embeddings via pgvector HNSW
- GLM-5.1: Towards Long-Horizon Tasks zixuanlimit · 481 pts · April 07, 2026 · 59% similar
- Agora-1: The Multi-Agent World Model olivercameron · 97 pts · May 18, 2026 · 56% similar
- Gemini Embedding 2: natively multimodal embedding model panarky · 22 pts · March 10, 2026 · 53% similar
- TLA+ Mental Models r4um · 15 pts · March 23, 2026 · 53% similar
- GLM-5.1 Is Available iamsyr · 24 pts · March 27, 2026 · 53% similar
Discussion Highlights (6 comments)
gertlabs
GLM-5V-Turbo is a model I wanted to like due to its speed and API reliability, but it didn't perform well in our coding and reasoning testing. More recent open source models have made it obsolete. GLM 5.1 is so many light years ahead of it on everything except speed, that I'm not sure why it's still being served. Comprehensive evaluation results at https://gertlabs.com/rankings
muddi900
z.ai will use quantized models in off hours. Buyer beware
julius
Click coordinates. Agentic GUI is really annoying when the multi-modal agent cannot click on x,y coordinates. I tested Qwen3.6, Gemma4, Nemotron3-nano-omni. They fully hallucinate x,y coords. (did not try GLM-5V yet) GPT-5.5 can easily do it. But also Vocaela, a tiny 500M model, is quite good at it. Hope they improve the training for x,y clicking soon on the smallish multi-modals. Recently slopped a http service together just so my local models can click, instead of relying on all the wild ways agents currently hack into the browser (browser-use, browser-harness, agent-browser, dev-browser etc) https://github.com/julius/vocaela-click-coords-http
_pdp_
We just migrated an AI agent from Kimi to GLM and frankly I am surprised by the results. It feels premium. However, both Kimi and GLM can end up in doom loops so be careful how you use them. Without a proper harness the agent can easily get into some tricky situations with no escape. We had to develop new heuristics in our cloud harness just because of this but I am really grateful that we did as the platform feels now more robust. A small price to pay for model plug & play!
desireco42
I've been using GLM pretty much exclusively last 6-8 months. I have access to Anthropic and OpenAI models and others. I always keep returning to GLM, it isn't the best, sometimes I would go to Codex to help it, but overall, especially with Turbo, it is everyday good model. Turbo makes a huge difference in everyday use because it saves you time and you are not in the mood always to wait endlessly.
zozbot234
Looks like this was not an open release, the latest GLM-xV release was 4.6V and Turbo models were never open.