Pro Max 5x quota exhausted in 1.5 hours despite moderate usage
cmaster11
580 points
526 comments
April 12, 2026
Related Discussions
Found 5 related stories in 51.3ms across 4,351 title embeddings via pgvector HNSW
- Claude Code users hitting usage limits 'way faster than expected' samizdis · 293 pts · March 31, 2026 · 64% similar
- Claude Code users hitting usage limits 'way faster than expected' steveharing1 · 18 pts · April 02, 2026 · 63% similar
- Claude usage limits hitting faster than expected Austin_Conlon · 11 pts · March 31, 2026 · 60% similar
- Claude Code adjusting down 5hr limits laacz · 27 pts · March 26, 2026 · 58% similar
- Anthropic discourages Claud demand during peak productivity hours dude250711 · 15 pts · March 26, 2026 · 58% similar
Discussion Highlights (20 comments)
cmaster11
For whoever else is having the same problems, worth voting these kind of issues. There needs to be more transparency over what goes on with our subscriptions.
wg0
Been experiencing similar issues even with the lower tier models. Fair transactions involve fair and transparent measurements of goods exchanged. I'm going to cancel my subscription this month.
comandillos
Quite scared by the fact that the original issue pointing out the actual root cause of the issue has been 'Closed as not planned' by Anthropic. https://github.com/anthropics/claude-code/issues/46829
spiderfarmer
That’s why I switched to Codex. It’s so much more generous and in my experience, just as good. Also, optimizing your setup for working with agents can easily make a 5x difference.
jedisct1
GPT-5.4 works amazingly well. I’ve moved away from Claude and toward open-source models plus a ChatGPT subscription. That setup has worked really well for me: the subscription is generous, the API is flexible, and it fits nicely into my workflow. GPT-5.4 + Swival ( https://swival.dev ) are now my daily drivers.
tedivm
Something similar is happening with GitHub Copilot too. It's impossible to know what a "request" is and some change in the last couple of months has seen my request usage go up for the same style of work. Toss in the bizarre and impossible to understand rate limiting that occurs with regular usage and it's pretty obvious that these companies are struggle to scale.
MeetingsBrowser
I pay for the lowest plan. I used to struggle to hit my quota. Now a single question consistently uses around 15% of my quota
rdevilla
Bubble's bursting, get in.
postalcoder
I had used Claude Code max as my daily driver last year and this sort of drama was par for the course. It's why I migrated entirely to Codex, despite liking Claude, the harness, more. There's this honeymoon period with Claude you experience for a month or two followed by a trough of disillusionment, and then a rebound after a model update (rinse and repeat). It doesn't help that Anthropic is experiencing a vicious compute famine atm.
mannanj
so basically the anthropic employee who responded says those 1h caches were writes were almost never accessed, so a silent 5m cache change is for our best interest and saves cost. (justifying why they did this silently) however his response gaslights us because in the OPs opening post his math demonstrates this is not true, it shows reads 26x more so at least in his case the cache is not doing what the anthropic employee describes. clearly we are being charged for less optimization here and being given the message (from my perspective by anthropic) that if you are in a special situation your needs don't matter and we will close your thread without really listening.
pxc
It's a bit shocking to me how opaque the pricing for the subscription services by the frontier labs is. It's basically impossible for people to tell what they're actually buying, and difficult to even meaningfully report or compare experiences. How is this normal?
lvl155
Constant complaints about Anthropic. Not much on OAI/Codex. It seems people should just use OAI and come back when they realize compute isn’t free elsewhere.
zkmon
Unless the agent code is open-sourced, there is hardly any transparency in how the agent is spending your tokens and how does it calculate the tokens. It's like asking your lawyer why they charged some amount.
vfalbor
Some months ago, I created a software for this reason, it has no success, but the thing is that communities could reduce tokens consumption, not all is LLM, you can share things from API calls between agents. Even my idea was no success I think it is a good concept share things each others, if you have some interest it's called tokenstree.com
holoduke
I spend full 20x the week quota in less than 10 hours. How is that possible? Well try to mass translate texts in 30 languages and you will hit limits extremely quick.
lforster
Lol imagine how much overcharging is going on for enterprise tokens. This is just the beginning.
Nic0
I'm i alone to think that it become slower that usual to get responses?
wolvoleo
Yeah perplexity used to be great but they've also clamped down on the 20€ plan. Only one deep research query was enough to block me until the end of the month. The thing is, if it's going to be this expensive it's not going to be worth it for me. Then I'll rather do it myself. I'm never going to pay for a €100 subscription, that's insane. It's more than my monthly energy bill. Maybe from a business standpoint it still makes sense because you can use it to make money, but as a consumer no way.
tiahura
Also pro max 5x and hit quota for first time yesterday.
stavros
It's crazy, a few weeks ago the limits would comfortably last me all week. This week, I've used up half the limit in a day.