Pro Max 5x quota exhausted in 1.5 hours despite moderate usage

cmaster11 580 points 526 comments April 12, 2026

Discussion Highlights (20 comments)

cmaster11

For whoever else is having the same problems, it's worth voting on these kinds of issues. There needs to be more transparency over what goes on with our subscriptions.

wg0

Been experiencing similar issues even with the lower tier models. Fair transactions involve fair and transparent measurements of goods exchanged. I'm going to cancel my subscription this month.

comandillos

Quite scared by the fact that the original issue pointing out the actual root cause of the issue has been 'Closed as not planned' by Anthropic. https://github.com/anthropics/claude-code/issues/46829

spiderfarmer

That’s why I switched to Codex. It’s so much more generous and in my experience, just as good. Also, optimizing your setup for working with agents can easily make a 5x difference.

jedisct1

GPT-5.4 works amazingly well. I’ve moved away from Claude and toward open-source models plus a ChatGPT subscription. That setup has worked really well for me: the subscription is generous, the API is flexible, and it fits nicely into my workflow. GPT-5.4 + Swival ( https://swival.dev ) are now my daily drivers.

tedivm

Something similar is happening with GitHub Copilot too. It's impossible to know what a "request" is, and some change in the last couple of months has seen my request usage go up for the same style of work. Toss in the bizarre and impossible-to-understand rate limiting that occurs with regular usage and it's pretty obvious that these companies are struggling to scale.

MeetingsBrowser

I pay for the lowest plan. I used to struggle to hit my quota. Now a single question consistently uses around 15% of my quota

rdevilla

Bubble's bursting, get in.

postalcoder

I had used Claude Code max as my daily driver last year and this sort of drama was par for the course. It's why I migrated entirely to Codex, despite liking Claude, the harness, more. There's this honeymoon period with Claude you experience for a month or two followed by a trough of disillusionment, and then a rebound after a model update (rinse and repeat). It doesn't help that Anthropic is experiencing a vicious compute famine atm.

mannanj

So basically, the Anthropic employee who responded says those 1h cache writes were almost never accessed, so a silent change to a 5m cache is in our best interest and saves cost (justifying why they did this silently). However, his response gaslights us, because the math in the OP's opening post demonstrates this is not true: it shows 26x more reads, so at least in his case the cache is not doing what the Anthropic employee describes. Clearly we are being charged for less optimization here, and the message I'm getting from Anthropic is that if you are in a special situation, your needs don't matter and we will close your thread without really listening.

pxc

It's a bit shocking to me how opaque the pricing for the subscription services by the frontier labs is. It's basically impossible for people to tell what they're actually buying, and difficult to even meaningfully report or compare experiences. How is this normal?

lvl155

Constant complaints about Anthropic. Not much on OAI/Codex. It seems people should just use OAI and come back when they realize compute isn’t free elsewhere.

zkmon

Unless the agent code is open-sourced, there is hardly any transparency in how the agent is spending your tokens and how it calculates them. It's like asking your lawyer why they charged some amount.

vfalbor

Some months ago, I created a piece of software for this reason. It hasn't had any success, but the point is that communities could reduce token consumption: not everything needs an LLM, and you can share results from API calls between agents. Even though my idea was not a success, I think sharing things with each other is a good concept. If you're interested, it's called tokenstree.com

holoduke

I spend the full 20x weekly quota in less than 10 hours. How is that possible? Well, try to mass-translate texts into 30 languages and you will hit limits extremely quickly.

lforster

Lol imagine how much overcharging is going on for enterprise tokens. This is just the beginning.

Nic0

Am I alone in thinking that responses have become slower than usual?

wolvoleo

Yeah, Perplexity used to be great, but they've also clamped down on the 20€ plan. A single deep research query was enough to block me until the end of the month. The thing is, if it's going to be this expensive, it's not going to be worth it for me. Then I'd rather do it myself. I'm never going to pay for a €100 subscription, that's insane. It's more than my monthly energy bill. Maybe from a business standpoint it still makes sense because you can use it to make money, but as a consumer, no way.

tiahura

Also pro max 5x and hit quota for first time yesterday.

stavros

It's crazy, a few weeks ago the limits would comfortably last me all week. This week, I've used up half the limit in a day.
