The Agent Layer

The Agent Layer

Your AI Subscription Is About to Feel Cheap and Expensive at the Same Time

Basic AI chat is becoming cheaper and more accessible. But the moment AI starts acting like an agent, the bill starts looking less like Netflix and more like cloud infrastructure.

The Agent Layer's avatar
The Agent Layer
May 21, 2026
∙ Paid

Here is the warning I think more people need to hear now: the age of the simple AI subscription is ending. For a while, many of us got used to the idea that paying $20 a month for an AI tool meant we had bought the future. It felt almost magical. One subscription, one login, and suddenly we could write, summarize, brainstorm, code, research, and analyze faster than ever.

But that simple story is starting to break. The next wave of AI will be more powerful, more useful, and more agentic than what came before. It will also be priced very differently. And if people do not understand that shift early, they are going to be surprised by how fast the bill can move.

What is happening now is not that AI companies are all “raising prices” in one obvious way. It is more subtle than that. Basic access is still relatively cheap. In some cases it is even getting better at the same price. But the moment you want AI to do more than chat, the logic changes. The moment you want it to run as an agent, search across tools, handle code, browse the web, work through a queue, or stay active for long stretches, the pricing starts to look less like Netflix and more like cloud infrastructure. That is the part many users have not fully noticed yet.

This is where MCP matters. Model Context Protocol is one of the reasons agentic AI is moving so quickly. The official docs describe it as an open standard for connecting AI applications to external systems, and honestly that is the easiest way to understand it. Read my intro to MCP here.

I think of MCP as a universal plug for AI. Instead of building one custom bridge for Google Drive, another for Slack, another for GitHub, and another for a database, MCP gives AI clients a standard way to discover tools, read context, and take action. The official docs even compare it to a USB-C port for AI, which is a very good mental model.

And once you see MCP that way, the real use cases become easy to picture. A developer can let an AI system inspect a codebase, run tests, and propose fixes. A sales team can let an AI assistant pull account notes, read past emails, and draft the next follow-up. A research team can let the model read documents, fetch fresh information, and assemble a report. A company can connect calendars, chats, files, and internal knowledge so the model stops answering in generalities and starts working from real context. The MCP documentation itself points to examples like file systems, databases, GitHub, Slack, and calendars. That is exactly why people are excited. The model stops being a smart textbox and starts becoming a useful operator.

But that is also where the pricing changes. A normal chat response is one thing. An agent that opens tools, reads files, triggers searches, reasons across long context, and runs for multiple steps is something else entirely. That agent may burn tokens, call tools, trigger web search fees, consume credit allowances, or move you into a higher usage tier. MCP does not magically make AI cheaper. It makes AI more connected. And connected AI usually means metered AI. That is the part I want readers to understand before they build big expectations on top of a small subscription.

If you look at ChatGPT, you can already see that split happening in public. The price ladder has widened. There is now Go at $8, Plus at $20, and Pro tiers at $100 and $200. Business has also moved beyond a single fixed seat model and now includes usage-based Codex seats, while extra credits can be purchased when users hit limits. That tells me the company is preparing users for a future where basic access stays simple, but serious execution gets priced separately. ChatGPT is no longer just a chatbot subscription. It is becoming a platform with layers.

Claude is showing the same shift in a different way. Pro still begins around the familiar $20 mark, but Max jumps to $100 and $200, and Anthropic has been unusually open about the reality that usage changes with capacity. It has shown five-hour session limits, weekly limits, and usage that can vary with demand. In early May 2026, it even removed peak-hour reductions for Claude Code on some plans and increased rate limits after securing more compute. That is a polite way of saying something important: when the work becomes agentic and compute-heavy, access itself becomes part of the product.

Gemini is taking another route. Rather than only selling a higher model tier, Google is bundling AI into a bigger ecosystem. Gemini moved from the old AI Premium model to AI Pro and AI Ultra, and Google has added AI credits, AI Studio perks, and even monthly cloud credits inside some plans. So with Gemini, the future feels a little less like “buy more chatbot” and a little more like “buy a broader AI membership.” That may end up being attractive for people who want a lot of experimentation inside one ecosystem, even if production API usage still becomes a separate line item later. LATEST from Google (May 19th):

We're moving from daily prompt limits to a 'compute-used' model. These new, compute-based usage limits in the Gemini app factor in the complexity of your prompt, the features you use and the length of your chat… Your limit will now refresh every five hours until you reach your weekly limit.

Perplexity is also worth watching because it makes the distinction very clear. Pro is still inexpensive by frontier-AI standards. But once you move into Computer, credits enter the picture. And once you move into the API, that billing is separate again. In other words, search is one product, autonomous action is another, and developer access is another. I think that structure is going to become more common across the industry, not less. It is simply a more honest way to price expensive AI behavior.

So my expectation from here is simple.

Basic AI chat will keep getting cheaper, broader, and more bundled. Agentic AI will keep getting more useful, but also more explicitly metered.

Subscriptions will remain important because they lower the barrier to entry.

But credits, usage tiers, and API billing will decide how far people can really go. That does not mean AI is becoming a bad deal. It means we have to stop thinking about it like streaming and start thinking about it like infrastructure. The companies that win from here will not just build the smartest models. They will build the clearest bridge between “what your subscription includes” and “what real work actually costs.”

If I were writing the simplest possible takeaway for readers, it would be this: Your monthly plan buys access, but your agent budget buys outcomes. And the faster MCP, tools, and autonomous workflows spread, the more important that difference becomes.

Don't build your AI strategy in the dark.

AI pricing is no longer just about "per-token" costs. Unlock this section to access our full pricing breakdown across Consumer, Business, Agentic, and Developer API models. See exactly what the top providers are charging, how they structure their hidden fees, and how to optimize your deployment for 2026.

User's avatar

Continue reading this post for free, courtesy of The Agent Layer.

Or purchase a paid subscription.
© 2026 Thomas Uthaug · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture