Back to InsightsDigital Transformation

"We Created a Monster": Why Big Tech Is Slamming the Brakes on AI Spending

By James HuangJune 22, 2026·Updated Jun 23, 20265 min read
AI Generated Cover for: "We Created a Monster": Why Big Tech Is Slamming the Brakes on AI Spending

Over the past year, the business narrative was simple: get AI into the hands of your employees as fast as humanly possible. The race was on. The tools were magical. The future was now.

But recently, a massive shift hit the boardroom. The honeymoon is officially over. And the bill has arrived.

According to recent reports, early adopters like Amazon, Walmart, Cisco, Uber, and Meta are actively restricting internal AI usage. Not because the technology failed. Because they looked at their server bills and realized they had accidentally created a financial monster.

As we help businesses navigate digital transformation, it is crucial to understand why this is happening and how to avoid the token trap.


The Rise of the Agents (and the Death of Flat-Rate Billing)

For a while, we were all trained to think of AI as cheap. Or even free. But compute is never free.

The initial cost was subsidized by flat-rate subscription models. You paid twenty dollars a month and got unlimited access to a frontier model. It felt like a bargain. But as AI labs like OpenAI and Anthropic shift to usage-based per-token billing, the true cost of artificial intelligence is being laid bare.

A token, for the uninitiated, is the basic unit of data processed by an AI model. Every word you send, every word you receive, every reasoning step in between—tokens. And they add up fast.

This billing shift coincided with a technological evolution: the leap from chatbots to AI Agents.

A chatbot waits for your prompt, answers, and goes to sleep. An AI Agent is autonomous. It loops. It reasons. It executes complex workflows and triggers other agents. As Cisco's Chief Product Officer Jeetu Patel noted, deploying agents requires exponentially more infrastructure. Every single human employee might suddenly have ten, a hundred, or even a thousand AI agents working tirelessly in the background.

The compute drain is staggering.


The Enterprise Reality Check: Burning the 2026 Budget by April

When tech becomes a toy rather than a tool, budgets evaporate.

Take Workato, a software company that saw AI usage spread "like wildfire" among its 1,300 employees. When Anthropic switched to per-token billing in May, Workato's costs spiked 7x on the very first day. Their CIO's reaction? "Holy crap, we built a monster."

Uber faced a similar crisis. Their COO admitted it was becoming impossible to justify the massive token spend against actual consumer feature output. The situation got so out of hand that Uber had burned through its entire allocated 2026 AI budget by April of this year. They have now capped individual employee token spending at $1,500 a month.

At Amazon, engineers were building agents just to climb internal productivity leaderboards. Management had to step in and explicitly warn teams to stop using "AI for the sake of AI."

The pattern is clear. Unchecked AI adoption, combined with per-token billing and autonomous agents, creates a cost explosion that outpaces any measurable return.


The Life Hack: AI Financial Responsibility and Model Routing

So does this mean the AI revolution is stalling? Absolutely not. It means the industry is maturing. We are entering the era of AI Financial Responsibility.

If you are a business leader integrating AI, here is your playbook to avoid bankrupting your IT department.

1. Stop Using a Ferrari to Go to the Grocery Store

You do not need the absolute most expensive frontier model—GPT-4o, Claude 3.5 Sonnet, or whatever the bleeding edge is this week—for every single task.

The hack: implement Model Routing. Assess the fit and purpose of a task. If an employee is summarizing a basic email, route that query to an older, cheaper model. Save the premium tokens for complex coding, deep strategic reasoning, or high-stakes creative work. Match the horsepower to the highway.

2. Leverage Local and Open-Source Models

To cut the cord on massive cloud AI bills, companies are increasingly asking employees to use open-source models that run locally on company servers or directly on employee devices. If you control the infrastructure, you stop paying the token toll to third-party labs. The upfront setup cost is real, but the long-term savings are substantial.

3. Watch the Global Market

Data shows that Chinese AI labs are currently offering tokens at significantly lower prices than their US counterparts, driven by cheaper energy and highly efficient models. This cost advantage is giving them a massive surge in token consumption volume. Keep an eye on global pricing dynamics as you build your tech stack. The cheapest token is not always the best token, but it is worth knowing where the market is heading.


Accelerate Digitality, Sustainably

At the end of the day, true digital transformation is not about blind adoption. It is about aligning cutting-edge technology with actual business efficiency.

Do not let the fear of missing out push you into handing your team a blank check for compute power. Define the ROI. Match the model to the task. Maintain strict governance over your tech stack.

The AI revolution is not slowing down. It is just growing up. And growing up means learning to live within a budget.

Stay ahead of the curve—and under budget.

— James

Originally published on MTS Blog & Research