
I don’t think there is any question about it. Your AI cost crisis can only be attributable to human error.
Pay for intelligence when you need intelligence. If you’re doing the same task twice, have the model write a program to do the task for both of you.
I put $30 on the Anthropic API three months ago. I burned $2.70 in the first few days figuring out my approach. Twenty cents since. Less than $3 total, and most of that was tuition.
That twenty cents built a text analysis engine. Sentence tokenizer, statistical metrics, entropy calculations, composite scoring for a website I built. The model wrote the code, I reviewed it, I deployed it as PHP on a Linux box I already own. The electricity cost is a rounding error on my power bill.
I keep reading about companies spending six figures a month on AI tokens and I’m doing the blinking eye meme. Putting every customer ticket, every document summary, every code review through a model, thousands of times an hour, 24/7, paying per token every time, seems crazy when the model could build you a deterministic tool instead.
Deterministic work belongs in traditional code.
But the integration pattern the industry has settled on is “put the model in the hot path.” Keep the AI in the loop on every request, forever. Perpetual token cost for work that stopped being an AI problem the moment someone understood the requirements. Your output changes when a new model version ships. Your pipeline breaks not because your code changed but because theirs did.
The model can write its own replacement for most of these use cases. Ask it to classify tickets? It can write you a classifier. Ask it to score text against a rubric? It can write the scoring engine. One build session. One deployment. Done. Stable output you control.
Companies pay the model to do the same work over and over because that’s how the tooling is sold. The SDKs make it easy to call the API. The tutorials show you how to put the model in your pipeline. The pricing page shows per-token costs that look cheap until you multiply by volume. Nobody in that funnel is suggesting you use the model to build something and then turn it off.
This is the SaaS treadmill applied to AI. Recurring revenue for the provider. Recurring cost for the customer. For work that could be a one-time build.
AI APIs are the right call when you need reasoning on novel inputs. Creative work, ambiguous classification, anything where the rules can’t be fully specified in advance. Pay for intelligence when you need intelligence. But if you’re sending the same shaped request a thousand times a day and getting predictable outputs, you don’t have an AI problem. You have an engineering problem. And the AI is the best engineer available to solve it for you once, without ever touching API billing.
Twenty cents. The result runs on its own.