Why We Stopped Charging Per Conversation

For a long time, the standard model for WhatsApp platforms was charging per conversation. At first glance, it makes sense: if there’s a conversation, you charge for it. Easy to explain. Seemingly simple to measure.

But when you look closer, that model has a structural issue. It directly affects how you design your product, and how your costs scale.

The Problem with Charging Per Conversation

Charging per conversation means a single message triggers the full cost. A simple “hi” instantly becomes a billable conversation, even if there’s no buying intent, no real system usage, and no value created for anyone.

Think about everything that falls under that same fixed cost: greetings, clarifications, short messages, internal testing, product iterations. A simple three-message flow costs the same as a forty-message flow with complex logic, tool calls, and AI processing.

That’s not a pricing model that encourages you to promote your product. If two thousand people message you just to say hello, you’ve already spent $1.000 (at $0,50 per conversation). No sale happened, but the cost is there.

For companies selling directly through WhatsApp or those with the budget to absorb it, this model can work. But for conversational products that don’t monetize every interaction, the cost becomes disconnected from the value created.

When Pricing Doesn’t Reflect Real Usage

When pricing doesn’t reflect what’s actually happening, product decisions start to shift. You focus on reducing conversations instead of improving the experience. You prioritize revenue-generating flows while neglecting support or service use cases. You avoid building more helpful experiences because every new interaction carries a fixed cost.

Pricing stops being a reflection of value and becomes a constraint on what you’re willing to build.

The Alternative: Charging for What the System Actually Does

A model based on executions and tokens flips that logic. Instead of charging because someone sent a message, you charge for what the system actually processed.

If your flow is simple and efficient, the cost stays low. If it’s more complex and uses more compute or integrations, the cost scales proportionally. No surprises. No hidden cross-subsidies.

	Per Conversation	Per Execution & Tokens
A “hi” with no meaningful outcome	Full conversation cost	Minimal cost (few tokens)
Internal test message	Full conversation cost	Minimal cost
Simple 3-message flow	Full conversation cost	Low, usage-based cost
Complex flow with AI, APIs, logic	Full conversation cost	Higher, processing-based cost
40-message flow with tools and logic	Full conversation cost	Cost reflects actual work done

The difference is simple: in a per-conversation model, everything costs the same regardless of value. In an execution-based model, you pay for what you use.

Why This Matters for Your Product

When pricing aligns with real usage, you can make product decisions without fear. You can iterate faster, test new flows, and improve user experience without every experiment feeling like a financial risk.

An execution-based model is more transparent, more predictable, and more aligned with how conversational products actually operate. It doesn’t penalize user activity. It charges for the real work your system performs.

And that fundamentally changes how you build.

The Problem with Charging Per Conversation

When Pricing Doesn’t Reflect Real Usage

The Alternative: Charging for What the System Actually Does

Why This Matters for Your Product

The conversational AI your business needs to grow