Part 6 of 7 · Lead intake bot series ~3 min read

What the lead intake bot costs

A coffee a month at SMB lead volume. The fixed cost is essentially zero. If your campaigns are quiet for a week, the bill matches. The variable cost is dominated by Bedrock tokens for the extractors and the composer. Everything else rounds to pennies.

Key takeaways · verified May 2026

  • Fixed cost is essentially zero. Quiet weeks bill nothing.
  • Variable cost is pennies per lead, dominated by Bedrock tokens for the extractors and the composer.
  • ~100 leads/month → under three dollars total.
  • ~1000 leads/month → under ten dollars total.
  • A $10 AWS Budgets alarm catches anything strange before it surprises you.
Cost structure: three tiers A horizontal three-tier cost diagram. Tier one, "Always-free or near it": Lambda invocations for the intake, extractors, composer, and dispatch; DynamoDB pay-per-request for dedupe, audit, and the on-call roster; S3 for archived lead packages and the Drive mirror at small volume; CloudWatch log retention at seven days; EventBridge schedules for the Google Ads polling and the daily digest; SNS topic for hot pings and escalations. Tier two, "Per-lead pennies": Bedrock Haiku tokens for the three extractors plus the composer at fractions of a cent per lead; Bedrock Titan Text Embeddings for the policies and voice file at a tiny one-off cost when files change; SES inbound at $0.10 per 1000 received messages plus a tiny outbound charge for the team draft emails; Meta Graph API calls free within their respective rate limits. Tier three, "Optional": Slack via Amazon Q Developer in chat applications free at this volume, otherwise a per-message charge on a paid Slack tier; SMS notifications via SNS for hot pings outside business hours at cents per message, off by default; AWS Secrets Manager for Meta page tokens and Google Ads API keys at forty cents per secret per month; AWS Budgets alarms at no extra cost. A bottom note reads: a typical small business at 100 leads a month lands well under three dollars total; a busy one at 1000 leads a month lands under ten. tier 1 Always-free or near it Lambda invocations ~ $0.00 DynamoDB (dedupe, audit) ~ $0.05 S3 (archive + Drive mirror) ~ $0.05 CloudWatch (7-day) ~ $0.10 EventBridge schedules ~ $0.00 SNS topic tier 2 Per-lead pennies Bedrock Haiku (extractors) fractions of a cent / lead Bedrock Haiku (composer) fractions of a cent / draft Titan Embeddings (files) tiny, on file change SES inbound + outbound $0.10 / 1000 messages Platform API calls tier 3 Optional Slack via Amazon Q free at this volume SMS for hot pings cents/msg, off by default Secrets Manager $0.40 / secret / month AWS Budgets alarm free ~ 100 leads/month → under three dollars total. ~ 1000 leads/month → under ten.
Fig 6. Three tiers of cost. The bill scales with how many leads land; the floor is nearly zero.

The fixed cost is essentially zero

There is no per-lead licence, no minimum monthly fee, no “starter plan.” If your campaigns are quiet for a week, the AWS bill matches. Lambda, DynamoDB pay-per-request, S3, CloudWatch, EventBridge, and SNS all sit in or near the always-free tier at the volumes a small business sees. Even running the Drive-sync schedule every five minutes and the Google Ads poll every hour, all month long, costs a vanishingly small amount. EventBridge scheduled invocations are essentially free at this volume, and each fire is a single Lambda call that does a small amount of work.

The biggest single line item in tier one is usually CloudWatch log storage. You control that by retaining seven days instead of forever. After day eight the logs are gone and the bill stops compounding.

The variable cost is per-lead pennies

Three things scale with lead volume:

  • Bedrock Haiku tokens for the extractors. Each lead pays for three small model calls (intent, urgency, fit) running against the message text. At small-model rates and the short input lengths most lead messages have, the all-in cost per lead for extraction lands at fractions of a cent.
  • Bedrock Haiku tokens for the composer. Hot-route drafts and warm follow-up drafts each pay for one composer call. The input is the structured lead plus the relevant pricing, voice, and ICP excerpts retrieved from the Knowledge Base. The output is a short reply. Even on the most expensive moves this is well under a cent per lead. The composer doesn’t fire on nurture or reject, so cold leads cost only the extractor tier.
  • Titan Embeddings for the policy docs. One-off per file change. If you edit your pricing doc twice a month and your voice doc once, that’s three embeddings runs totalling tiny fractions of a cent. The embeddings power the citation gate — finding the right policy passage to ground a reply.

Add it up at typical small-business lead volumes (fifty to a few hundred leads a month across all sources). Tier two lands in the under-three-dollars range. Tier one floors are under fifty cents. The all-in monthly bill is a coffee a month. At higher volumes — a busy B2B campaign at a thousand leads a month — tier two becomes the headline number. Even there it stays in the “phone bill, not Netflix subscription” range.

Three traps you’re avoiding

  • Per-seat sales-engagement tools. Many off-the-shelf lead-routing tools charge $50–$300 per seat per month, regardless of lead volume. They assume your SDR team is permanent and full-time. You’re trading a flat per-seat bill for pay-per-use that mostly comes in pennies. The bot runs on the same shape regardless of how many reps are on call.
  • Always-on infrastructure for polling. A naive setup might run a small server that polls Google Ads or Meta every minute. That’s a $30+/month idle bill before it processes a single lead. EventBridge schedules trigger Lambda — you pay only when the schedule fires, and Lambda scales to zero between fires.
  • One large model on every read. A frontier-class model would spend ten to twenty times more per lead for what is, at this scale, a short-input short-output extraction task. Haiku-class models are correctly sized. Reaching for a bigger one because it sounds smarter is the most common way to triple the bill on a system where the per-call output is a small structured object.

When this stops being cheap

The math changes if the campaigns scale up dramatically (a paid acquisition push, a product launch that surfaces in many directories at once) and lead traffic goes into the tens of thousands per month. At that point the tier-two number becomes the headline, and there are levers: cache policy embeddings (already cached after the first lookup), batch the extractors into a single multi-output call, or move the urgency extractor to a smaller dedicated model. Most small businesses, including small teams running paid ads, never need any of those levers.

For everyone below that — and that’s most small businesses — a $10 monthly AWS Budgets alarm catches anything strange before it becomes a surprise.

In plain words

The fixed bill is nearly zero. The variable bill is pennies per lead, dominated by tokens. A typical setup runs at coffee-money for the whole month. Set a budget alarm that fits your expected volume, and the bill can’t surprise you.

All posts