How a receipt gets categorized

Key takeaways

Categories come from your own chart of accounts in the rules doc — not a generic list.
A vendor-hint lookup handles the easy cases first, with no AI call at all.
Bedrock Haiku 4.5 picks the category for the rest, choosing only from your account list.
A sanity check makes sure the model returned a real category and a confidence score.
Low-confidence picks go to the bookkeeper’s review queue, not straight to the books.

Four gates on every category decision

Fig 4. Four gates between the read fields and the category. Try a vendor hint first. Ask the model only when needed. Sanity-check the answer. Then either file with confidence or send to the bookkeeper. The chosen category and its reason travel with the record.

Gate 1: the vendor hint (the free, instant case)

Most categorization is boring and repetitive. Anything from Shell or BP is fuel. Anything from your one software vendor is software. Anything from the office-supply shop down the road is office supplies. The rules doc holds a short list of these per-vendor hints — vendor name on the left, category on the right. Gate 1 checks the read vendor against that list first. If there’s a match, the category is set right there, with no AI call at all. Over a few weeks of use, this list grows to cover the vendors you see most, and the share of receipts that even reach the model shrinks.

This is the cheapest possible path: a lookup in a short list. The hints are plain text a bookkeeper can edit in Drive, so adding “the new café near the office goes to meals” is a thirty-second edit, not a code change.

Gate 2: ask the model (only when there’s no hint)

For a vendor with no hint — a one-off supplier, a shop nobody’s seen before — the organizer asks Bedrock Haiku 4.5. The prompt is short and tightly framed: here are the read fields, here is the full chart of accounts, pick exactly one category from this list, give a one-line reason, and rate your confidence. The crucial constraint is that the model chooses from your list. It can’t invent “travel-meals-misc” that your bookkeeper has never heard of; it has to return one of the categories that already exists in your books.

Haiku is the right model here: this is a small, well-defined classification job, not heavy reasoning, so the cheap, fast model is plenty. It runs only on the receipts that got past Gate 1, which over time is the minority. The reason it returns matters too — “restaurant name and a tip line suggest a meal” — because if a human ends up reviewing it, that one line saves them re-reading the whole receipt.

Gate 3: the sanity check

An AI answer needs checking before it’s trusted. Gate 3 confirms three things. The category the model returned actually exists in the chart of accounts (if it somehow returned something off-list, that’s treated as no answer). The confidence is a real number, not missing. And the numbers are plausible for the category — a receipt where the tax is larger than the total, or a “fuel” receipt for $4,000, is suspicious enough to flag regardless of how confident the model was. Anything that fails a sanity check is downgraded to low confidence, which means it heads to a human at the next gate.

Gate 4: the review gate (the human check)

The last gate is the one that decides whether a person ever sees the receipt. It compares the category confidence against the threshold in the rules doc. Above the line — a clear vendor hint, or a confident model pick that passed the sanity check — and the receipt is cleared to file with its category attached. Below the line — an unsure model pick, a sanity-check downgrade, a category the business has flagged as “always review” (some businesses want every “entertainment” receipt eyeballed) — and it’s sent to the bookkeeper’s review queue with the proposed category, the model’s reason, and an Approve button.

When the bookkeeper corrects a category — “no, that vendor is office supplies, not software” — the system offers to remember it as a new vendor hint. Accept, and the next receipt from that vendor skips the model entirely and files on its own. The system gets quieter the more it’s used.

Why the gates exist

None of these gates is clever on its own. They’re the steps a careful bookkeeper already takes — recognize the regulars on sight, look up the unfamiliar ones, double-check anything that smells wrong, and ask when genuinely unsure. Putting them in code as four small gates makes the AI a junior assistant that proposes, not a black box that decides. The chart of accounts is the only menu it gets to choose from, every choice comes with a reason, and the unsure ones always reach a human before they touch the books.

Next post: how a categorized receipt actually reaches the books — the clean record written to the expense sheet, the image filed by month, and what happens when the bookkeeper approves or corrects a reviewed one.

All posts