How names and prices stay exact

Key takeaways

Three kinds of content are protected: glossary terms, numbers and prices, and IDs and codes.
Each protected bit is swapped for a placeholder before translation, then restored to the exact original after.
The model translates the words around the placeholders but never sees the real names or figures.
Every swap is logged, so you can prove a figure was never altered in translation.
This same lock runs on the way in and the way out — both directions, identical rules.

Three kinds of protected content

Fig 5. Three kinds of protected content, locked before translation and restored after. Glossary terms, numbers and prices, and IDs and codes are each swapped for placeholders; the model translates the words around them; the restore step puts the exact originals back and logs every swap.

Kind 1: glossary terms (the things only you know)

Some words must never be translated because they’re names, not words. Your company is called “Northwind,” not “north wind.” Your product is “FreshKeep,” not “keeps fresh.” A model with no context will sometimes translate these into their literal meaning, which reads as wrong to anyone who knows your brand. The glossary — the Google Sheet from Part 1 — lists every one of these, and the matcher swaps each for a placeholder before translation.

The matching is careful on purpose. It’s case-aware and matches whole words only, so a glossary term won’t accidentally get matched inside a longer, unrelated word. Each entry can also carry a note — “FreshKeep is a product line, never translate” — that’s shown to the agent on hover, so the human review has context too. Because the glossary lives in Drive, adding a new product name the day you launch it is a thirty-second edit, no deploy required; the sync job picks it up within fifteen minutes.

Kind 2: numbers and prices (the things that cost money to get wrong)

Every number in a message is found by a strict pattern and locked before translation: currency amounts (“$42.00”, “€39,90”), plain numbers, percentages, dates, and measurements. The relay doesn’t trust the model to carry a figure faithfully, because it occasionally won’t — it’ll round, it’ll re-format, it’ll convert “1,000” (one thousand) into “1.000” in a language where the comma and period swap roles, and the meaning quietly changes. By locking the exact characters and restoring them verbatim, the figure that leaves is character-for-character the figure that arrived.

There’s one deliberate subtlety: the relay does not auto-convert currencies or units. If a customer writes “€40” the agent sees “€40,” not a guessed dollar amount — converting money is a business decision with a live exchange rate, not a translation, and silently doing it would be exactly the kind of “helpful” error this guardrail exists to prevent. If a conversion is needed, the agent does it on purpose, and the relay protects whatever figure they write.

Kind 3: IDs and codes (the things that must land exactly)

Order numbers, account IDs, tracking codes, SKUs, reference numbers — anything that’s an identifier rather than a word — must land character for character or it’s useless. A tracking code with one wrong character points at nothing. An order number with a flipped digit pulls up someone else’s order. These are matched by shape (mixes of letters and digits, hash-prefixed numbers, known ID formats) and locked the same way. They’re the lowest-risk to detect and the highest-cost to get wrong, which makes them the easiest call in the whole system: never let the model touch them.

Restore, then prove it

After the model returns its translation — with the placeholders still sitting in place, untouched — the restore step swaps each placeholder back to its exact original value. Then it does the part that makes the whole thing trustworthy: it writes every swap to the audit log. For each protected bit, the log records the original value, the placeholder it was given, and the value it was restored to. Because the restored value is always the original value, you can prove — not hope, prove — that no figure, name, or code was altered in translation. If a customer ever disputes “you quoted me $42, the email said $52,” the log settles it in seconds.

This same lock-and-restore runs identically on the way in (translating the customer’s message for staff) and on the way out (translating the staff reply for the customer). One guardrail, both directions, no exceptions. It’s the quiet reason the relay is safe to put in front of real money: the model makes the language fluent, and a deterministic step — not the model — keeps the facts exact.

Next post: the cost breakdown. The whole relay runs in coffee-money territory at SMB volume; Part 6 explains exactly where the dollars go and why the cheap model carries almost all of it.

All posts