Part 2 of 7 · Review responder series

How a review reaches the responder

Reviews don’t arrive at your business through one door. Google sends an instant notification. Facebook used to push too, but Meta deprecated that webhook in early 2025 — in 2026 you reach Facebook reviews through a third-party aggregator instead. Yelp doesn’t tell you at all — you have to ask. The intake’s job is to fold those three different mechanisms into one queue, drop duplicates, and screen out junk before any AI sees a single word.

Key takeaways

  • Google Business Profile pushes via Pub/Sub the moment a review lands; the cloud receives the notification and fetches the full review by ID.
  • Meta deprecated the Facebook recommendations webhook in January 2025; in 2026 the Facebook lane runs through a third-party aggregator (Birdeye, Yext, ReviewTrackers).
  • Yelp has no push for SMBs, so the cloud polls the Fusion endpoint hourly — well inside the 500-call free daily tier.
  • Reply APIs differ: Google supports public posting; Facebook and Yelp are draft-only in 2026.
  • One Lambda per lane normalizes into a common shape, dedupes by review ID, and runs cheap in-Lambda screens for spam before any AI runs.

Three lanes at the door

[Figure: three lane columns feeding one queue. Lane 1 · push (Google Business Profile): a new review fires a Pub/Sub notification, the cloud fetches the full review by ID and emits a normalized record; latency in seconds. Lane 2 · push via aggregator (Facebook recommendations): Meta deprecated its webhook in Jan 2025, so an aggregator (Birdeye, Yext, ReviewTrackers, or similar) pushes events, the cloud verifies the aggregator's signature and emits the same record; reply is draft-only. Lane 3 · poll (Yelp and other platforms): no push for SMBs, the cloud polls on a schedule and diffs against the last-seen review ID, well inside the 500-call/day free tier; reply is draft-only. All three lanes write into one shared "normalize, dedupe, screen" row that emits to the responder in one shape.]
Fig 2. Three lanes, one queue. Push when the platform offers it, poll when it doesn’t. Cheap filters first — nothing reaches the AI until the platform-specific noise is stripped.

Why three different mechanisms

The three platforms make very different choices about how to tell a business their listing got a review, and the intake mirrors those choices honestly instead of pretending they’re uniform.

Google pushes natively, and that’s the easy lane. When a review lands on a Google Business Profile, Google fires a notification through Pub/Sub at a URL the cloud is listening on, and the responder can have the review in hand within seconds. Google is also the only one of the three that offers a public reply API to a regular small-business owner, so this lane is fully two-way: read in, reply out, no human in the middle for the safe cases.
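Here's a minimal sketch of the receiving end, assuming a Python Lambda sitting behind the URL registered as the Pub/Sub push endpoint. The base64 envelope is Pub/Sub's documented push format; the notification field name and the v4 fetch path follow Google's Business Profile docs but are worth verifying against whatever API version you're on, and the two helpers are placeholders.

```python
import base64
import json

import requests  # assumes requests is bundled with the Lambda

def fetch_oauth_token() -> str:
    """Placeholder: mint an OAuth2 access token for the Business Profile API."""
    raise NotImplementedError

def enqueue(record: dict) -> None:
    """Placeholder: write the record into the shared intake queue."""
    raise NotImplementedError

def handler(event, context):
    # Pub/Sub push wraps the notification as {"message": {"data": <base64 JSON>}}.
    envelope = json.loads(event["body"])
    payload = json.loads(base64.b64decode(envelope["message"]["data"]))

    # Resource name of the new review, e.g. accounts/{a}/locations/{l}/reviews/{r}.
    # Field name assumed from the NEW_REVIEW notification shape.
    review_name = payload["review"]

    resp = requests.get(
        f"https://mybusiness.googleapis.com/v4/{review_name}",
        headers={"Authorization": f"Bearer {fetch_oauth_token()}"},
        timeout=10,
    )
    resp.raise_for_status()
    enqueue(resp.json())  # the normalize step below folds this into the common shape
    return {"statusCode": 204}  # any 2xx acks the push so Pub/Sub stops retrying
```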

Facebook used to push too, but stopped. Meta deprecated the Page recommendations webhook in early 2025 (Graph API v22.0); the old ratings field no longer fires, and there’s no documented reply API for a recommendation. The realistic Facebook lane in 2026 goes through a third-party aggregator — Birdeye, Yext, ReviewTrackers, or whichever one the business already uses to monitor listings — that watches the page for you and forwards normalized events to a webhook URL you control. The shape of the lane is the same; the source of the push moved from Meta directly to whichever aggregator you connect. Replies on Facebook are draft-only: the responder writes the reply, the human pastes it into the Pages app.
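What the cloud does with an aggregator push is mostly signature checking. The HMAC-SHA256-over-raw-body scheme and hex-encoded header below are assumptions standing in for whatever your vendor documents; the constant-time comparison is the part worth copying either way.

```python
import hashlib
import hmac
import os

def verify_aggregator_signature(raw_body: bytes, signature_header: str) -> bool:
    """Reject any webhook event whose signature doesn't match the shared secret.

    Assumes HMAC-SHA256 over the raw request body, hex-encoded in a header;
    swap in the scheme your aggregator actually documents.
    """
    secret = os.environ["AGGREGATOR_WEBHOOK_SECRET"].encode()
    expected = hmac.new(secret, raw_body, hashlib.sha256).hexdigest()
    # compare_digest runs in constant time, so a forger learns nothing
    # from how quickly a bad signature is rejected.
    return hmac.compare_digest(expected, signature_header)
```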

Yelp doesn’t push, and its reply API is enterprise-only. The honest workaround is a polling job that wakes up on a schedule, asks Yelp’s public Fusion endpoint “anything new?”, and queues whatever it didn’t see last time. Yelp’s free tier is 500 calls per day, so hourly polling per listing is well inside the budget. The same polling pattern works for niche platforms (Tripadvisor, Healthgrades, OpenTable, BBB; every industry has its own). The rare review that lands at 2:47pm gets processed at 3pm, which is still inside the same business hour for any practical reply. Yelp’s reply path is also draft-only for SMBs: the cloud writes the reply, the human pastes it into biz.yelp.com.
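The polling loop itself is small. A sketch, using Yelp's public Fusion reviews endpoint and an in-memory stand-in for the last-seen store (in the real pipeline those IDs live in the same table the dedupe step reads):

```python
import os

import requests

def poll_yelp(business_id: str, seen_ids: set[str]) -> list[dict]:
    """One poll cycle: fetch the listing's recent reviews, return only unseen ones."""
    resp = requests.get(
        f"https://api.yelp.com/v3/businesses/{business_id}/reviews",
        headers={"Authorization": f"Bearer {os.environ['YELP_API_KEY']}"},
        timeout=10,
    )
    resp.raise_for_status()

    # Diff against what the last cycle saw; only the new IDs get queued.
    fresh = [r for r in resp.json()["reviews"] if r["id"] not in seen_ids]
    seen_ids.update(r["id"] for r in fresh)
    return fresh
```

A scheduled trigger invokes this hourly per listing; at 24 calls a day per listing, that leaves plenty of headroom in the 500-call budget.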

Push is faster, poll is more universal. Auto-reply works on Google today and is draft-only on Facebook and Yelp. Mixing all three in the same intake means the responder doesn’t care which platform a given review came from once it’s in the queue — the downstream code never branches on source, only on content. The auto-vs-draft difference is a single per-source flag the dispatch column reads at the very end.
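Concretely, that flag can be as small as a lookup table read at the very end; the helper names here are illustrative, not a real API:

```python
# Reply capability is a per-source flag, read only at dispatch time; the
# responder itself never branches on platform, only on content.
REPLY_MODE = {
    "google": "auto",     # public reply API: post directly for safe cases
    "facebook": "draft",  # no public reply API for recommendations
    "yelp": "draft",      # Respond-to-Reviews is enterprise-partner-gated
}

def post_reply(review: dict, text: str) -> None: ...  # hypothetical: calls Google's reply API
def save_draft(review: dict, text: str) -> None: ...  # hypothetical: writes to a drafts table

def dispatch(review: dict, reply_text: str) -> None:
    if REPLY_MODE[review["source"]] == "auto":
        post_reply(review, reply_text)
    else:
        save_draft(review, reply_text)
```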

What “normalize” actually means

Each platform hands the cloud a slightly different shape: Google gives a star rating from one to five and a free-text comment; Facebook offers a binary “recommends/doesn’t recommend” and a comment; Yelp gives a star rating, a comment, and sometimes a photo. Normalization folds those into one common review object: a source name, a stable review ID, a numeric score on the same one-to-five scale (binary recommendations map to 5 or 2 with a flag), the comment text, the author display name, the timestamp, and a small bag of platform-specific extras kept for engineering reference.
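A sketch of that common object and one of the folds, with raw field names assumed rather than taken from any real payload:

```python
from dataclasses import dataclass, field
from typing import Any

@dataclass
class Review:
    source: str      # "google" | "facebook" | "yelp" | ...
    review_id: str   # stable platform ID, used for dedupe
    score: int       # always 1-5 after normalization
    text: str
    author: str
    created_at: str  # ISO-8601 timestamp
    binary_origin: bool = False  # True when score was mapped from a recommendation
    extras: dict[str, Any] = field(default_factory=dict)  # platform-specific leftovers

def normalize_facebook(raw: dict) -> Review:
    """Fold Facebook's binary recommendation onto the shared 1-5 scale."""
    recommends = raw["recommendation_type"] == "positive"  # field name assumed
    return Review(
        source="facebook",
        review_id=raw["id"],
        score=5 if recommends else 2,  # the mapping the intake standardizes on
        text=raw.get("review_text", ""),
        author=raw.get("reviewer_name", "unknown"),
        created_at=raw["created_time"],
        binary_origin=True,
        extras={"raw": raw},  # keep the original payload for engineering reference
    )
```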

The reason for putting normalization here, before anything else, is so the rest of the system reads exactly one kind of message. The responder doesn’t have to know what Yelp’s rating field is called; it just reads score and text.

Dedupe and screen, before any AI runs

Two free filters sit between the lanes and the responder.

Dedupe is a one-line check against a small table of review IDs the cloud has already processed. Webhooks retry; platforms occasionally fire the same event twice when their backend isn’t sure a delivery succeeded; and the polling job, by design, asks the same question on a loop. Without dedupe you’d pay to draft the same reply twice and then have to figure out which copy actually got posted. With it, the second arrival is dropped silently and counted only in metrics.
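A minimal version of that check, assuming the ID table lives in DynamoDB: a conditional put makes “record this ID” and “have we seen it?” a single atomic call, so two Lambdas racing on the same retry can’t both win.

```python
import boto3
from botocore.exceptions import ClientError

table = boto3.resource("dynamodb").Table("seen-reviews")  # table name assumed

def first_time_seen(review_id: str) -> bool:
    """Record the ID and report whether it was new; False means drop the event."""
    try:
        table.put_item(
            Item={"review_id": review_id},
            ConditionExpression="attribute_not_exists(review_id)",
        )
        return True
    except ClientError as err:
        if err.response["Error"]["Code"] == "ConditionalCheckFailedException":
            return False  # a retry or double-fire: drop silently, bump a metric
        raise
```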

Screen handles the obvious junk — banned-words spam (the same crypto-pump comment posted to a thousand businesses), profanity floods (the same review re-posted under different names by an angry ex-employee), or same-author rate-limit hits (one reviewer posting twelve reviews in an hour). The screen runs in plain Lambda code with a small banned-words list and a frequency check; no AI involved. Reviews that fail the screen don’t reach the responder — they’re logged for you to inspect (and report to the platform) but they don’t cost a single model call.
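The whole screen fits in a few lines of plain Python. The phrase list and threshold below are placeholders, and in production the per-author counter would live next to the dedupe table rather than in process memory, which resets on every Lambda cold start:

```python
import time
from collections import defaultdict, deque

BANNED_PHRASES = {"crypto pump", "guaranteed returns"}  # placeholder list
MAX_PER_AUTHOR_PER_HOUR = 3                             # placeholder threshold

_recent_by_author: dict[str, deque] = defaultdict(deque)

def passes_screen(author: str, text: str) -> bool:
    """Free junk filters: banned-phrase spam and a same-author rate limit."""
    lowered = text.lower()
    if any(phrase in lowered for phrase in BANNED_PHRASES):
        return False  # logged and reportable, but never reaches the model

    now = time.time()
    window = _recent_by_author[author]
    while window and now - window[0] > 3600:
        window.popleft()  # slide the one-hour window forward
    window.append(now)
    return len(window) <= MAX_PER_AUTHOR_PER_HOUR
```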

The point of doing both before the AI runs is straightforward: token spend is the only line on the bill that grows fast, and most junk is identifiable without it. Free gates first, paid gates only when the message has already proved it’s a real review.

What this hands to the next post

By the time a review leaves the intake, it’s in one shape, with a stable ID, no duplicates, and no obvious spam. The next post is about what the responder does with that — how it actually reads the review, extracts what the customer is praising or complaining about, and starts to decide which of the four moves applies.
