Craft
Can AI Review a PRD? Yes, and What It Catches
Can AI review a PRD? Yes, and it is a different job than writing one. What it catches reliably, what it cannot, and whether it is safe to paste your spec.
Yes, AI can review a PRD, and it is a different job than writing one. A review reads your finished draft against a fixed rubric and reports what is missing or weak: absent acceptance criteria, metrics that cannot fail, error states you never wrote. Writing produces the page. Reviewing tries to break it. Those are opposite jobs, and most tools only do the first.
That distinction matters more than it sounds, so let me be specific about both halves before answering the question every cautious reader actually asks, which is whether it is safe to paste a confidential spec into a model at all.
Review and writing are different jobs
When you ask an AI to "write a PRD," you want it agreeable and additive: fill the template, draft the sections, keep going. When you ask it to review one, you want the opposite temperament. You want it adversarial, looking for the gap, willing to tell you the metric in section three is decorative.
Most "review my doc" features are built on the writing temperament, because that is what general-purpose assistants are tuned to be: helpful. Ask one to critique your spec and it hands back twenty polite suggestions of equal weight, no position on which one sinks the launch. That is the second way AI gets PRD review wrong. A model trained to be agreeable defaults to agreeable, so it spreads its findings flat and refuses to rank them. A review with no ranked severity is just a list, and you will do the easy ones and miss the one that mattered. The split between the two jobs is the whole subject of an AI PRD reviewer versus a writer, if you want the longer version.
What AI catches reliably
The things a model is genuinely good at are the structural absences, the gaps that are mechanical to detect once you read against a checklist instead of from memory. These are the findings I trust:
- Missing or vague acceptance criteria. "Handle errors gracefully" reads like a requirement and specifies nothing. A model spots that a flow has no testable condition attached and flags it, which is exactly the check a human skips at 6pm. The standard to hold criteria against is in what makes acceptance criteria testable.
- Metrics that cannot fail. "Increase engagement" names a direction and hides the cohort, the number, and the deadline. AI reliably catches a success metric with no baseline, no target, and no clock, because the absence is detectable on the page.
- Absent error states and edge cases. The happy path writes itself and nobody files a bug against it. A model walking each flow notices when the empty state, the expired session, and the partial permission are simply not in the spec.
- Missing non-functional requirements. Most drafts say what the feature does and go quiet on how well it has to do it. AI catches the missing performance, reliability, and security numbers, the ones covered in the non-functional requirements checklist.
What these share: each is something that should be on the page and is not, and its absence is invisible to the person who wrote it. You cannot proofread a gap you never noticed the first time. A machine reading against a rubric has no such blind spot. A real review hunts for what is missing, not what is on the page, and that is the one part of the job a model does tirelessly.
What AI cannot do
Here is the line, and it is a hard one. AI can tell you the spec is internally incomplete. It cannot tell you the spec is solving the wrong problem.
A model can confirm your problem statement exists, is stated before the solution, and matches the feature underneath it. It cannot know whether that problem is worth a quarter of engineering time, whether the user you named is the user who actually churns, or whether the thing you are about to build is the thing your market needs this year. That judgment depends on context the document does not contain: your strategy, your customers, the conversations you had last week. The rubric reads the page. It does not read your business.
So treat the verdict as one input, not the decision. Let the machine handle the mechanical completeness pass, then spend your human attention on the question it structurally cannot answer: should this exist at all. The skill of asking that question well is what critiquing your own PRD is really practicing, and it is the skill PMs are never taught. We learn to write specs from day one and never learn to red-team one, so the first real review most drafts get is someone else's, in the comments, a sprint too late to help.
Is it safe to paste my PRD into an AI
This is the question a careful evaluator reaches before the feature one, and it is the right instinct. A PRD is among the most sensitive documents a product team owns: unshipped strategy, named customers, internal metrics. Pasting it into a model whose data policy you do not understand is a real risk, and "trust us" is not an answer.
So here is the concrete one for Thinkr. Your PRD is not used to train models, not ours and not the underlying provider's. The stack runs on Google Gemini through its enterprise API, where customer content is excluded from model training by the provider's terms, not by a setting you have to find and toggle. Your document is processed to produce the review and is not retained as training data. And you can delete your content at any time, which removes it rather than flagging it.
That is the bar a cautious team should hold any review tool to, and it is a fair bar to hold us to. The capability is the reason to use a critique tool. The data handling is the reason you are allowed to.
How to use AI review well
The honest workflow is a split of labor, not a handoff. Let the machine do the first read, the one that needs stamina more than judgment: eleven passes against the draft, every finding classified blocker, major, or minor, with suggested rewrites landed on the line each concerns. That is the pass Thinkr's Critique runs, and it is the part that is genuinely tedious to do by hand at the end of a long day.
Then do the part no rubric can. Read the verdict, fix the gaps it ranked highest, and ask the one question it cannot: is this the right thing to build. If you want the rubric before you run it through anything, the standing version is the PRD review checklist, and the upstream half, catching problems while the spec is still being drafted, lives in how the spec gets generated.
Can AI review a PRD? Yes, reliably, for the structural gaps that get specs sent back. Just keep the one judgment that is yours: a model can tell you the doc is complete. Only you can tell whether it is right.