Your team is shipping more code than ever. Code review agents can read the diff and flag what they find right in the PR, but they don't open your product, click a button, or fill out a form.
Today we're launching Quality Review Agent.
For every PR your team opens, an agent loads your product in a real browser and uses it the way one of your customers would. It clicks. It types. It walks every flow the change touches. A code review runs alongside the same diff. You've got full coverage on every PR.
Quality Review Agent spins up a browser on every PR. It clicks, types, and navigates through the change and every surface it affects, and checks that it works.
Coverage runs at three layers:
- Critical flows: The happy path for the change itself.
- Edge cases: Empty states, invalid input, rate limits, error paths.
- Regression: Whether this change broke anything around it.
The agent reads the PR title, description, and diff to decide what to test. A tweak to a dashboard filter, for example, re-tests the charts that depend on it.
Every run posts a list of issues straight to the PR. Each flagged bug comes with a "Fix in Builder" button. Click it, describe the fix in plain English (or let the agent resolve it), and the update pushes back to the same PR for a re-run.
Every flagged bug comes with the full run that produced it. The replay is a video of the agent walking through your product, with three panels synced to the timeline:
- Agent reasoning. What it did and why, at each step.
- Network calls. Every request the agent triggered.
- Console output. Every log and error on the page.
Scrub frame by frame to see exactly what the agent saw. Play it at 8x to skim the boring parts. Jump straight to the second a bug fired. At that exact frame, the network panel has the failed request and the console has the exception.
Pair Quality Review Agent with a code review on the same PR. High-severity blocks a merge. Medium gets a reviewer's eyes before approval. Every flagged issue has a "Fix in Builder" button.
Between Functional Testing and Code Review, every change gets a full first pass before a human reviewer looks at it.
Quality Review Agent runs on GitHub PRs. Support for GitLab, Bitbucket, and Azure DevOps is on the way.
Today, Quality Review Agent runs the moment a PR opens. Soon you can trigger it on a local branch from wherever you code, whether that's Claude Code, Codex or Cursor.
The agent runs against your code and sends back a report with the video, bugs, network calls, console output, and a single command to fix all issues found.
Every PR your team opens gets a real-browser run and a code review on the diff. The person who caused an issue can fix it in Builder, whether that's the PM, designer, or agent who opened the branch.
Scaling across a large team? Request an enterprise trial