You don't know what your AI image pipeline ships.
Sentinel does.
Sentinel scores every input and every output image, so you can see what you are shipping and fix it at scale.
You're generating at scale.
Nobody's checking the output.
You can't see what's broken
Face distortions, wrong backgrounds, skin-tone issues, logo placement errors. At API scale, defects stay invisible until a user complains.
Human review doesn't scale
Manual QA works for dozens of images. Not for thousands. The moment you scale, quality control becomes the bottleneck, or disappears.
Bad outputs cost real money
Every image that fails QA after delivery is a refund, a complaint, or a churn event. The cost does not show in COGS. It shows in your NRR.
Swap your API key.
See what you ship.
Evaluations
2,924
last 30 days
Pass rate
89.53%
of all evaluations
Avg score
0.91
out of 1.00
Gate failures
20
0.7% of evaluations
Failed judges
1,133
0.4 per evaluation
Regenerate rate
9.68%
lower is better
Catches a weak input before you spend a GPU second on it.
Every result graded against the checks that matter for its use case.
One ID ties the input, the score, and the retry together.
Show your customers the quality of every image you ship.











1KOriginal
Custom
New recipe
Build another chain from scratch
Marketplace packages
Zalando bundle
Tag removal, mannequin, grey backdrop
Magic
Pinpoint a change
Click a spot, describe the change
Restyle with a reference
Brush an area, drop a reference
Add a logo
Place a brand logo with one instruction
Drop into a new scene
Place the subject anywhere
Compose
Cut out on white
Clean catalog composition on white
Smart resize
Recompose to any ratio, 1K to 4K
Reframe to a new ratio
Extend the canvas, keep the subject
Create custom workflows that scale with Sentinel.
Score the input and the output to deliver at scale, the way BetterPic does.
Step 02 below is Sentinel on the input, step 05 is Sentinel on the output.








Eight selfies to start. The customer sends their source photos.
Built for your use case,
not the generic one.
Generic quality
Universal checks that apply to any AI image generation. Runs automatically on every pipeline, zero configuration.
Use-case specific
Pre-tuned criteria for production pipelines. Headshots, fashion, on-model. Each one with its own quality standard.
Your custom rules
Define what matters to your business. Sentinel checks your edge cases on every single run, at any volume.
Send a generated image. Get a verdict back.
One POST to the Sentinel API. You send the image, the task, and what good looks like. Sentinel returns a verdict you can branch on, no quality model of your own to build.
POST /api/v1/evaluate with the image URL, the task type, and a description of what it should be. Authenticate with an x-api-key header.
Every evaluation returns pass, soft_fail, or hard_fail, plus a weighted pass rate to threshold on yourself.
Pass reference images and evaluation instructions so Sentinel scores against your use case, not a generic one.
"generated_image_url": "https://cdn.example.com/headshot_9a8b.jpg",
"task_type": "headshot",
"task_description": "Professional headshot, neutral background",
"evaluation_instructions": "Skin natural, identity matches the reference",
"reference_images": [{ "url": "...selfie.jpg", "role": "identity" }]
}
{
"eval_id": "ev_3f2d8a91",
"status": "completed",
"verdict": "pass",
"overall_passed": true,
"weighted_pass_rate": 0.94
}
More from Runflow
Stop shipping bad images.
Quality control is not optional at scale. It is what separates an AI product from a reliable one.
Create a free account
Add your API key and get your first quality insights on everything you ship. No call needed.
- →Swap your key, see what you ship
- →Input and output scoring on every call
- →Free to start
Build with Sentinel
Want Sentinel wired into a custom workflow, with input gates and automated next steps? We build that with you.
- →A custom evaluation schema for your use case
- →Input gates and automated next steps
- →Built hands-on with our team
See quality scoring in action
Try the free fashion product scorer. Upload a garment image, get an instant AI readiness score.