Some Things You Can't Write a Rule For
AI Judge evaluates your data against plain-English criteria using LLM evaluation. The third layer — after schema validation and business rules have already passed.
AI Judge Only Runs When It Should
Schema validation and business rules run first. AI Judge only evaluates data that already passed structural and deterministic checks — no wasted LLM cost on invalid payloads.
Structural — types, constraints, required fields
Deterministic — cross-field logic, lookups
Semantic — LLM evaluation of plain-English criteria
Plain English, Not Regular Expressions
Write criteria in natural language. Assign specialist personas for different checks. The LLM evaluates each criterion independently.
[ "Functions should follow SRP", "No hardcoded credentials", "Variable names are descriptive" ]
[
{
"criterion": "Functions follow SRP",
"persona": "Senior software architect"
},
{
"criterion": "No SQL injection risks",
"persona": "Application security engineer"
},
"Variable names are descriptive"
]Each criterion is evaluated independently by the LLM
Per-criterion persona overrides the gate-level default persona
Each criterion returns a verdict (pass/fail), confidence (0-1), and reasoning
Configurable threshold (default 1.0 = all must pass)
{
"criterion": "Functions follow SRP",
"verdict": "fail",
"confidence": 0.85,
"reasoning": "The processOrder()
function handles validation,
payment, and notification —
three distinct responsibilities."
}When AI Is Uncertain, Humans Decide
Every criterion gets a confidence score. Low confidence automatically routes to human review. You control the threshold and the fallback behavior.
onFail: "reject" or onFail: "review" for human override.AI Verdict + Human Judgment
When AI Judge is uncertain, it routes to a human reviewer. The reviewer sees the AI's verdict, confidence, and reasoning — then makes the final call.
Reviewers don't need a Rynko account. They receive a magic link via email, click it, and see the full review context.
- One-click access to review interface
- 2-hour link expiry for security
- Resend capability if link expires
Every decision — AI and human — is permanently logged. Both judgments are captured side by side.
- AI verdict, confidence, and reasoning
- Reviewer identity, timestamp, comment
- EU AI Act Article 14 compliance
Three-Layer Prompt Injection Defense
AI Judge processes untrusted data from your agents. The security model assumes every payload is adversarial.
Pay for What You Use
AI Judge runs cost 5x standard run credits, reflecting the underlying LLM cost. Available on paid tiers only.
| Tier | AI Judge Runs / Month | Max Criteria / Gate |
|---|---|---|
| Free | Not available | - |
| Starter | 500/mo | 10 per gate |
| Growth | 5,000/mo | 20 per gate |
| Scale | 25,000/mo | 50 per gate |
Each AI Judge run consumes 5 standard run credits. Cost visibility is available in the dashboard.
What Can AI Judge Evaluate?
Anything that requires judgment, context, or domain expertise that you cannot reduce to a deterministic rule.
Ready to Add AI Evaluation?
Set up criteria in plain English, let the LLM evaluate, and route uncertain cases to humans. Three layers of validation, one API call.