LLM Judge
Evaluate Q&A pairs using LLM judges
Created by Abhin Rustagi
- Domain
- Upload
- Criteria
- Model
- Results
Select Domain
Choose the domain context for the evaluation. Your choice adjusts the scoring criteria and the prompt guidance.
- Legal: citation accuracy, jurisdiction, fabrication detection
- Medical: medical accuracy, patient safety, evidence-based claims
- Finance: data accuracy, regulatory compliance, risk disclosure
- General: factual accuracy, clarity, completeness
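
As a rough illustration of how a domain choice might adjust the scoring criteria and prompt guidance, here is a minimal Python sketch. The criteria strings come from the domain list above. Everything else is an assumption made for illustration and is not taken from the tool itself: the names `DomainProfile`, `DOMAIN_PROFILES`, and `build_judge_prompt`, the prompt wording, and the fallback to General for an unrecognized domain.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainProfile:
    """Hypothetical container for one domain's scoring criteria."""
    name: str
    criteria: tuple[str, ...]


# Criteria taken from the domain list; the structure itself is an assumption.
DOMAIN_PROFILES = {
    "legal": DomainProfile(
        "Legal",
        ("citation accuracy", "jurisdiction", "fabrication detection"),
    ),
    "medical": DomainProfile(
        "Medical",
        ("medical accuracy", "patient safety", "evidence-based claims"),
    ),
    "finance": DomainProfile(
        "Finance",
        ("data accuracy", "regulatory compliance", "risk disclosure"),
    ),
    "general": DomainProfile(
        "General",
        ("factual accuracy", "clarity", "completeness"),
    ),
}


def build_judge_prompt(domain: str, question: str, answer: str) -> str:
    """Assemble a judge prompt whose guidance depends on the chosen domain.

    Unknown domains fall back to the General profile (an assumption).
    The prompt wording is illustrative only.
    """
    profile = DOMAIN_PROFILES.get(domain.lower(), DOMAIN_PROFILES["general"])
    criteria_lines = "\n".join(f"- {c}" for c in profile.criteria)
    return (
        f"You are evaluating a Q&A pair in the {profile.name} domain.\n"
        f"Score the answer on each of these criteria:\n"
        f"{criteria_lines}\n\n"
        f"Question: {question}\n"
        f"Answer: {answer}\n"
    )


if __name__ == "__main__":
    print(build_judge_prompt("Legal", "What is the filing deadline?", "30 days."))
```

Keeping the criteria in a single lookup table means a new domain can be added without touching the prompt-building logic.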