Agent Grading
Evaluate agent performance using rubric-based grading
ECE Score
0.00
Target: < 0.10
Expected Calibration Error
Brier Score
0.00
Target: < 0.15
Probabilistic accuracy measure
Factuality
0.00
Target: >= 0.80
Factual accuracy rate
Avg Overall Score
0.0%
Top Performer
N/A
0%
Needs Improvement
0
agents below 70%
Calibration Status
ECEBrierFact