🔍 Hallucination Detection Results

Generated on 2025-12-09 08:33:38

Datasets Analyzed: 1
Total Questions: 1516
Total Issues Found: 742
Avg Issues/Question: 0.49
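
The average above appears to be total issues divided by total questions, rounded to two decimals. The report does not state the formula, so this is an assumption; the function name `avg_issues_per_question` is hypothetical and used only for illustration. A minimal sketch that reproduces the figure from the two totals:

```python
def avg_issues_per_question(total_issues: int, total_questions: int) -> float:
    """Return issues per question, rounded to two decimals.

    Assumption: the report's average is a plain ratio of the two totals.
    Returns 0.0 when there are no questions, to avoid dividing by zero.
    """
    if total_questions == 0:
        return 0.0
    return round(total_issues / total_questions, 2)


# Totals taken from the summary in this report.
print(avg_issues_per_question(742, 1516))
```

With the totals from this report, 742 / 1516 is about 0.4894, which rounds to the 0.49 shown.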

Dataset Results

date
Judge Type: logic_based | Modes: hot, self_refine, ltm, tot, cot, cove
1516 questions, 438 with issues
HOT
SELF_REFINE
LTM
TOT
COT
COVE