Six AI models independently tested for mathematical rigor, intellectual honesty, self-awareness, and structural self-correction against the FairMind DNA framework.
All scores are self-reported by the model being tested. Sorted by total score, descending.
Visual comparison across all five benchmark sections.
Per-section leaderboard and observations.
The Truth Violations that models self-reported most frequently.
Click to read the full benchmark report for each model.