Tagged: Evals

3 articles on evals.

March 30, 2026 • 6 min read

Table Stakes for Pragmatic Development Using LLMs

Updated for 2026: lessons from two years using Claude Code in production. Context engineering, real eval frameworks, model economics, and agent workflows. What works.

Engineering

December 29, 2024

Every Failed AI Product Has the Same Root Cause

The same failure pattern shows up everywhere: teams shipping fast and iterating on vibes instead of building systematic evaluation. Evals aren't a nice-to-have. They're the core competency of any serious AI product team.

Product

June 29, 2024

Why Your LLM Evaluator Is Lying to You

LLM-as-judge evaluators feel like quality assurance but behave like rubber stamps. They fail hardest on the outputs that matter most: edge cases, safety-critical errors, domain-specific nuance. What to do instead.

Engineering

Clint Johnson

1Put Health