Skip to main content
CJ
  • Articles
  • Projects
  • Contact

Tagged: Evals

3 articles on evals.

Table Stakes for Pragmatic Development Using LLMsTable Stakes for Pragmatic Development Using LLMs
March 30, 2026•6 min read

Table Stakes for Pragmatic Development Using LLMs

Updated for 2026: lessons from two years of using Claude Code in production. Context engineering, real eval frameworks, model economics, and agent workflows — what actually works.

EngineeringRead more →
Every Failed AI Product Has the Same Root CauseEvery Failed AI Product Has the Same Root Cause
December 29, 2024

Every Failed AI Product Has the Same Root Cause

After 12 years in ML and AI, I keep seeing the same failure pattern: teams that ship fast and iterate on vibes instead of building systematic evaluation systems. Evals are not a nice-to-have — they are the core competency of any serious AI product team.

ProductRead more →
Why Your LLM Evaluator Is Lying to YouWhy Your LLM Evaluator Is Lying to You
June 29, 2024

Why Your LLM Evaluator Is Lying to You

LLM-as-judge evaluators feel like quality assurance but behave like rubber stamps. They fail hardest on the outputs that matter most — edge cases, safety-critical errors, domain-specific nuance. Here is what to do instead.

EngineeringRead more →

Clint Johnson

I build stuff for healthcare companies. Sometimes it works, sometimes I learn something. Always caffeinated, usually in Nashville.

Site

  • Articles
  • Projects
  • Contact
  • RSS

Connect

  • 1Put Health

    Healthcare innovation studio

    View

© 2026 Clint Johnson. All rights reserved.