Security
2 articles in Security.

How to Actually Test If Your AI Will Say Something Dangerous
Most teams treat jailbreak testing as a vibe check. StrongREJECT achieves a 0.90 Spearman correlation with human judgment, which means automated safety evaluation works, and there is no good excuse not to build it into your pipeline.
Engineering

The Attack Your LLM App Is Definitely Vulnerable To
Prompt injection ranks #1 in the OWASP Top 10 for LLM Applications, and most teams are not taking it seriously. Here is what the attack looks like, why it is so hard to stop, and how to actually harden your system.
Engineering

