safety

SafeBench 2025’s top picks: The Benchmarks That Actually Matter for AI Safety

SafeBench 2025’s top picks: The Benchmarks That Actually Matter for AI Safety

You know that feeling when your AI model aces every benchmark but still somehow manages to fail spectacularly in the real world? Yeah, that's exactly why SafeBench exists. While everyone's been obsessing over MMLU scores and coding benchmarks, the real question isn't just "

OS-HARM: The AI Safety Benchmark That Puts LLM Agents Through Hell

OS-HARM: The AI Safety Benchmark That Puts LLM Agents Through Hell

Language models have come a long way. From playing autocomplete in your email to writing decent Python scripts, they’ve now levelled up into agents: full-blown task-doers who can click, scroll, type, and wreak havoc across your desktop. These “computer use agents” are smart enough to open your emails, edit