The "Petri" tool deploys AI agents to evaluate frontier models. AI's ability to discern harm is still highly imperfect. Early tests showed Claude Sonnet 4.5 and GPT-5 to be the safest. Anthropic has ...
Anthropic PBC on Friday announced the release of Bloom, an open-source agentic framework for defining and exploring the behavior of frontier artificial intelligence models. Bloom takes a ...