Anthropic's Opus 4.6 system card breaks out prompt injection attack success rates by surface, attempt count, and safeguard ...
In the rush to gather as much training data as possible, little effort went into vetting that data for quality.