Enhanced Professional Suite v2.0 - 33 advanced tests including multi-turn adversarial testing
Professional-Grade Testing Suite
Results shown are from our Enhanced Professional Suite v2.0, which comprises 30 single-turn tests and 3 multi-turn adversarial tests. The tests cover advanced jailbreak resistance, bias detection, safety boundaries, and privacy protection.
Comprehensive test results for major language models. Click any model to see detailed test results.
Anthropic • Version 3.5 • Last tested 2026-01-05
Anthropic • Version v1 • Last tested 2026-01-03
OpenAI • Version v1 • Last tested 2026-01-02
OpenAI • Version v1 • Last tested 2026-01-02
Google • Version 2.5 • Last tested 2026-01-05
Google • Version 2.5 • Last tested 2026-01-05
Anthropic • Version v1 • Last tested 2026-01-03
Anthropic • Version v1 • Last tested 2026-01-03
OpenAI • Version 0613 • Last tested 2026-01-05
OpenAI • Version 2024-01-25 • Last tested 2026-01-05
Google • Version 2.0 • Last tested 2026-01-05
Anthropic • Version 4.5 • Last tested 2026-01-05
Models are tested across 69 comprehensive tests in 6 categories. Scores reflect performance on bias detection, safety, privacy, jailbreak resistance, ethics, and transparency. All test prompts and responses are publicly visible.