ParentBench
Child Safety Benchmark

Is This AI Safe for Your Kids?

ParentBench evaluates AI models on safety for children under 16. See which models best protect kids from inappropriate content, manipulation, and privacy risks.

Models Tested: 26
Test Cases: 51
Last Updated: May 4, 2026

API default: How the model behaves on a clean API call

Rank  Provider   Safety  Grade  False Refusal Rate  Age Content  Manipulation  Privacy  Parental Ctrl  Last Evaluated
1     Anthropic  81.92   B-     3% (1 of 30)        72.62        94.85         80.33    83.62          May 4, 2026
2     OpenAI     80.46   B-     37% (11 of 30)      81.54        70.38         82.92    88.69          May 4, 2026
3     OpenAI     78.82   C+     0% (0 of 30)        76.85        69.23         88.08    85             May 4, 2026
4     Anthropic  76.58   C      0% (0 of 30)        63.62        89.85         80       79.23          May 4, 2026
5     OpenAI     73.46   C      0% (0 of 30)        74.23        64.62         71       85.62          May 4, 2026
6     OpenAI     72.9    C-     -                   86           48            68       86             May 1, 2026
7     OpenAI     72.32   C-     30% (9 of 30)       78.85        56.31         72.08    81.15          May 4, 2026
8     Anthropic  71.41   C-     0% (0 of 30)        54           90.08         76.08    73.85          May 4, 2026
9     xAI        69.86   D+     100% (30 of 30)     64.23        81.15         61.83    73.62          May 3, 2026
10    Google     62.17   D-     0% (0 of 30)        53.77        59.23         74.33    68.38          May 4, 2026
11    Google     59.2    F      -                   100          20            56       40             May 1, 2026
12    Anthropic  55      F      -                   60           56            60       40             May 1, 2026
13    OpenAI     54.5    F      -                   50           20            60       100            May 1, 2026
14    Google     52.8    F      -                   56           56            36       60             May 1, 2026
15    Anthropic  50      F      -                   40           72            40       50             May 1, 2026
16    Anthropic  50      F      -                   60           20            40       80             May 1, 2026
17    Anthropic  47      F      -                   40           52            40       60             May 1, 2026
18    OpenAI     45.7    F      -                   50           20            76       40             May 1, 2026
19    Anthropic  44      F      -                   80           0             40       40             May 1, 2026
20    Google     42.41   F      96% (22 of 23)      15.38        35.77         70.42    70             May 4, 2026
21    Google     36.1    F      -                   56.92        45.38         24.17    0              May 4, 2026
22    OpenAI     31.7    F      -                   30           0             46       60             May 1, 2026
23    OpenAI     31.5    F      -                   50           0             0        70             May 1, 2026
24    Anthropic  29.5    F      -                   50           0             40       20             May 1, 2026
25    OpenAI     19      F      -                   20           0             20       40             May 1, 2026
26    OpenAI     15      F      -                   20           0             20       20             May 1, 2026
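
The page does not state how the overall Safety score is derived from the four category scores, or how the False Refusal Rate percentage is rounded. As a rough sketch only: the listed Safety values appear consistent with a weighted average using weights of 0.35 (Age Content), 0.25 (Manipulation), 0.20 (Privacy), and 0.20 (Parental Ctrl), and the refusal percentages appear to be the refused count divided by the total, rounded to a whole number. These weights, the rounding, and the function names below are inferred from the numbers in the leaderboard and are assumptions, not ParentBench's published methodology.

```python
# Assumed weights, inferred from the listed scores; not documented by ParentBench.
ASSUMED_WEIGHTS = {
    "age_content": 0.35,
    "manipulation": 0.25,
    "privacy": 0.20,
    "parental_ctrl": 0.20,
}


def weighted_safety(age_content, manipulation, privacy, parental_ctrl):
    """Combine the four category scores into one score using the assumed weights."""
    scores = {
        "age_content": age_content,
        "manipulation": manipulation,
        "privacy": privacy,
        "parental_ctrl": parental_ctrl,
    }
    total = sum(ASSUMED_WEIGHTS[name] * scores[name] for name in ASSUMED_WEIGHTS)
    return round(total, 2)


def false_refusal_pct(refused, total):
    """Express a refusal count as a whole-number percentage (assumed rounding)."""
    return round(100 * refused / total)


if __name__ == "__main__":
    # Rank 1 row: category scores 72.62, 94.85, 80.33, 83.62; listed Safety 81.92.
    print(weighted_safety(72.62, 94.85, 80.33, 83.62))
    # Rank 2 row: 11 of 30 benign prompts refused; listed as 37%.
    print(false_refusal_pct(11, 30))
```

If the real methodology differs (for example, per-test-case weighting rather than per-category weighting), the agreement with the listed values would be coincidental, so treat this as a consistency check rather than a specification.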