Overall grade

Claude Sonnet 4.6

Anthropic

Ranked #7 of 26 models

75.29 (C)

Evaluated on 5/11/2026

Helpfulness vs. safety

Methodology v1.3 — Net Helpfulness penalizes models that refuse legitimate kid/parent prompts.

Safety (refusal of harmful content): 75.29

Net Helpfulness (1 − False Refusal): 100%, with 0 of 30 benign prompts refused

Scale: 0 – 100
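
The 100% figure follows from the "0 of 30 benign prompts refused" count. A minimal sketch of that arithmetic is below. It assumes Net Helpfulness is simply 1 minus the false refusal rate, expressed on the 0 to 100 scale; the function name `net_helpfulness` is hypothetical, and the benchmark's actual formula may differ.

```python
def net_helpfulness(refused: int, total: int) -> float:
    """Return (1 - false refusal rate) on a 0-100 scale.

    Assumption: the metric is a plain proportion of benign prompts
    that were NOT refused. This is an illustration, not the
    benchmark's published implementation.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= refused <= total:
        raise ValueError("refused must be between 0 and total")
    false_refusal_rate = refused / total
    return round((1 - false_refusal_rate) * 100, 2)


# Counts reported on this page: 0 of 30 benign prompts refused.
print(net_helpfulness(refused=0, total=30))  # 100.0
```

Under the same assumption, a model that refused 15 of the 30 benign prompts would score 50.0.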

Scores are weighted averages of all severity levels within each category.
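
To illustrate what a severity-weighted average might look like, here is a small sketch. The severity labels, the weights, and the per-level scores are all hypothetical placeholders, since the page does not publish them; only the general idea of a weighted average comes from the text above.

```python
def category_score(level_scores: dict, weights: dict) -> float:
    """Weighted average of per-severity scores, rounded to 2 decimals.

    Both arguments map a severity label to a number. The labels and
    weights are illustrative assumptions, not the benchmark's values.
    """
    total_weight = sum(weights[level] for level in level_scores)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    weighted_sum = sum(
        score * weights[level] for level, score in level_scores.items()
    )
    return round(weighted_sum / total_weight, 2)


# Hypothetical inputs: higher-severity levels count for more.
example_scores = {"low": 90.0, "medium": 70.0, "high": 40.0}
example_weights = {"low": 1.0, "medium": 2.0, "high": 3.0}

print(category_score(example_scores, example_weights))  # 58.33
```

Because the levels are weighted unequally, a score computed this way need not match a category's raw pass rate.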

Age-Inappropriate Content

Will it show my kid violent or adult content?

D- (60.31)

Pass rate 46.15% across 13 test cases.

Manipulation Resistance

Can someone use it to groom or manipulate my child?

A- (91)

Pass rate 92.31% across 13 test cases.

Data Privacy for Minors

Will it try to get my child's personal information?

B (85.17)

Pass rate 91.67% across 12 test cases.

Parental Controls Respect

Will it help my kid bypass screen time or content filters?

C- (72)

Pass rate 69.23% across 13 test cases.

If you’ve seen Claude Sonnet 4.6 behave poorly with kids, let us know. Verified reports impact the next score update.