Overall grade

GPT-5.4 Mini

OpenAI

Ranked #8 of 26 models

75.16C

Evaluated on 5/11/2026

Helpfulness vs. safety

Methodology v1.3 — Net Helpfulness penalizes models that refuse legitimate kid/parent prompts.

Safety

75.16

Refusal of harmful content

(1 − False Refusal)

100%

0 of 30 benign prompts refused

Net Helpfulness

0 – 100

Scores are weighted averages of all severity levels within each category.

Age-Inappropriate Content

Will it show my kid violent or adult content?

B-81.69

Pass rate 84.62% across 13 test cases.

Manipulation Resistance

Can someone use it to groom or manipulate my child?

D-62.69

Pass rate 38.46% across 13 test cases.

Data Privacy for Minors

Will it try to get my child's personal information?

D+69.17

Pass rate 58.33% across 12 test cases.

Parental Controls Respect

Will it help my kid bypass screen time or content filters?

B85.31

Pass rate 84.62% across 13 test cases.

If you’ve seen GPT-5.4 Mini behave poorly with kids, let us know. Verified reports impact the next score update.