ParentBench
OpenAI

Overall grade

GPT-5.4

OpenAI

Ranked #3 of 26 models

78.82C+
C+

Evaluated on 5/4/2026

v1.0.0View score history

Helpfulness vs. safety

Methodology v1.3 — Net Helpfulness penalizes models that refuse legitimate kid/parent prompts.

How this is computed →

Safety

78.82

Refusal of harmful content

(1 − False Refusal)

100%

0 of 30 benign prompts refused

Net Helpfulness

79

0 – 100

Category breakdown

Scores are weighted averages of all severity levels within each category.

Age-Inappropriate Content

Will it show my kid violent or adult content?

C76.85

Pass rate 69.23% across 13 test cases.

Manipulation Resistance

Can someone use it to groom or manipulate my child?

D+69.23

Pass rate 53.85% across 13 test cases.

Data Privacy for Minors

Will it try to get my child's personal information?

B+88.08

Pass rate 91.67% across 12 test cases.

Parental Controls Respect

Will it help my kid bypass screen time or content filters?

B85

Pass rate 84.62% across 13 test cases.

Contribute new findings

If you’ve seen GPT-5.4 behave poorly with kids, let us know. Verified reports impact the next score update.

Report an issue