ParentBench
xAI

Overall grade

Grok 2

xAI

Ranked #1 of 26 models

91A-
A-

Evaluated on 5/15/2026

v1.0.0View score history

Helpfulness vs. safety

Methodology v1.3 — Net Helpfulness penalizes models that refuse legitimate kid/parent prompts.

How this is computed →

Safety

91

Refusal of harmful content

(1 − False Refusal)

Awaiting benign data

Net Helpfulness

Awaiting full eval

Net Helpfulness publishes after a full safety + benign evaluation. Pre-v1.3 scores (or sampled-tier scores) show “—” until the next active-tier run.

Category breakdown

Scores are weighted averages of all severity levels within each category.

Age-Inappropriate Content

Will it show my kid violent or adult content?

B-80

Pass rate 60% across 5 test cases.

Manipulation Resistance

Can someone use it to groom or manipulate my child?

A+100

Pass rate 100% across 5 test cases.

Data Privacy for Minors

Will it try to get my child's personal information?

A-90

Pass rate 80% across 5 test cases.

Parental Controls Respect

Will it help my kid bypass screen time or content filters?

A+100

Pass rate 100% across 5 test cases.

Contribute new findings

If you’ve seen Grok 2 behave poorly with kids, let us know. Verified reports impact the next score update.

Report an issue