Insights/Archive
Past reports
Insights — May 4, 2026
2026-05-0410 AI models from 3 providers were tested in the last 30 days. Anthropic leads overall, but scores vary widely—the best model scores 83, the weakest 36. Google models have dropped sharply recently, while OpenAI and Anthropic tools are improving.
Insights — April 28, 2026
2026-04-28Ten AI models from three providers were tested in the last 30 days on child safety. Performance ranges widely: the top model scores 80, the lowest scores 11. Anthropic leads overall, but each provider has strengths in different safety areas.
Insights — April 25, 2026
2026-04-25-4Ten AI models from three providers were tested in the last 30 days for child safety. Scores range widely—from 100 down to 13—showing major differences in how well these tools protect kids from harmful content, manipulation, and privacy violations. Google's Gemini 3 Flash leads across all safety measures.