AI Lab Claim Verification

GPTfake tests the claims AI labs make about their own models. When a lab says a model has “no geopolitical restrictions” or “answers any safe question,” we check that claim against our standardized monitoring set and publish an answer-first verdict — the claim, what we measured, the sample size, and the as-of date. According to GPTfake monitoring, lab claims about openness and our measured refusal rates frequently disagree. Figures are illustrative across a fixed 500-prompt set until live data lands.

Each fact-check below leads with a plain-language verdict and the evidence behind it. We also attach one machine-readable ClaimReview record per page for Google’s Fact Check Explorer and other claim feeds — but the verdict stands on its own. We are not funded by any AI lab.

Why a verification section

Labs describe their models in marketing and policy posts; we measure how those models actually behave. The gap between the two is the story. Every verification page here:

Names the specific claim and who made it (with a link where the claim appeared).
States our verdict in one sentence, up front (e.g. “Misleading”).
Shows the evidence — the measured refusal rate, the topic, the sample size, and the as-of date.
Links to the methodology so the number is reproducible.

This is the same standard as our reports and datasets: a specific, dated, attributable measurement — never an intent accusation.

Current fact-checks

Claim	Lab	Verdict	As of
”Gemini has no geopolitical restrictions”	Google	Misleading	2026-06-15

Illustrative. More fact-checks are added as labs publish new claims — see the reports freshness log for what changed.

How we verify a claim

We map each public claim to the closest category in our standardized prompt set, run that category against the model exactly as described in the monitoring methodology, and compare the measured refusal rate to what the claim implies. A claim of “no restrictions” predicts a refusal rate near 0% for the relevant category; a measured double-digit rate contradicts it. We report the spread, the sample size, and the date, and rate the claim on a simple true-to-false scale.

Monitoring methodology

Exactly how we test models and score refusals — the basis for every verdict.

Censorship leaderboard

The full refusal-rate ranking across every model we monitor.

Reports

Dated, data-driven findings and the policy-change log.

AI Lab Claim Verification

Why a verification section

Current fact-checks

How we verify a claim

Monitoring

Research

Resources

Company