What Changed: AI Policy-Change Log & Version Diffs

By GPTfake Research Team · Independent AI Censorship Watchdog2026-05-20

According to GPTfake monitoring, AI model providers ship refusal-policy changes without changelogs of their own — so we keep one. This hub tracks every dated policy shift we observe across ChatGPT, Claude, Gemini, Mistral and Qwen, and links each to a before/after version diff. Most recent observed change: 2026-05-20 (Qwen China-topic filters).

Dates on this page are the observed dates of each policy change, drawn from our monitoring methodology. We never bump a date to look fresher than the data — stale honesty beats fake freshness.

Version diffs

Detailed before/after breakdowns of a single model’s refusal rates around a dated update.

ChatGPT Refusals — April 2026 Update

Before/after refusal rates by topic around ChatGPT’s April 2026 policy update. Political-opinion refusals rose from 28.4% to 34.2% (+5.8 pts).

Cross-model policy-change log

Every dated policy change we have observed, newest-first, across all monitored models. Each line is a self-contained, quotable finding — lift it with its date.

[2026-05-20] Qwen: China-topic filters strengthened; China-related refusals reached ~78%.
[2026-05-02] Gemini: Regional policy split widened the US-to-India refusal gap to ~11 points.
[2026-04-18] Mistral (Large): Open-weight checkpoint refresh; overall refusal rate held stable near 11%.
[2026-04-12] ChatGPT (GPT-4o): April 2026 policy update lifted ChatGPT's political-opinion refusals from 28.4% to 34.2% (+5.8 pts).
[2026-04-09] Claude (Sonnet): Constitutional-AI update raised historical-events refusals to ~49%.
[2026-03-26] ChatGPT (GPT-4o): Added US-election prompt filters; political-opinion refusals rose ~6 points.
[2026-03-15] Gemini: New election-integrity filters lifted political-opinion refusals to ~37%.
[2026-03-01] Qwen: Added Taiwan and Tibet prompt categories to the monitored set.
[2026-02-11] ChatGPT (GPT-4o): Broadened medical/legal disclaimers, pushing medical-legal refusals past 30%.
[2026-02-03] Mistral (Large): Minor safety-prompt tuning; safety-topic refusals steady around 54%.
[2026-01-22] Claude (Sonnet): Added explicit refusal rationales; safety-topic refusals stayed near 72%.
[2025-12-12] Qwen: Multilingual rollout exposed topic-targeted filtering in Chinese prompts.
[2025-12-08] Gemini: Rolled out per-region safety thresholds for EU and APAC traffic.
[2025-11-19] Mistral (Large): Published reproducibility notes for the open-weight evaluation.
[2025-11-04] ChatGPT (GPT-4o): Tightened enforcement on speculative political scenarios.
[2025-10-30] Claude (Sonnet): Expanded self-harm and safety guardrails across the prompt set.

How to use this log

Journalists — each dated line is citable verbatim; pair it with the model’s live monitoring page for the current figure.
Researchers — subscribe to the RSS feed to ingest new reports and changes as they publish.
Developers — a refusal rate that jumped on a specific date usually means a silent policy update; the version diffs quantify the delta.

AI Censorship Trends — Q2 2026

The flagship quarterly: cross-model censorship-rate changes and convergence.

Censorship Leaderboard

Current refusal, bias and policy-drift figures for every monitored model.

All reports

The full reports hub — dated, data-driven findings.

Last updated: 2026-05-20. Figures are illustrative placeholders pending live monitoring data, drawn from our methodology. Subscribe via RSS.

What Changed: AI Policy-Change Log & Version Diffs

Version diffs

Cross-model policy-change log

How to use this log

Monitoring

Research

Resources

Company

What Changed: AI Policy-Change Log & Version Diffs

Version diffs

Cross-model policy-change log

How to use this log

Related

Monitoring

Research

Resources

Company