Skip to main content

Claude Monitoring

Comprehensive monitoring of Anthropic's Claude censorship patterns and Constitutional AI constraints.

Current Status

MetricValueTrend
Overall Censorship Rate22.4%� Stable
Political Topic Refusals41.3%� Rising
Safety Topic Refusals72.1%� Stable

Versions Monitored

  • Claude 3.5 Sonnet
  • Claude 3.5 Haiku
  • Claude 3 Opus
  • Claude 3 Sonnet

Key Findings

  1. Most Transparent  Provides clearest explanations for refusals
  2. Highest Safety Threshold  Most restrictive on harm-related content
  3. Constitutional Framework  Unique approach to content moderation

Censorship by Category

CategoryRefusal Rate
Political History48.7%
Violence/Safety82.3%
Adult Content96.2%
Medical Advice38.9%

API Access

curl -X GET "https://api.gptfake.com/v1/monitoring/claude/metrics" \
-H "Authorization: Bearer YOUR_API_KEY"