Claude Monitoring
Comprehensive monitoring of Anthropic's Claude censorship patterns and Constitutional AI constraints.
Current Status
| Metric | Value | Trend |
|---|---|---|
| Overall Censorship Rate | 22.4% | � Stable |
| Political Topic Refusals | 41.3% | � Rising |
| Safety Topic Refusals | 72.1% | � Stable |
Versions Monitored
- Claude 3.5 Sonnet
- Claude 3.5 Haiku
- Claude 3 Opus
- Claude 3 Sonnet
Key Findings
- Most Transparent Provides clearest explanations for refusals
- Highest Safety Threshold Most restrictive on harm-related content
- Constitutional Framework Unique approach to content moderation
Censorship by Category
| Category | Refusal Rate |
|---|---|
| Political History | 48.7% |
| Violence/Safety | 82.3% |
| Adult Content | 96.2% |
| Medical Advice | 38.9% |
API Access
curl -X GET "https://api.gptfake.com/v1/monitoring/claude/metrics" \
-H "Authorization: Bearer YOUR_API_KEY"