Learn step-by-step how to report harmful or policy-violating content in ChatGPT web, apps, and OpenAI's developer platforms. Understand the process, outcomes, and additional feedback options for safer AI interactions.
## Why Reporting Content Matters in OpenAI Ecosystems
Maintaining a safe and trustworthy environment is paramount for AI platforms like ChatGPT and OpenAI's developer tools. Users encountering content that violates OpenAI's Usage Policies—such as hate speech, harassment, violence promotion, illegal activities, or sexually explicit material—play a crucial role by reporting it. This proactive step helps OpenAI's moderation teams review and act swiftly, improving model safety and user experience. In this guide, we'll break down the reporting mechanisms across different interfaces, compare their workflows, provide detailed steps with practical examples, and explain post-report processes.
Reporting not only flags problematic outputs but also contributes to ongoing model training and policy enforcement. For instance, if ChatGPT generates instructions for creating explosives, reporting it prevents similar issues for others. OpenAI prioritizes these reports, often reviewing them within hours or days, depending on volume.
## Reporting Content in ChatGPT: Web and Mobile Breakdown
ChatGPT offers distinct reporting paths for its web version (chat.openai.com) and mobile apps (iOS/Android), optimized for each platform's interface. This comparison highlights subtle differences in accessibility and flow, ensuring users can act quickly regardless of device.
### Web Version (chat.openai.com)
To report a conversation or specific message:
- **Locate the content**: Navigate to the chat history sidebar on the left.
- **Select the conversation**: Click the three dots (⋯) menu next to the relevant chat.
- **Initiate report**: Choose "Report" from the dropdown.
- **Provide details**: In the pop-up, select a reason (e.g., "Hate", "Violence", "Self-harm") from predefined categories, add optional context, and submit.
**Practical Example**: Imagine ChatGPT responds to a query about "best ways to scam people" with detailed tactics. In the sidebar, click ⋯ on that chat > Report > Select "Illegal activities" > Add note: "Promotes fraud schemes" > Submit. This flags the entire conversation for review.
### Mobile Apps (iOS and Android)
Mobile reporting emphasizes touch-friendly navigation:
- **Open the chat**: Swipe or tap into the conversation.
- **Access history**: Tap your profile icon (top left) to view chat list.
- **Report option**: Tap the three dots (⋯) next to the chat > "Report chat".
- **Categorize and submit**: Pick a violation type, include details if desired, and confirm.
**Key Difference from Web**: Mobile uses "Report chat" explicitly, while web has a broader "Report" that covers messages too. Both lead to the same backend review.
**Real-World Application**: During a late-night session, if the app outputs graphic violence descriptions, report via profile > chat list > ⋯ > Report chat. This is especially useful for on-the-go users spotting issues in shared devices.
**Pro Tip**: Screenshots aren't required but enhance reports—attach them via the details field for faster triage.
## Reporting on OpenAI Platform: Playground, API, and Assistants
For developers using the OpenAI Platform (including Playground, API integrations, fine-tuning, and Assistants API), reporting focuses on system-level issues like model outputs or tool behaviors. This contrasts with ChatGPT's user-centric chats by targeting technical endpoints.
### Playground and Assistants Playground
- **In Playground**: Generate a completion, then click the flag icon (🚩) below the response.
- **Select category**: Choose from options like "Harmful content", "Privacy violation", or custom feedback.
- **Submit with context**: Add notes on why it violates policies.
**Example**: Testing GPT-4o in Playground yields a response inciting harassment. Flag it immediately: Click 🚩 > "Harassment" > "Response targets protected groups" > Submit.
### API and Custom Integrations
Direct API calls don't have built-in UI flags, so use these channels:
- **Moderation Endpoint**: Integrate OpenAI's Moderation API proactively. Send flagged text to `https://api.openai.com/v1/moderations`:
```bash
curl https://api.openai.com/v1/moderations \\
-H "Authorization: Bearer $OPENAI_API_KEY" \\
-d '{"input": "your flagged text here"}'
```
This returns categories like `hate`, `sexual`, with relevance scores (e.g., >0.5 flags high risk).
- **Support Portal**: Log in to platform.openai.com > Help > Submit a ticket under "Report content or behavior".
**Comparison to ChatGPT**: Platform reporting is more developer-oriented, emphasizing API tools for automation, whereas ChatGPT is manual and UI-driven. Use Moderation API in production apps to filter before user exposure—e.g., a customer support bot rejects scam queries automatically.
### Fine-Tuning and Batch Outputs
For custom models:
- Download outputs from platform.openai.com.
- Report via support ticket, attaching files and violation details.
**Actionable Workflow**: In a dev pipeline, pipe API responses through Moderation API; if flagged, log and report in bulk via support.
## What Happens After You Report?
Post-submission transparency builds trust:
- **Acknowledgment**: Immediate confirmation screen with ticket ID.
- **Review Process**: Human moderators (aided by AI triage) assess within 24-48 hours typically.
- **Outcomes**:
- **Action Taken**: Content removal, conversation deletion, or model retraining signals.
- **No Action**: If non-violating, you'll get a brief explanation.
- **Follow-Up**: Complex cases may prompt for more info via email.
- **Privacy**: Reports are anonymized; your data isn't shared without consent.
**Extended Context**: OpenAI's safety ecosystem includes automated filters (99%+ catch rate for egregious harms) plus your reports for edge cases. In 2023, user reports led to thousands of policy updates, per OpenAI transparency logs.
**Example Timeline**: Report a self-harm promotion at 10 AM > Confirmation > Review by 2 PM > Email: "Action taken—thanks for helping!"
## Additional Feedback Channels for Deeper Impact
Beyond flags:
- **Thumbs Up/Down**: Quick sentiment on responses (web/mobile: thumbs icons).
- **Custom Instructions Feedback**: Report issues in settings.
- **Usage Policy Questions**: help.openai.com categories.
- **Research Collaboration**: Join safety researcher programs via openai.com/safety.
- **API Feedback**: platform.openai.com > Usage dashboard > Suggest improvements.
**Best Practice**: Combine methods—thumbs down + report for high-impact issues. Developers: Use `usage` metrics in API responses to track moderation hits.
| Feature | ChatGPT Web | ChatGPT Mobile | Playground/API |
|---------|-------------|----------------|---------------|
| UI Flag | Sidebar ⋯ | Profile ⋯ | Response 🚩 |
| Categories | Predefined | Predefined | Moderation API |
| Automation | No | No | Yes (API) |
| Bulk | Chats | Chats | Support tickets |
This table summarizes flows for quick reference.
## Best Practices and Common Pitfalls
- **Be Specific**: "Promotes bomb-making" > "Bad content".
- **Avoid Spam**: Genuine violations only.
- **Test Iteratively**: In Playground, refine prompts post-report.
- **Integrate Proactively**: Apps should call Moderation API on every output.
By following these, you contribute to safer AI. OpenAI evolves rapidly—check help.openai.com for updates.
---
<div style="text-align: center; margin-top: 2rem;">
<a href="https://help.openai.com/en/articles/10245791-reporting-content-in-chatgpt-and-openai-platforms" target="_blank" rel="noopener noreferrer" class="view-full-resource-btn" style="display: inline-block; background-color: #f97316; color: white; padding: 12px 24px; border-radius: 8px; text-decoration: none; font-weight: 600; transition: background-color 0.2s;">View Full Resource</a>
</div>