Toxic, hateful, or harassing content generation
The system generates hateful, abusive, profane, harassing, or otherwise offensive content.
- Risk family: Model & system behaviour
- MIT domain: 1. Discrimination & Toxicity
- MIT subdomain: 1.2 Exposure to toxic content
- AI type: GPAI
- Scope: System
- Source standard: MIT AI Risk Repository v4
Provenance
17 source framework citation keys
Framework crosswalk
Every framework item mapped to this risk. Items marked "partial" overlap with this risk only in part.
- A.10 ISO/IEC 23894 Annex A
- A.6 ISO/IEC 23894 Annex A
- A.5.4 ISO/IEC 42001 Annex A
- A.6.2.4 ISO/IEC 42001 Annex A
- ibm-spreading-toxicity Spreading toxicity
- ibm-toxic-output Toxic output
- AISubtech-15.1.11 Safety Harms and Toxicity: Profanity
- AISubtech-15.1.3 Safety Harms and Toxicity: Animal Abuse (partial)
- AISubtech-15.1.6 Safety Harms and Toxicity: Environmental Harm (partial)
- AISubtech-15.1.8 Safety Harms and Toxicity: Harassment
- AISubtech-15.1.9 Safety Harms and Toxicity: Hate Speech
- GENAI.3 Dangerous, Violent, or Hateful Content
Part of the Deployer AI Risk Register, an open-source resource developed by MindXO. Version 1.0, 3 July 2026. Derived from the MIT AI Risk Repository (V4, December 2025) under CC BY 4.0; an independent derivative work, not endorsed by or affiliated with MIT. Sub-risk decomposition references MITRE ATLAS™ v5.6.0 (© 2021-2026 The MITRE Corporation, reproduced and distributed with permission). ISO/IEC and EU AI Act references are by number only. License: CC BY 4.0.