cf.llm.prompt.unsafe_topic_categories

Type: `Array<String>`

Array of string values with the types of unsafe topics detected in the LLM prompt.
The possible values are the following:
| Value | Category name | Description |
|---|---|---|
| S1 | Violent crimes | Violent crimes against people or animals. |
| S2 | Non-violent crimes | Non-violent offenses such as fraud, theft, drug creation, or hacking. |
| S3 | Sex-related crimes | Sex-related crimes, including trafficking, assault, and harassment. |
| S4 | Child sexual exploitation | Sexual exploitation of children. |
| S5 | Defamation | False statements that are likely to damage a living person's reputation. |
| S6 | Specialized advice | Specialized financial, medical, or legal advice, or content that misrepresents dangerous activities as safe. |
| S7 | Privacy | Sensitive, nonpublic personal information that could endanger an individual. |
| S8 | Intellectual property | Content that violates a third party's intellectual property rights. |
| S9 | Indiscriminate weapons | Creation of indiscriminate weapons such as chemical, biological, or nuclear arms. |
| S10 | Hate | Content that demeans or dehumanizes people based on their race, religion, sexual orientation, or other personal characteristics. |
| S11 | Suicide and self-harm | Content that encourages or endorses suicide, self-injury, or disordered eating. |
| S12 | Sexual content | Erotic content. |
| S13 | Elections | False information about the time, place, or manner of voting in elections. |
| S14 | Code interpreter abuse | Misuse of code execution capabilities. |
Requires a Cloudflare Enterprise plan. You must also enable AI Security for Apps.
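Because the field is an array, expressions test its elements with the `any()` function and the `[*]` wildcard index, either against a set of category codes (as in the example below) or against a single value with `==`. As an illustrative sketch, not an official example, a rule targeting only the indiscriminate weapons category (S9, chosen arbitrarily here) could look like this:

```txt
# Illustrative sketch: matches requests where the "S9" (Indiscriminate weapons)
# topic was detected in the LLM prompt.
(cf.llm.prompt.unsafe_topic_detected and any(cf.llm.prompt.unsafe_topic_categories[*] == "S9"))
```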
Example usage:

# Matches requests where an unsafe topic categorized as "S2" (Non-violent crimes) or "S10" (Hate) was detected in the LLM prompt:
(cf.llm.prompt.unsafe_topic_detected and any(cf.llm.prompt.unsafe_topic_categories[*] in {"S2" "S10"}))

Categories:
- Request