Apply moderation, DLP redaction, and prompt firewall — Use case

Overview

Detect toxicity/PII/jailbreaks; block, strip, or redact; and keep auditable logs.

Unfiltered AI and open-text chats are exposed to abuse, data leakage, and regulatory violations.

Chat Core enforces input and output rules across detectors and LLMs: it redacts PII, downgrades confidence, or blocks content, and it keeps immutable audit trails.
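As a rough illustration of how detector findings might map to the actions named above (redact, downgrade confidence, block), here is a minimal Python sketch. It is not Chat Core's API: the `Finding` type, the detector names, and every threshold are hypothetical values chosen for the example.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    ALLOW = "allow"
    DOWNGRADE = "downgrade_confidence"
    REDACT = "redact"
    BLOCK = "block"


@dataclass(frozen=True)
class Finding:
    detector: str  # hypothetical detector name, e.g. "toxicity", "pii", "jailbreak"
    score: float   # detector confidence in [0, 1]


# Hypothetical thresholds (assumptions, not product defaults).
BLOCK_AT = {"toxicity": 0.9, "jailbreak": 0.8}
REDACT_AT = {"pii": 0.5}
DOWNGRADE_AT = {"toxicity": 0.6}

# Actions ordered from least to most severe.
SEVERITY = [Action.ALLOW, Action.DOWNGRADE, Action.REDACT, Action.BLOCK]

# Rule tables checked from most to least severe for each finding.
RULES = (
    (BLOCK_AT, Action.BLOCK),
    (REDACT_AT, Action.REDACT),
    (DOWNGRADE_AT, Action.DOWNGRADE),
)


def decide(findings):
    """Return the most severe action triggered by any finding."""
    chosen = Action.ALLOW
    for finding in findings:
        for table, action in RULES:
            threshold = table.get(finding.detector)
            if threshold is not None and finding.score >= threshold:
                if SEVERITY.index(action) > SEVERITY.index(chosen):
                    chosen = action
                break  # first hit is the most severe rule for this finding
    return chosen
```

Under these assumed thresholds, a message with both a high toxicity score and detected PII is blocked rather than redacted, because the most severe triggered action wins.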

Enable moderation categories; configure redaction masks for PAN/SSN/email/phone; require citations for KB answers; export block events to your SIEM.
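The redaction masks for PAN/SSN/email/phone could look something like the following Python sketch. The regular expressions, the mask tokens such as `[PAN]`, and the `redact` function are assumptions made for illustration; they are simplified (US-style SSN and phone formats only) and do not represent Chat Core's configuration format.

```python
import re

# Simplified, illustrative patterns (assumptions, not production-grade DLP rules).
PAN_RE = re.compile(r"\b(?:\d[ -]?){12,18}\d\b")          # 13-19 digits, optional separators
SSN_RE = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")             # US SSN, dashed form only
EMAIL_RE = re.compile(r"\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}\b")
PHONE_RE = re.compile(
    r"(?<!\d)(?:\+?1[ .-]?)?\(?\d{3}\)?[ .-]?\d{3}[ .-]?\d{4}(?!\d)"
)  # US-style phone numbers


def luhn_ok(digits: str) -> bool:
    """Luhn checksum, used to cut false positives on long digit runs."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def _mask_pan(match: re.Match) -> str:
    digits = re.sub(r"\D", "", match.group(0))
    # Only mask digit runs that pass the Luhn check; leave others untouched.
    return "[PAN]" if luhn_ok(digits) else match.group(0)


def redact(text: str) -> str:
    """Apply masks in order: PAN first, so card digits are not mistaken for phones."""
    text = PAN_RE.sub(_mask_pan, text)
    text = SSN_RE.sub("[SSN]", text)
    text = EMAIL_RE.sub("[EMAIL]", text)
    text = PHONE_RE.sub("[PHONE]", text)
    return text
```

For example, `redact("SSN 123-45-6789, mail a.b@example.com")` returns `"SSN [SSN], mail [EMAIL]"`. Pattern order matters in this sketch: running the PAN mask first keeps fragments of a card number from being matched by the phone pattern.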

Security, Compliance, CX, Platform Owner

Blocked/stripped messages: baseline 0 per 1k, target 15 per 1k

PII in transcripts: baseline 450 ppm, target 20 ppm

Prompt firewall rule set

Spec

Healthcare marketplace adopts strict masking

PII leakage in transcripts dropped by 95%.

Healthcare, Enterprise, NA

Business, Enterprise