MC947833 - Microsoft Purview | Communication Compliance: Detect potentially risky generative AI interactions

Service

Microsoft Purview

Published

Dec 3, 2024

Tag

New feature
User impact
Admin impact

Platforms

Web

Summary

Microsoft Purview's Communication Compliance now includes detection for risky generative AI interactions, with new classifiers for prompt injection attacks and protected material. Public Preview began mid-November 2024, with General Availability details to follow. No admin action is required before the rollout.

More information

Introducing in Communication Compliance the ability to detect potentially risky generative AI interactions using Microsoft Azure AI Content Safety's Prompt shields and Protected materials classifiers. The Prompt shield classifier can detect risk of prompt injection attacks (jailbreak) by malicious users and the Protected material classifier can identify when generative AI responses contain branded or copyrighted material so organizations can maintain content originality and protect their reputations.

This message is associated with Microsoft 365 Roadmap ID 422334.

When this will happen:

Public Preview: We began rolling out mid-November 2024 and expect to complete by late December 2024.

General Availability (Worldwide): We will communicate the plan for General Availability in a separate post.

How this will affect your organization:

Communication Compliance admins can expect two new classifiers in the trainable classifier list: Prompt shield and Protected material classifier. These classifiers are configured by default in the Detect Microsoft Copilot interactions template policy. When a policy flags a potentially risky Generative AI interaction, you can see the new classifier names listed in the Conditions detected banner:

admin controls

This change is available by default for admins to configure.

What you need to do to prepare:

This rollout will happen automatically by the specified date with no admin action required before the rollout. You may want to notify your admins about this feature availability and update any policies that may benefit from the new classifiers.

No action is required for the new classifiers to be enabled in your tenant. When classifiers are visible in your tenant, you can configure them in any Communication Compliance policy looking at a generative AI workload.

Learn more: The Trainable classifiers section of Create and manage communication compliance policies | Microsoft Learn (will be updated before rollout)