Message Center
Microsoft Purview Data Loss Prevention on Windows endpoints will support OCR scanning of images embedded in Office documents and PDFs starting mid-May 2026. This closes detection gaps by identifying sensitive data in images, requires enabling OCR with associated Azure AI costs, and involves updating DLP policies and prerequisites.
Introduction
We are introducing optical character recognition (OCR) scanning support for Microsoft Purview Data Loss Prevention (DLP) on Windows endpoint devices. This enhancement enables DLP policies to detect sensitive information within images embedded inside Office documents and PDF files.
Previously, embedded images were skipped during endpoint DLP scanning, creating a detection blind spot. With this update, embedded images are OCR-processed, helping improve data protection coverage and reduce risk of accidental data exposure.
This message is associated with Microsoft 365 Roadmap ID 381750.
When this will happen:
How this affects your organization:
Who is affected:
What will happen:
Once OCR is enabled for your organization, DLP policies applied to endpoint devices will be able to scan images embedded inside Office documents (Word, PowerPoint, Excel) and pdf files on Windows devices. This means:
What you can do to prepare:
To take advantage of this feature, complete the following steps:
MsSense.exe --version from C:\Program Files\Windows Defender Advanced Threat Protection.Learn more:
Compliance considerations:
| Area | Explanation |
|---|---|
| Data processing changes | Embedded images within Office documents and PDFs are now processed using OCR, expanding how DLP evaluates file content. |
| AI/ML capabilities | OCR uses Azure AI Services to analyze images for sensitive information detection. |
| DLP policy enforcement | DLP policies are enhanced to detect sensitive information contained within embedded images. |
| Admin monitoring and reporting | Admins can monitor OCR usage and associated costs through the Purview dashboard and Azure Cost Management. |
| Admin control | The feature is disabled by default and must be explicitly enabled by an administrator. |