RM381750 - Microsoft Purview compliance portal: Data Loss Prevention for endpoints - Optical character recognition (OCR) support for embedded images in endpoint

Microsoft 365 Roadmap

Summary

This release will extend OCR support from standalone images (JPEG, JPG, PNG, BMP, TIFF, and PDF) to images embedded inside the following files and file types: Office files (XLSX, DOCX, PPTX), container files (zip, rar, 7z, and more), and PDF files. Image-only PDF files are already supported, and this this release will support hybrid PDF files containing images and searchable text. Updated May 20, 2026: We have paused rollout and will resume soon. Thank you for your patience.

Last Updated

May 20, 2026

Published Feb 28, 2024

View version history

Status

In development

Release

General Availability
Preview

Platforms

Web

Service

Microsoft Purview

Tag

In development
General Availability
Preview
Worldwide (Standard Multi-Tenant)

Cloud

Worldwide (Standard Multi-Tenant)

Description

This release will extend OCR support from standalone images (JPEG, JPG, PNG, BMP, TIFF, and PDF) to images embedded inside the following files and file types: Office files (XLSX, DOCX, PPTX), container files (zip, rar, 7z, and more), and PDF files. Image-only PDF files are already supported, and this this release will support hybrid PDF files containing images and searchable text. Updated May 20, 2026: We have paused rollout and will resume soon. Thank you for your patience.

GA date: May CY2026

Preview date: April CY2026

Version history

3 versions tracked

Updated 2 times since Apr 2, 2026. Microsoft Message Center only ever shows the current version; this archive preserves the history.

Compare any two versions

From
To
  1. May 20, 2026 · 11:15 PMLatest · v3

    Changed: Body, Tags, Status

  2. May 19, 2026 · 10:45 PMv2

    Changed: Tags, Status

  3. Apr 2, 2026 · 11:15 PMOriginal · v1

    Changed: Initial version