Filter Tagged PDF Content

A tagged PDF contains an additional layer of text for visually impaired readers. This text is used in text-to-speech features in various PDF viewing programs. You can enable filtering of tagged PDF text in the API.

Filtering the extra layer of tagged content might result in duplicate text in the output. This is the expected behavior.

To filter tagged PDF content

  1. Create a Session object.
  2. To filter tagged PDF Content using the C++ API, invoke tagged_pdf_content(true) on a Configuration object, and call filter() on a Session object.