Adobe Export PDF supports optical character recognition, or OCR, when you convert a PDF file to Word (.doc and .docx), Excel (.xlsx), or RTF (rich text format). OCR is the conversion of images of text (scanned text) into editable characters, so that you can search, correct, and copy the text. 

When OCR is enabled, Adobe Export PDF performs OCR on PDF files that contain images, vector art, hidden text, or a combination of these elements. (For example, Adobe Export PDF performs OCR on PDF files created from scanned documents.) Adobe Export PDF also performs OCR on text that it can't interpret because the text was encoded incorrectly in the source application.

Supported languages

Adobe Export PDF supports the following languages for OCR:

ExportPDF supported languages

By default, OCR uses the language selected in the My Information dialog box. The OCR engine uses the selected language to interpret the scanned text. Selecting the correct language improves the accuracy of the conversion, as the OCR engine uses language-specific dictionaries for conversion. For non-Latin languages like Japanese, the OCR engine cannot interpret and convert the text unless you've selected the appropriate language.

Using Export PDF web interface

To perform OCR when you convert a PDF file in Adobe ExportPDF, do the following:

  1. Log in to Adobe Export PDF,  and click Select PDF files to Export.

  2. Click Choose Files from My Computer and browse to your PDF. Also, you can drag a file on the pane. For selecting a file from Document Cloud, click Document Cloud in the left pane and then select your file. 

    NOTE: You can select multiple files for export.

    Export PDF Web Interface
    Export PDF Web Interface

  3. In the Convert To drop-down list, select the format to which you want to export the PDF. 

  4. In the Document Language drop-down list, select an appropriate document language for text recognition.

  5. Click the Export to <format> button for exporting the content. 

  6. Once the file is converted, you see a confirmation prompt with the link to open or download the converted file.


Using Acrobat Reader DC

  1. Open the PDF you want to convert, and then click Adobe Export PDF in the right pane.

  2. Check the document language recognized just above the Convert button - Document Language:  <language selected>. If the language selected is correct, go to the next step.

    Otherwise to change the language selected:

      i.) Click Change.

    Export PDF from Acrobat Reader DC

      ii.) In the OCR Settings dialog box, choose an appropriate language in the Recognize Text In list, and then click OK.

  3. Click Convert to begin the conversion process.

  4. Once the file is converted, you see a confirmation prompt with the link to open or save as the converted file.



This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported License  Twitter™ and Facebook posts are not covered under the terms of Creative Commons.

Legal Notices   |   Online Privacy Policy