-
Describe the bug (mandatory)
I am not able to get text out of this specific PDF. The PDF is displayed correctly by Adobe Reader (and also in Chrome and Firefox).
To Reproduce (mandatory)
Expected behavior (optional)
Get the text out (as with other PDF files).
Screenshots (optional)
Your configuration (mandatory)
Replies: 2 comments
-
This is not a bug, but a missing feature of the font, whether intended or not.
-
The PDF uses a non-standard encoding, which makes it impossible to extract text. This is true not only for (Py-)MuPDF, but also for Adobe Acrobat, Nitro 5, and other PDF viewers.
You can confirm this by selecting some text in a viewer and pasting it into a word processor document.
Please be aware that showing text and extracting it are features that are not necessarily connected, as is the case here.
So all you can do is OCR the pages.
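
A minimal sketch of that OCR fallback, in case it helps. The helper names `looks_unextractable` and `extract_with_ocr_fallback`, the 0.3 threshold, and the 300 dpi setting are my own assumptions, not part of PyMuPDF. The check is only a heuristic: it counts replacement characters (U+FFFD) and private-use, unassigned, or control code points, so it will miss files where the broken encoding yields valid-looking but wrong letters. The OCR part uses `Page.get_textpage_ocr()`, which needs a working Tesseract installation; on older PyMuPDF releases the import is `import fitz` instead of `import pymupdf`.

```python
import unicodedata


def looks_unextractable(text: str, threshold: float = 0.3) -> bool:
    """Heuristic (assumption): True if text is empty or mostly unmapped characters."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return True  # nothing extracted at all
    suspicious = 0
    for c in chars:
        category = unicodedata.category(c)
        # U+FFFD replacement char, private-use (Co), unassigned (Cn), control (Cc)
        if c == "\ufffd" or category in ("Co", "Cn", "Cc"):
            suspicious += 1
    return suspicious / len(chars) >= threshold


def extract_with_ocr_fallback(path: str, language: str = "eng", dpi: int = 300) -> list[str]:
    """Return one text string per page, using OCR where plain extraction looks broken."""
    import pymupdf  # imported lazily so the heuristic works without PyMuPDF installed

    pages = []
    with pymupdf.open(path) as doc:
        for page in doc:
            text = page.get_text()
            if looks_unextractable(text):
                # Render the full page and let Tesseract recognize it
                textpage = page.get_textpage_ocr(language=language, dpi=dpi, full=True)
                text = page.get_text(textpage=textpage)
            pages.append(text)
    return pages
```

If the heuristic does not fire for your file, you can skip it and call `get_textpage_ocr()` on every page, at the cost of slower processing.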