Traceback (most recent call last):
  File "pdf2txt.py", line 115, in <module>
    if __name__ == '__main__': sys.exit(main(sys.argv))
                                        ^^^^^^^^^^^^^^
  File "pdf2txt.py", line 110, in main
    interpreter.process_page(page)
  File "/lib/python3.12/site-packages/pdfminer/pdfinterp.py", line 841, in process_page
    self.render_contents(page.resources, page.contents, ctm=ctm)
  File "/lib/python3.12/site-packages/pdfminer/pdfinterp.py", line 854, in render_contents
    self.execute(list_value(streams))
  File "/lib/python3.12/site-packages/pdfminer/pdfinterp.py", line 869, in execute
    name = keyword_name(obj).decode('ascii')
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
UnicodeDecodeError: 'ascii' codec can't decode byte 0x85 in position 0: ordinal not in range(128)
Description
Crash on non-ASCII input:
UnicodeDecodeError: 'ascii' codec can't decode byte 0x85 in position 0: ordinal not in range(128)
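The failure can be reproduced in isolation, without the PDF: the traceback shows a keyword's bytes being decoded with the strict 'ascii' codec, and a 0x85 byte cannot pass that. Below is a minimal sketch of the failure and of one possible tolerant alternative. The helper decode_keyword is hypothetical and is not part of pdfminer; whether escaping, skipping, or rejecting such keywords is the right behaviour is for the maintainers to decide.

```python
def decode_keyword(raw: bytes) -> str:
    """Hypothetical helper (not pdfminer API): decode an operator name
    without raising on bytes outside the ASCII range."""
    # 'backslashreplace' turns undecodable bytes into \xNN escapes
    # instead of raising UnicodeDecodeError.
    return raw.decode("ascii", errors="backslashreplace")


# Strict decoding, as in the traceback, raises on byte 0x85.
try:
    b"\x85".decode("ascii")
except UnicodeDecodeError as exc:
    strict_error = str(exc)
else:
    strict_error = None

# Tolerant decoding keeps going and makes the bad byte visible.
tolerant = decode_keyword(b"\x85")
plain = decode_keyword(b"Tj")
```

With this approach an ordinary operator name such as b"Tj" decodes unchanged, while the stray byte becomes a printable escape that can be logged or ignored.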
Steps to reproduce the bug
To make it easier, the command below downloads mc3362.pdf and runs pdf2txt.py on it:
wget https://github.com/user-attachments/files/16489263/mc3362.pdf && pdf2txt.py mc3362.pdf
Error produced