Skip to content

Extracting Chinese some texts are not right. #2367

Answered by JorjMcKie
buptyyf asked this question in Q&A
Discussion options

You must be logged in to vote

I have another question. Why pdf reader in mac or chrome can read this pdf correctly?

Showing this PDF is not the problem. You talked about text extraction. If you create a page pixmap with PyMuPDF, you will get the right picture.
But if using e.g. Adobe Acrobat or any other PDF viewer and then selecting the text with the cursor you will get the same wrong result.

Replies: 2 comments 3 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
3 replies
@buptyyf
Comment options

@buptyyf
Comment options

@JorjMcKie
Comment options

Answer selected by JorjMcKie
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
not a bug not a bug / user error / unable to reproduce
2 participants
Converted from issue

This discussion was converted from issue #2366 on April 23, 2023 06:39.