Skip to content

Strange character converted as space #1329

Discussion options

You must be logged in to vote

This is a so-called ligature. These are frequent character combinations which are encoded as one single glyph in fonts which support that.
As usual: not all font support them or all of them.
I know that MuPDF supports 6 ligatures: fi, fl, ffi, ffl, ff, st. I am afraid there is no support for this ligature "ti" in MuPDF.
But you can try to let MuPDF decompose ligatures: this is one of the option bits in the flags integer in text extractions: switch off "TEXT_PRESERVE_LIGATURES" in those flags, e.g. set flags=0 and see what happens.

Replies: 4 comments 4 replies

Comment options

You must be logged in to vote
1 reply
@Yichen-fqyd
Comment options

Answer selected by Yichen-fqyd
Comment options

You must be logged in to vote
1 reply
@Yichen-fqyd
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@Yichen-fqyd
Comment options

@JorjMcKie
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants