Text extraction incomplete on Linux (resolution: missing system font) #288

Closed Answered by mara004
maivan-hoa asked this question in Q&A

I believe the culprit might be proprietary Windows fonts such as Arial and Times New Roman, which distros can't ship for licensing reasons. The PDF viewer then substitutes some other available font, which might not contain the special characters.
Presumably it would work if you copy the Windows fonts over to Ubuntu.

See the attached inspection screenshot from Okular
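
One way to check this on the Linux side is to ask fontconfig which font it would actually hand out for a given family name. The following is only a minimal sketch: it assumes the `fc-match` tool from fontconfig is installed, the helper names (`parse_fc_match`, `is_substituted`, `matched_family`) are made up for illustration, and the PDF library may resolve fonts differently than fontconfig does.

```python
import shutil
import subprocess
from typing import Optional


def parse_fc_match(output: str) -> str:
    """Return the first family name from `fc-match -f '%{family}'` output.

    fontconfig may list several comma-separated names for one family.
    """
    return output.split(",")[0].strip()


def is_substituted(requested: str, matched: str) -> bool:
    """True if fontconfig resolved the request to a different family."""
    return requested.casefold() != matched.casefold()


def matched_family(requested: str) -> Optional[str]:
    """Ask fontconfig which family it picks for `requested`.

    Returns None if fc-match is unavailable or fails (assumption: fontconfig
    is how the system resolves fonts).
    """
    if shutil.which("fc-match") is None:
        return None
    result = subprocess.run(
        ["fc-match", "-f", "%{family}", requested],
        capture_output=True,
        text=True,
    )
    if result.returncode != 0:
        return None
    return parse_fc_match(result.stdout)


if __name__ == "__main__":
    # Font names taken from the explanation above.
    for name in ("Arial", "Times New Roman"):
        family = matched_family(name)
        if family is None:
            print(f"{name}: could not query fontconfig")
        elif is_substituted(name, family):
            print(f"{name}: substituted by {family}")
        else:
            print(f"{name}: available")
```

If a font is reported as substituted, that would be consistent with the explanation above. After copying the Windows fonts into a user font directory (commonly `~/.local/share/fonts`) and running `fc-cache -f`, the check should report the requested family as available.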

Answer selected by maivan-hoa