'Extract vectorized data from a pdf with non-embedded fonts
My question is about how PDF viewers handle fonts used in a PDF that are not embedded.
I'm using software (pdfjs-dist) to generate thumbnails from PDF's and noticed that text is often missing in the resulting image. Upon further inspection it turns out that fonts are missing on the OS (which is a node docker container (linux)).
Now, I have heard that it's possible to always generate a render from a PDF, using some sort of layer in the PDF that contains vectorized data which you can view, even when the fonts are not embedded in the PDF nor available in the operating system.
Has anyone heard of this mechanism that can point me to its technical name?
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|
