Duplicated fonts, actually, is what a PDF generator should concern, not this converter.
But I will try to make it smarter if a local font is matched for multiple external fonts in PDF.
Currently I have to store separated, reencoded fonts in HTML.
Having a font name, available at least as a clue would be valuable for post processing.
Imagine a JPG- My thought about this issue is that the fonts are not introduced by pdf2html EX, they have been there in the PDF file. I check it out in Acrobat or Adobe reader and the first one have 132 (and you can see how repeat the name of font) but the optimization one only around 40 fonts.
So probably you are looking for a PDF optimization tool. At first I thank that you are right and saw the pdf properties and is true that it had a duplicated fonts.. Tell me when can I delete the link...(The first without optimization) https:// @Toneti777 That's actually what I mean, if you can do that with a PDF optimizer, you should do it. You can always optimize the PDF file before feed it into pdf2html EX. I check it out in Acrobat or Adobe reader and the first one have 132 (and you can see how repeat the name of font) but the optimization one only around 40 fonts. @Toneti777 Seems that there is only one font “stored", other fonts refer to the same binary with different encodings.
Encoding, if you expect a good font at the client side, the text must be encoded in Unicode correctly, which is not reliable either -- there are too many PDF files whose text cannot be copied out correctly. But later I may export the embedded names in CSS, such that you may do post-procssing if you wan.
Since PDF is not designed for Web, actually I don't see these fonts often used in PDF (except for Times), at least in my test cases. And this should be done carefully since malicious code could be injected in this way.