I have noticed a significant increase in the size of the HTML files produced by the tool compared to previous versions. Specifically, the output files are now more than 10 times larger than before. For example, a PDF file that originally converted to an HTML file of approximately 1 MB now results in an output of over 10 MB.
This increase in file size poses challenges in terms of storage, sharing, and loading times, especially for web-based applications where efficiency is critical. I wanted to bring this to your attention to understand if this is an expected change or if there might be an issue with the current configuration or version of the tool.
If there are any optimizations or settings I can adjust to reduce the file size, I would greatly appreciate your guidance. Alternatively, if this is a known issue being addressed, please let me know if there is an estimated timeline for a fix.
Thank you for your time and support. I look forward to your response.
Best regards
I have sent the example file; please take a close look at it. I am a developer, and after a brief investigation I found that the converted file does not contain any text tags.
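For anyone who wants to repeat this check on their own output, here is a minimal sketch using only the Python standard library. It counts how many elements in a converted file are text-carrying tags and how many are graphic tags. The two tag sets and the function name `summarize` are my own assumptions for illustration, not something the converter defines, so adjust them to whatever your output actually contains.

```python
from collections import Counter
from html.parser import HTMLParser

# Assumed groupings, chosen for illustration only.
TEXT_TAGS = {"p", "span", "text", "tspan"}
GRAPHIC_TAGS = {"svg", "path", "image", "img", "use"}


class TagCounter(HTMLParser):
    """Counts every opening (or self-closing) tag by name."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this.
        self.counts[tag] += 1


def summarize(html: str) -> dict:
    """Return the size in bytes plus text-tag and graphic-tag totals."""
    parser = TagCounter()
    parser.feed(html)
    parser.close()
    return {
        "size_bytes": len(html.encode("utf-8")),
        "text_tags": sum(parser.counts[t] for t in TEXT_TAGS),
        "graphic_tags": sum(parser.counts[t] for t in GRAPHIC_TAGS),
    }
```

To use it, read the converted file into a string (for example `open("output.html", encoding="utf-8").read()`, where the file name is a placeholder) and pass it to `summarize`. A result with zero text tags and many graphic tags would match what I am seeing.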
@ Stefan Ziegler
We have added a new mode to the PDF to HTML converter. Previously, only the "text only" and "embedded SVG" modes were available. Yesterday we added the "complete" mode, which converts the whole PDF and outputs plain HTML combined with some CSS. The output of this new mode is far smaller than that of the "embedded SVG" mode. We have noticed that the "embedded SVG" mode currently produces relatively large outputs, but this is not easy to change at the moment. It is on our backlog and we will try to optimize it. For now, the new "complete" mode is the best option.
First of all, thank you for your reply and effort, but I tried the "complete" mode and the result was not satisfactory. Did you not consider the issue of language? After I completed the conversion, the output contained only letters and numbers; the Chinese text was missing. You can verify this with the example file I sent you last time. Thanks again!
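In case it helps with reproducing the problem, below is a small sketch (Python standard library only) that counts Chinese characters in the visible text of an HTML file. The function name `count_cjk` is hypothetical, and the check only covers the basic CJK Unified Ideographs block (U+4E00 to U+9FFF), which is an assumption that should be enough for a quick comparison between the source PDF text and the converted output.

```python
import re
from html.parser import HTMLParser

# Basic CJK Unified Ideographs block only (an assumption for this check).
CJK_PATTERN = re.compile(r"[\u4e00-\u9fff]")


class VisibleText(HTMLParser):
    """Collects text content, skipping <script> and <style> bodies."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip = False

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip = True

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            self._skip = False

    def handle_data(self, data):
        if not self._skip:
            self.chunks.append(data)


def count_cjk(html: str) -> int:
    """Return the number of CJK ideographs found in the visible text."""
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    return len(CJK_PATTERN.findall("".join(parser.chunks)))
```

If the source PDF clearly contains Chinese text and `count_cjk` returns 0 for the converted HTML, the characters were dropped somewhere during conversion rather than just being hidden by styling.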