I'd like to read some large document scans with embedded OCR text from the Internet Archive. Unfortunately, the PDF pages render very slowly in my document readers (Okular, Evince, Zathura). I previously used the DJVU files for this reason, but since the Internet Archive stopped creating them I am out of options. I have tried converting to DJVU myself with pdf2djvu, djvudigital, some online tools, and even by going through JPEG first. Each time I got very large files, as the programs seem to have trouble separating the foreground from the background. So I have two questions:

  1. How did the Internet Archive team previously produce their DJVUs? Can their process be replicated or approximated?
  2. The second link suggests slow PDF rendering has been an issue for a while (at least on Linux). Are there any workarounds, such as faster backends? I tried linearizing the files, but that didn't improve things.

To test the issue, consider this volume of Poincaré's collected works.

  • The PDFs open just fine in Acrobat Reader 11 running under Wine. Much faster than in native Linux PDF readers. Commented Nov 29, 2022 at 14:12
  • Thanks for confirming it is just a Linux issue. It would be very sad if my only option were to install Acrobat. Might this be a poppler issue? Commented Nov 29, 2022 at 14:33
  • I've no idea, to be honest. Please give Firefox and Chrome a try - they have built-in PDF renderers. Commented Nov 29, 2022 at 14:37
