VincentYu comments on LessWrong Help Desk - free paper downloads and more (2014) - Less Wrong

30 Post author: jsalvatier 16 January 2014 05:51AM

Comments (279)

Comment author: VincentYu 05 October 2015 04:04:30AM 3 points

Got the whole PDF from HathiTrust. I think Chart I is missing from the scan.

Comment author: gwern 05 October 2015 04:39:38PM * 0 points

Thanks. I added some metadata and it blew up to 14M, which is unfortunate. Chart I does seem to be missing in both the PDF and the online version; I suspect that it's missing from the physical copy at UMich ('pocket' sounds like something that might go missing).

Comment author: VincentYu 06 October 2015 08:20:55AM 2 points

> it blew up to 14M

The object streams for indirect objects have been unpacked and stripped away, leaving their contents uncompressed. Use qpdf to regenerate compressed object streams:

qpdf --object-streams=generate in.pdf out.pdf

(The --stream-data=compress option is already set by default.)

While you are at it, might as well re-linearize the PDF for online readers with low bandwidth:

qpdf --object-streams=generate --linearize in.pdf out.pdf
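
To check whether regenerating the object streams actually helps for a given file, the command above can be wrapped so that it prints the size before and after. This is only a minimal sketch: `shrink_pdf` and `file_size` are hypothetical helper names, not part of qpdf, and it assumes qpdf is on the PATH.

```shell
#!/bin/sh
# Sketch only: shrink_pdf and file_size are made-up helper names.

# Print the size of a file in bytes (tr strips the padding some wc builds add).
file_size() {
    wc -c < "$1" | tr -d ' '
}

# Regenerate compressed object streams, linearize, and report both sizes.
shrink_pdf() {
    src=$1
    dst=$2
    if ! command -v qpdf >/dev/null 2>&1; then
        echo "qpdf not found on PATH" >&2
        return 127
    fi
    qpdf --object-streams=generate --linearize "$src" "$dst"
    rc=$?
    # qpdf exits with 3 when it wrote the output but issued warnings,
    # so only other nonzero codes are treated as failure here.
    if [ "$rc" -ne 0 ] && [ "$rc" -ne 3 ]; then
        return "$rc"
    fi
    echo "before: $(file_size "$src") bytes"
    echo "after:  $(file_size "$dst") bytes"
}
```

Usage would be `shrink_pdf in.pdf out.pdf`, with `in.pdf` and `out.pdf` standing in for the real file names as in the commands above.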

Comment author: gwern 06 October 2015 04:01:49PM 0 points

That seems to work. I tried gs, Gscan2pdf, and pdf2djvu but they all either didn't reduce size or segfaulted.