
nino comments on The Library of Scott Alexandria - Less Wrong Discussion

45 Post author: RobbBB 14 September 2015 01:38AM




Comment author: nino 16 November 2015 05:58:01PM 3 points

I actually went through every post and manually copied out the relevant part of the HTML code. Then I pasted everything into my text editor (fun fact: Vim got quite slow handling the >3 MB HTML file, but Emacs handled the task really well) and cleaned it up, replacing every <br><br> with </p><p> and such. Then I put all the pictures into a folder and changed the references to point to my local files. Then I put it into calibre to create the EPUB and MOBI versions.
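For anyone repeating this, the two cleanup steps (turning doubled line breaks into paragraph breaks, and pointing image references at local files) can be sketched in Python roughly as below. This is only an illustration, not what was actually done: the function names and the `images` folder name are made up, and regular expressions are a shortcut that assumes fairly regular markup rather than a substitute for a real HTML parser.

```python
import re
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Two or more consecutive <br> tags (<br>, <br/>, <br />), optionally
# separated by whitespace.
DOUBLE_BR = re.compile(r"(?:<br\s*/?>\s*){2,}", re.IGNORECASE)

# The src attribute of an <img> tag, in double or single quotes.
IMG_SRC = re.compile(
    r"""(<img\b[^>]*?\bsrc=)(["'])(.*?)\2""",
    re.IGNORECASE | re.DOTALL,
)


def br_to_paragraphs(html: str) -> str:
    """Replace each run of doubled <br> tags with a paragraph break."""
    return DOUBLE_BR.sub("</p><p>", html)


def localize_images(html: str, image_dir: str = "images") -> str:
    """Rewrite <img src=...> to point at image_dir/<original file name>.

    image_dir is a hypothetical folder name; the images themselves still
    have to be downloaded into it separately.
    """
    def repl(match: re.Match) -> str:
        prefix, quote, url = match.groups()
        filename = PurePosixPath(urlparse(url).path).name
        if not filename:
            # No usable file name in the URL: leave the tag untouched.
            return match.group(0)
        return f"{prefix}{quote}{image_dir}/{filename}{quote}"

    return IMG_SRC.sub(repl, html)


def clean_fragment(html: str, image_dir: str = "images") -> str:
    """Apply both cleanup steps to one copied-out post body."""
    return localize_images(br_to_paragraphs(html), image_dir)
```

Calling `clean_fragment` on each copied-out post body and concatenating the results would give a single HTML file of the kind that was then loaded into calibre.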

In retrospect, I should have just written a script to do all that, because doing it by hand took way too long. The script would have had to handle the different sites differently (the LiveJournal stuff is especially messy), but it would have been so much faster. Like seriously.
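One way such a script could handle the different sites differently is a small lookup table keyed by domain. The sketch below is purely illustrative: the marker strings are obvious placeholders, not the real markup of LiveJournal or any other site, and the names `SITE_MARKERS`, `markers_for` and `extract_post` are hypothetical.

```python
from urllib.parse import urlparse

# Hypothetical (start, end) markers that delimit the post body on each site.
# These are placeholders; the real markers would have to be found by
# inspecting each site's pages.
SITE_MARKERS = {
    "livejournal.com": ("<!-- PLACEHOLDER-LJ-START -->", "<!-- PLACEHOLDER-LJ-END -->"),
    "example.org": ("<!-- PLACEHOLDER-START -->", "<!-- PLACEHOLDER-END -->"),
}

# Fallback for unknown sites: take everything inside <body>.
DEFAULT_MARKERS = ("<body>", "</body>")


def markers_for(url: str) -> tuple:
    """Pick the marker pair for the site a URL belongs to."""
    host = (urlparse(url).hostname or "").lower()
    for domain, markers in SITE_MARKERS.items():
        # Match the domain itself and any subdomain of it.
        if host == domain or host.endswith("." + domain):
            return markers
    return DEFAULT_MARKERS


def extract_post(url: str, page_html: str) -> str:
    """Return the text between the site's start and end markers."""
    start, end = markers_for(url)
    begin = page_html.find(start)
    if begin == -1:
        raise ValueError(f"start marker not found for {url}")
    begin += len(start)
    finish = page_html.find(end, begin)
    if finish == -1:
        raise ValueError(f"end marker not found for {url}")
    return page_html[begin:finish].strip()
```

Raising an error when a marker is missing, rather than returning an empty string, makes the messy pages show up immediately so they can be handled by hand or given their own entry in the table.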