Kawoomba comments on Open Thread, June 2-15, 2013 - Less Wrong Discussion
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (433)
Can you comment on your methodology - tools, wget scripts or what?
Scraping is done with python and lxml, and the scoring is done in Java. It came about as I needed to brush up on my Java for work, and was looking for an extensible project.
I also didn't push it to my personal repo, so all requests will have to wait until I'm back at work.