All discussion post titles, points, and dates as an excel sheet

15 Post author: Alexandros 03 June 2014 02:38PM

You can find it here.

Earlier today I wanted to quantify whether lesswrong has stopped being a well kept garden. So I wrote a scraper to produce the above dataset, so that anyone that wants to do the analysis, can.

All data is as of a few minutes ago.

For programmers: You can see the source here, it's made to run on scraperwiki, but it will time out after about 3000 articles. At that point you need to adjust the initial value of the uri variable to be the last uri printed. Repeating this process once more will allow you to reach the end. Have fun.

 

Comments (10)

Comment author: Lumifer 03 June 2014 05:24:11PM 3 points [-]

Earlier today I wanted to quantify whether lesswrong has stopped being a well kept garden.

Then you probably should start by quantifying what does "being a well kept garden" mean.

Comment author: Alexandros 03 June 2014 06:35:07PM 6 points [-]

True. I guess I was being a bit cheeky. LW is no longer being kept at all AFAICT (or just on maintenance), just wanted to see if it's on an upward or downward trajectory. I obviously think there is a problem, and I have a solution to suggest, but I wanted to double check my intuition with the numbers.

Comment author: Error 03 June 2014 05:07:55PM 3 points [-]

Authors might be an interesting field to add; one of the more plausible measures mentioned in the other thread was a drop in posts from specific prolific authors.

Comment author: Alexandros 03 June 2014 06:33:32PM 2 points [-]

post updated with code, go crazy! number of comments is another one I'd add if I ran it again.

Comment author: RichardKennaway 03 June 2014 07:56:50PM 2 points [-]

Earlier today I wanted to quantify whether lesswrong has stopped being a well kept garden.

Before you look at the numbers, what metrics are you going to use to quantify this?

Comment author: Alexandros 03 June 2014 08:41:34PM 2 points [-]

posts per month, upvotes per month. (i understand score is positive minus negative, but it cancels out). potentially comments per month too, but I didn't fetch that data. substitute month for your preferred granularity of course.

Comment author: Dr_Manhattan 03 June 2014 04:22:01PM 2 points [-]

for +10 points, post the scraper. (but put a throttle in by default)

Comment author: Alexandros 03 June 2014 06:32:59PM 3 points [-]

done

Comment author: Gunnar_Zarncke 03 June 2014 04:11:55PM 0 points [-]

I wanted to quantify whether lesswrong has stopped being a well kept garden.

I'm very curious about you results.

Comment author: Alexandros 03 June 2014 06:24:54PM 0 points [-]

Well, it's not being 'kept' anymore for one, but I didn't need analysis for that. I guess the question is if it is flourishing or dying out.