Vaniver comments on Open thread, Nov. 16 - Nov. 22, 2015 - Less Wrong Discussion

Post author: MrMind, 16 November 2015 08:03AM

Comment author: Vaniver, 16 November 2015 10:45:02PM, 3 points

Unfortunately I can't easily find a link to the presentation: it was a talk on Mondrian random forests by Yee Whye Teh back in 2014. I don't think there was necessarily anything special about the presentation itself; I just hadn't put much thought into random forests before then.

The very short version is that it would be nice if classifiers had fuzzy boundaries. If you look at the optimization underlying things like logistic regression, it turns out that when the underlying data is linearly separable, it will make the boundary as sharp as possible and put it in a basically arbitrary spot. Random forests, by averaging many weak classifiers, create one 'fuzzy' classifier that gets the probabilities mostly right in a computationally cheap fashion.
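To make that concrete, here is a toy sketch in plain Python (mine, not from the talk; the data, step counts, and the stump ensemble are all illustrative assumptions). Gradient descent on logistic regression with separable 1-D data keeps growing the weight, so predictions near the boundary become nearly hard 0/1; averaging many hard threshold classifiers with random split points — a crude stand-in for a random forest — yields graded probabilities instead.

```python
import math
import random

# Toy 1-D, linearly separable data: class 0 below x = 0, class 1 above.
xs = [-2.0, -1.5, -1.0, 1.0, 1.5, 2.0]
ys = [0, 0, 0, 1, 1, 1]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Logistic regression (no intercept) fit by gradient descent on log-loss.
# With separable data, the loss keeps decreasing as |w| grows, so the
# weight never converges -- the fitted boundary just gets ever sharper.
w = 0.0
for _ in range(5000):
    grad = sum((sigmoid(w * x) - y) * x for x, y in zip(xs, ys))
    w -= 0.1 * grad
print(f"logistic weight: {w:.1f}")
print(f"logistic P(y=1 | x=0.5) = {sigmoid(w * 0.5):.3f}")  # nearly 1

# Crude forest-like averaging: many decision stumps, each a hard 0/1
# classifier with a random threshold.  Their average changes gradually
# with x, giving a 'fuzzy' probability rather than a hard jump.
random.seed(0)
thresholds = [random.uniform(-2, 2) for _ in range(500)]

def forest_prob(x):
    return sum(1.0 for t in thresholds if x > t) / len(thresholds)

print(f"stump-average P(y=1 | x=0.5) = {forest_prob(0.5):.3f}")  # well below 1
```

The contrast at x = 0.5 is the point: the separable-data logistic fit is already nearly certain there, while the averaged stumps report a probability that degrades smoothly toward 0.5 as you approach the boundary.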

(This comment is way more opaque than I'd like, but most of the ways I'd want to elaborate on it require a chalkboard.)