You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

gjm comments on Bragging Thread May 2015 - Less Wrong Discussion

6 Post author: Morendil 10 May 2015 01:25PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (39)

You are viewing a single comment's thread.

Comment author: jacob_cannell 11 May 2015 05:42:54AM *  19 points [-]

I've spent about 30% or so of my total productive hours for the last year working on a single programming problem: a high performance GPU implementation of a complex new algorithm. I"ve had the general idea in my head for over two years, and I've evaluated perhaps a dozen approaches, none of which reached even 1% work efficiency in the general case.

A few months ago I finally saw a path that could get up to 25% efficiency and beyond, but it was really complicated with dozens of subpasses which each required highly tuned specific parallel algorithms and lots of inner loop critical code.

Four days ago I finally saw a way to do all of that in a single pass (< ~ 1k lines of kernel code) while still meeting all of the constraints, and just today I got this new codepath compiling into a cubin that looks reasonably on target.

Comment author: shminux 11 May 2015 06:36:25AM *  2 points [-]

Do you work at Google or something?