jacob_cannell comments on Bragging Thread May 2015 - Less Wrong

6 Post author: Morendil 10 May 2015 01:25PM

Comments (39)

Comment author: jacob_cannell 11 May 2015 05:42:54AM * 19 points

I've spent about 30% of my total productive hours over the last year working on a single programming problem: a high-performance GPU implementation of a complex new algorithm. I've had the general idea in my head for over two years, and I've evaluated perhaps a dozen approaches, none of which reached even 1% work efficiency in the general case.

A few months ago I finally saw a path that could get up to 25% efficiency and beyond, but it was really complicated, with dozens of subpasses that each required highly tuned, specialized parallel algorithms and lots of critical inner-loop code.

Four days ago I finally saw a way to do all of that in a single pass (under roughly 1k lines of kernel code) while still meeting all of the constraints, and just today I got this new codepath compiling into a cubin that looks reasonably on target.

Comment author: shminux 11 May 2015 06:36:25AM * 2 points

Do you work at Google or something?