Stuart_Armstrong comments on The autopilot problem: driving without experience - Less Wrong

23 Post author: Stuart_Armstrong 13 May 2013 12:42PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (37)

You are viewing a single comment's thread. Show more comments above.

Comment author: Stuart_Armstrong 13 May 2013 04:57:42PM 2 points [-]

Right, I don't understand what you're saying there. Can you develop it?

Comment author: Luke_A_Somers 13 May 2013 05:41:10PM 9 points [-]

So you have a batch of things that need to pass muster. The failure mode presented above is that you'll get bored with just saying 'pass, pass, pass...'

The corrective proposed is to ask for the worst item, whether or not it passes, in addition to asking for rejects.

It would be something to think about while looking at a bunch of good ones, and would keep one in practice... if one tries. If you just fake it and no one can tell because they're all passes anyway, then it doesn't work.

Comment author: Pentashagon 14 May 2013 07:11:58PM 1 point [-]

It may also be useful to identify the best thing. The difference between the best and worst is probably a useful measure of quality control as well as ensuring the tests are general enough to detect good as well as bad.

Comment author: Luke_A_Somers 14 May 2013 08:07:20PM 4 points [-]

If your process is good enough that this is a problem, then 'so good you can't tell it's not perfect' could well be the most common case. In any case, it's most important to concentrate the expertise around the border of OK and not.

Comment author: Stuart_Armstrong 13 May 2013 09:41:40PM 0 points [-]

Interesting. May be applicable to some of the situations we're studying...

Comment author: Luke_A_Somers 13 May 2013 10:17:31PM 1 point [-]

Just look out that you don't end up picking out something that's not the worst, and think you're still doing a good job.

Comment author: Decius 14 May 2013 03:12:57AM -1 points [-]

The failure mode presented above is that you'll get bored with just saying 'pass, pass, pass...'

That looks like an ideal case for automation...

Comment author: Luke_A_Somers 14 May 2013 02:19:47PM 2 points [-]

And then you miss the one in ten thousand that was no good.

Comment author: Decius 15 May 2013 12:15:06AM 0 points [-]

If you are using humans to mass-test for a failure rate of 1/10,000 you are doing something wrong. Ship ten thousand units, let the end-user test them at the time of use/installation/storage, and ship replacement parts to the user who got a defective part. That way no one human gets bored with testing that part (though they might get bored with inspecting good parts in general)

Comment author: Luke_A_Somers 15 May 2013 12:39:40AM 1 point [-]

Sounds great if failure is acceptable. I don't want my parachute manufacturer taking on that method, though.

Comment author: Decius 15 May 2013 03:04:13AM 0 points [-]

Don't you demand that your parachute packer inspects it when he packs it? Especially given that more than zero parachutes will be damaged after manufacture but before first use.