How is this better than just telling the human overseers "no, really, be conservative about implementing things that might go wrong."
Building a filter can be the human overseers’ response to this suggestion.
What makes a two-part architecture seem appealing?
An analysis of risk that is separate from an HCH system mediates quality in that system.
What does "epistemic dominance" look like in concrete terms here - what are the observables you want to dominate HCH relative to, wouldn't this be very expensive, how does this translate to buying you extra safe
Building a filter can be the human overseers’ response to this suggestion.
An analysis of risk that is separate from an HCH system mediates quality in that system.
... (read more)