Worldwork for Ethics
Abstract: An alternative to the now-predominating models of alignment, corrigibility and "CEV", following a critique of these. The critique to show, in substance: CEV and corrigibility have the exact same problems - in effect, they're isomorphs of one another, and each equally unobtainable. This briefly shown, and then, in flat contradiction to point 22 of “AGI Ruin: A List of Lethalities”, there is a quite different way to characterize, so achieve, alignment, via a refutation of Kant’s supposedly irrefutable categorical imperative which refutation also is included; from this, an ethic designed to be intrinsically applicable for any volitional, so by assumption algorithmic, behavior altogether. Suggestions for implementation of such also included. Epistemic status: If this argument did not seem more true than anything else, this author would not now be alive to write it. It is intuitively true, and, reasoned such that no refutation is obvious. Posting it here, and again, is in hopes of a critique, even a refutation that it has not yet been given, perhaps because it’s So Bad It’s Not Even Wrong; if so on your examination, then please write to say so. That done, next steps could go through very quickly. For, whereas it has always and still seems true, and important – it is no longer so important that one can base a life upon it, if it cannot be lived-for. Anthropic-affecting alignment strategies We begin by considering the cause of Yudkowsky's despair, in failing to make usable CEV or corrigibility; thus because they're functionally the same, or at least, they lead to the same problem. The method which follows, then, is not "door number three" relative to what the "List of Lethalities" calls the "only options" for alignment; following the critique of present approaches (and this is only one such, informal, refutation of CEV and corrigibility’s efficacy), is a second way. CEV is designed to result in an at-once manifested fulfillment of human wants – and that in
Wanted to be loved. Loved, and to live a life not only avoiding fear. Epiphany (4/22/2024): am a fuckup. Have always been a fuckup. Could never have made anyone happy or been happy, and a hypothetical world never being born would have been a better world. Deserved downvotes, it has to be all bullshit, but LessWrong was supposed to make people less wrong, and should’ve given a comment to show why bullshit, but you didn’t, so LessWrong is a failure, too. So sterile, here, no connection with the world – how were we ever supposed to change anything? Stupid especially to’ve thought anyone would ever care. All fucked-up.
Life was more enjoyable when... (read 2092 more words →)