kenneth myers
Towards A Unified Theory Of Alignment
Below is a first draft and I think solid, novel way of thinking about the alignment problem. There are many technical issues touched on that as yet have no solutions, however the hope is to unify the field into a consistent and robust paradigm. I'm looking for feedback. Thank you!...
The Ideal Speech Situation as a Tool for AI Ethical Reflection: A Framework for Alignment
Please be merciful, this is my first post. This is a potentially idealistic, but theoretically intriguing conceptual avenue for thinking about aligning AI systems. This is all extremely high level and doesn't propose any kind of concrete solutions. Rather, this is an attempt to reframe the alignment problem in what...
This of this as an alternative to CEV