Consciousness can be understood as an interpersonally oriented perception of situations, where the mind of a social specimen instinctively focuses on agents or personas within any given context. Even inanimate or non-conscious aspects of reality are often personified – perceived as adversaries, allies, or caring lovers, dialing our sense of threat,...
Raymond is tired. He exhales, exhausted: >>I don't think we even know what alignment is, like we are not able to define it.<< I hop up on my chair in the Mediterranean restaurant: >I disagree. If you give me 3 seconds, I can define it.< >>---<< >Can we narrow it...
1. The Spark of G.D. @David Duvenaud's recent Guardian op-ed on gradual disempowerment [1] offers an original piece of foresight: it frames the AI alignment problem not as a single catastrophic explosion but as a slow erosion of human leverage. If intelligence is the ability to steer the future, then the central question...
My initial idea was: let's see where the small, interpretable model makes the same inference as the huge, dangerous model, and focus on those cases in the small model to help explain the bigger one. Quite likely I am wrong, but with a tiny chance of good impact, I have...