AI safety is a group project for all of us. We need everyone to participate, from the ESFPs to the INTJs!
Capturing the essence and subtleties of core values needs input across a broad span of humanity.
Assumption 1 - Large language models will be the basis of AGI.
Assumption 2 - One way to add the abstraction of a value like "kindness is good" into the model is to add a large corpus of written material on Kindness during training (or retraining).
The Kindness Project is a website with a prompt, like a college essay. Users add their stories to the open collection based on the prompt: "Tell a story about how you impacted or were...
The problem is that any group generating alpha would likely lose alpha per person if it let random additional people in.
Think of Renaissance's Medallion Fund. It has been closed to outside investment since near its inception 30 years ago. For the average person, the prerequisite for joining would be something like a true-genius-level PhD in a related STEM field.
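To make the alpha-per-person point concrete, here is a minimal sketch. Everything in it is an illustrative assumption, not something from the post: the function names, the even split among members, and all the numbers are hypothetical.

```python
def alpha_per_person(total_alpha: float, members: int) -> float:
    """Average alpha per member, assuming (hypothetically) an even split."""
    if members <= 0:
        raise ValueError("members must be positive")
    return total_alpha / members


def alpha_per_person_after_adding(
    total_alpha: float,
    members: int,
    newcomers: int,
    newcomer_contribution: float,
) -> float:
    """Alpha per member after admitting newcomers.

    Each newcomer is assumed to add `newcomer_contribution` to the
    group's total alpha and to take an equal share of the new total.
    """
    new_total = total_alpha + newcomers * newcomer_contribution
    return alpha_per_person(new_total, members + newcomers)


# Hypothetical group: 10 members generating 100 units of alpha in total.
before = alpha_per_person(100.0, 10)

# Admit 5 random people who each add only 2 units.
diluted = alpha_per_person_after_adding(100.0, 10, 5, 2.0)

# Admit 5 people who each add more than the current per-person average.
accretive = alpha_per_person_after_adding(100.0, 10, 5, 20.0)

print(before, diluted, accretive)
```

Under these assumptions, a newcomer only leaves existing members no worse off if they add at least the current per-person average, which is one way to read why such groups screen so heavily.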
A closely related analogue is poker players who use solvers to improve their game. The starting stakes are a bit lower. The solvers cost a few thousand dollars plus the equipment to run them, a class on how to use them runs a similar couple thousand bucks, and then there is the...