Zvi's latest newsletter has a section on this topic! https://thezvi.substack.com/i/151331494/good-advice
+1 to you continuing with this series.
Couple of thoughts:
1. I recently found out about this new-ish social media platform. https://www.heymaven.com/. Good chance they are researching alternative recommendation algorithms.
2. What particular actions do you think the rationality/EA community could take that other big efforts, e.g. projects by Tristan Harris or Jaron Lanier, have not done enough of?
Thanks for the feedback! Have edited the post to include your remarks.
The 'evolutionary pressures' discussed by CGP Grey are not the direct gradient descent used to train an individual model. Instead, he is referring to the whole set of incentives we as a society put on AI models. This is similar to memes: there is no gradient descent on memes.
(Apologies if you already understood this, but it seems your post and Steven Byrnes's post focus on the training of individual models)
What is the status of this project? Are there any estimates of timelines?
Totally agree! This is my big weakness right now - hopefully as I read more papers I'll start developing a taste and ability to critique.
Huge thanks for writing this! Particularly liked the SVD intuition and how it can be used to understand properties of . One small correction I think. You wrote:
is simply the projection along the vector
I think is projection along the vector , so is projection on hyperplane perpendicular to
The physical object Eiffel Tower is correlated with itself.
It is highly predictive of the ability of the LLM to book flights to Paris, when I create an LLM-agent out of it and ask it to book a trip to see the Eiffel Tower.
I don't think we disagree here. To clarify, my belief is that there are threat models / solutions that are not affected by whether the AI has 'real' beliefs, and there are other threats/solutions where it does matter.
I actually do not understand the distinction between Definition 2 and Definition 3. Don't need to resolve it here. I've edited the post to include my uncertainty on this.