All of xavierrg's Comments + Replies

Sounds very cool. I am working on something similar -- behavioral evals for a component of deception (theory of mind). Feel free to reach out if keen to chat!