Evaluating LLaMA 3 for political sycophancy
TLDR: I evaluated LLaMA v3 (8B + 70B) for political sycophancy using one of the two datasets I created. The results for this dataset suggest that sycophancy definitely occurs in a blatant way for both models though more clearly for 8B than for 70B. There are hints of politically tainted...
Sep 28, 20242