LLMs Universally Learn a Feature Representing Token Frequency / Rarity
Summary * LLMs appear to universally learn a feature in their embeddings representing the frequency / rarity of the tokens they were trained on * This feature is observed across model sizes, in base models, instruction tuned models, regular text models, and code models * In models without tied weights,...
Jun 30, 202413