Linear steerability in continuous chain-of-thought reasoning
(This project was done as a ~20h application project to Neel Nanda's MATS stream, and is posted here with only minimal edits. The results seem strange, I'd be curious if there's any insights.) Summary Motivation Continuous-valued chain-of-thought (CCoT) is a likely prospective paradigm for reasoning models due to computational advantages,...