Other approaches to alignment would deserve just as much skepticism as mechanistic interpretability if they faced as much scrutiny.