Brain-inspired LLM alignment
How can we apply the lessons of brain-like cognitive architecture to modern LLMs? The core architecture described by Steven Byrnes is most concisely documented here. Some obvious comparisons between LLMs and brain-like architectures: 1. Base LLMs are purely autoregressive predictors. If you squint this can be made to fit the...