This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Reinforcement Learning using Layered Morphology (RLLM)
LW
Login
Reinforcement Learning using Layered Morphology (RLLM)
6
Intergenerational Knowledge Transfer (IKT)
MiguelDev
1mo
0
5
RLLMv10 experiment
MiguelDev
2mo
0
20
A T-o-M test: 'popcorn' or 'chocolate'
MiguelDev
2mo
13
7
Can RLLMv3's ability to defend against jailbreaks be attributed to datasets containing stories about Jung's shadow integration theory?
MiguelDev
2mo
2
4
Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B)
MiguelDev
3mo
0
16
GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks
MiguelDev
3mo
4
6
Research Log, RLLMv2: Phi-1.5, GPT2XL and Falcon-RW-1B as paperclip maximizers
MiguelDev
4mo
0
7
Reinforcement Learning using Layered Morphology (RLLM)
MiguelDev
5mo
0
5
An examination of GPT-2's boring yet effective glitch
MiguelDev
23d
3