x

LESSWRONG
LW

ceru23

Subscribe

Message

6

1

4y

Machines vs Memes Part 3: Imitation and Memes

This is the last in a series of three posts on the interlinkages between Memetics and AI Alignment. It was written as an output from the 2022 AI Safety Camp, for the ‘Impact of Memetics on Alignment’ team, coached by Daniel Kokotajlo and comprising Harriet Farlow, Nate Rush and Claudio...

Jun 1, 20227

ceru23

Subscribe

Message

6

1

4y

Machines vs Memes Part 3: Imitation and Memes

This is the last in a series of three posts on the interlinkages between Memetics and AI Alignment. It was written as an output from the 2022 AI Safety Camp, for the ‘Impact of Memetics on Alignment’ team, coached by Daniel Kokotajlo and comprising Harriet Farlow, Nate Rush and Claudio...

Jun 1, 20227

Machines vs Memes Part 3: Imitation and Memes

ceru23

4y

This is the last in a series of three posts on the interlinkages between Memetics and AI Alignment. It was written as an output from the 2022 AI Safety Camp, for the ‘Impact of Memetics on Alignment’ team, coached by Daniel Kokotajlo and comprising Harriet Farlow, Nate Rush and Claudio Ceruti. Please read on to post 1 and post 2. We are not AI Safety experts so any and all feedback is greatly appreciated :)

TL;DR: Linking the concept of imitation between the domains of memetic and AI safety leads to considering how misalignment could spread memetically; this helps to identify misaligned agents able to generate a contagious spread of memes of misalignment. In... (read 2021 more words →)

7