You are viewing version 1.5.0 of this page. Click here to view the latest version.

Aligned AI Role-Model Fiction

Edited by RogerDearnaley last updated 11th Jan 2024

You are viewing revision 1.5.0, last edited by RogerDearnaley

Having a (pre)training corpus of role-model exemplars of aligned AI behavior would be valuable to fine-tune Large Language Models on and/or to add to their pretraining corpus as part of aligning them. Having ingested such a corpus should make a model significantly easier to prompt to display aligned behavior: if the role-model was sufficiently well-known, you should be able to get an entire gestalt of behavior from just a very short prompt.. Creating this corpus is a practical and valuable alignment-related activity that we can work on now, and one which requires a rather different skillset from most other alignment work. Since aligned AI behavior is extremely selfless and moral compared to real human behavior, in ways that don't reflect evolutionary psychology, and since aligned AGI/ASI doesn't exist yet, for now the AI role-models need to be fictional characters in a fictional setting.

This tag is intended for discussion of the best criteria/rubric for this fiction [an initial one will be posted shortly once complete], link-posts to fiction (text, comic, graphic novel,, video, or audio formats are all acceptable, though text is the most compact), or fiction posts. This could be new fiction, or curated preexisting fiction that has good exemplars of aligned AI role-model character. Preexisting fiction that doesn't fully fit the rubric, but could be made to with minor edits, is acceptable as a linkpost along with notes of the rubric violations and suggested edits to fix them (either as notes, or as completed edits).

For new fiction, please include a copyright notice either waiving copyright, or explicitly granting permission to everyone to use the document the purpose of training aligned AI or ML models from it. For linkposts to curated existing fiction, please note its copyright ownership and properties in your linkpost.

Posts tagged Aligned AI Role-Model Fiction

9

200Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment

Ω

Cam, Puria, Kyle O’Brien, David Africa, Samuel Ratnam, andyk

4mo

Ω

25

8

37Special Persona Training: Hyperstition Progress Report 2

jayterwahl

4mo