Databases of human behaviour and preferences?

by Stuart_Armstrong

21st Apr 2020

AI Alignment Forum

Hey there, lesserwrongers!

I'm starting on some of the computer science/neuroscience of my ideas for deducing human preferences.

To do this, it would be useful to have datasets of human behaviour in relatively restricted situations. Possibly datasets of people playing simple games, or solving certain puzzles, or responding to messages, or something similar?

The question is intentionally vague, so that readers can come up with suggestions. What is needed is that the dataset be largish (more than a thousand humans at least), and be of real humans making non-trivial decisions in not-too-complicated circumstances.

Any suggestions?

Maybe Minecraft-related datasets can be helpful. I'm not familiar with them myself, but I found these two:

CraftAssist: A Framework for Dialogue-enabled Interactive Agents

paper
dataset (only humanbot.json seems potentially useful here.)
AN #60

MineRL: A Large-Scale Dataset of Minecraft Demonstrations

Stuart_Armstrong, 5y

Thanks!

Gordon Seidoh Worley, 5y

This challenge from 2018 basically asked about building a data set for training AI on human values (loosely construed so as to allow many approaches) and many of the submissions proposed ways to do it. You might find some interesting ideas there.

Caveat: I won the challenge by saying I didn't think such an approach would work.

sudhanshu_kasewa, 5y

Perhaps these could be useful:

1) Human Decision-Making dataset: https://osf.io/eagcd/ ; but from what I can tell, it has fewer than 300 human participants

2) User rating datasets, e.g. the Yahoo! Music, Netflix, or Amazon product review datasets. These could be trimmed in various ways to reduce complicatedness. The Netflix dataset is here: https://www.kaggle.com/netflix-inc/netflix-prize-data

The Amazon product review dataset is at http://liu.cs.uic.edu/download/data/ , but it says it is available upon request

3) Transactional data, e.g. https://data.world/uci/online-retail might shed some light on preferences (as transactional data could be a proxy for demand)
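
One way the trimming mentioned in item 2 might look, as a minimal sketch: keep only the most-rated items, then keep only the users who still have enough ratings left. The function name `trim_ratings`, its parameters, and the `(user, item, rating)` tuple layout are assumptions made for illustration; none of them come from the datasets listed above, and the toy data is invented.

```python
from collections import Counter

def trim_ratings(ratings, min_ratings_per_user=2, top_k_items=2):
    """Shrink a ratings list to a smaller, denser subset.

    `ratings` is a list of (user, item, rating) tuples (an assumed layout).
    """
    # Count how often each item was rated and keep only the most-rated ones.
    item_counts = Counter(item for _, item, _ in ratings)
    keep_items = {item for item, _ in item_counts.most_common(top_k_items)}
    filtered = [r for r in ratings if r[1] in keep_items]

    # Drop users who have too few ratings among the remaining items.
    user_counts = Counter(user for user, _, _ in filtered)
    return [r for r in filtered if user_counts[r[0]] >= min_ratings_per_user]

# Invented toy data: items A and B are rated three times each, C twice.
toy = [
    ("u1", "A", 5), ("u1", "B", 3), ("u1", "C", 1),
    ("u2", "A", 4), ("u2", "B", 2),
    ("u3", "A", 1), ("u3", "B", 2),
    ("u4", "C", 5),
]
trimmed = trim_ratings(toy, min_ratings_per_user=2, top_k_items=2)
```

Restricting both the item set and the user set is only one possible choice; which cut makes the decisions "not too complicated" would depend on the preference-learning method being tested.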

Stuart_Armstrong, 5y

Thanks!

https://hraf.yale.edu/ gl

ChristianKl, 5y

If you want more than just the individual level, the World Values Survey is a great resource.

Stuart_Armstrong, 5y

Suggested elsewhere by Max Daniel:

Ultimatum game or other widely studied games in psych/behavioral econ?
Ebay bidding, or other auctions?
Chess or other games?
Voting in elections
Gambling: casinos, online poker ...
Online dating behavior

Suggested by Ozzie Gooen:

This sounds a bit to me like psychology experiments with children, or perhaps some well studied psychology experiments (where there are large amounts of data, with relatively narrow options).
Websites would have more than enough data for narrow decisions, like, “Which ad will this user click”, or on Netflix, “Which movie/tv show will they select?”
There’s a fair bit of data for the main decisions of chess/StarCraft/etc., like “which race will be chosen / which character will be chosen / which strategy will be chosen”.

Suggested by Jan Brauner:

Any ML dataset with labels. The labels were created by humans. E.g. ImageNet: a human was shown an image and had to choose one of 1000 options.

Charlie Steiner, 5y

I think it's a little dull for the score to be a good proxy of human value, so games involving aesthetic choices are an obvious choice to me because of the plausibility of learning about interesting values without tons of knowledge of the world. (Compare RPGs that might also have limited actions and reflect human values, but require common-sense understanding of text to draw interesting conclusions about.)

'Sim' games all seem good for this (also, there's just something apropos about making a value learning AI build a nice house for the Sims), as do most contraption-building games. Though if the number of actions there is still too large, maybe you want something more on the speed of Color A Dinosaur, or just the character/avatar creation screen of some scrapeable thing.

On the other hand, maybe these things don't have enough planning, and you want something more like an open-world game that allows for self-expression. But I think the large action space is a barrier here.
