Comments

Magus (10)

I think that visiting a drug rehab center would be much less convincing (though much faster) than the method suggested above. A rehab center will look bad whether or not the bad effects are rare, because its population is selected for people whose effects were bad enough to land them in rehab.

(If his argument were that the bad effects don't exist at all, a rehab center would be good evidence against that; but it sounds like his position is rather that they're rare and mild enough to be worth it.)

In general, if you want to convince someone who takes this community's ideas seriously of something, show them evidence that would only exist if the thing you want to convince them of is true, and consider explicitly laying out why you'd expect that evidence only in that case.
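
To put the same point in odds form (a sketch with my own labels: $H$ = "the bad effects are common", $E$ = the evidence you show him):

$$\frac{P(H\mid E)}{P(\neg H\mid E)} \;=\; \frac{P(E\mid H)}{P(E\mid \neg H)} \cdot \frac{P(H)}{P(\neg H)}$$

A rehab center looks bad whether the effects are rare or common, so $P(E\mid H) \approx P(E\mid \neg H)$ and the likelihood ratio is close to 1. Evidence that "would only exist if the thing is true" is exactly evidence with $P(E\mid \neg H) \approx 0$, which makes the ratio, and hence the update, large.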

Magus (21)

I don't have much of an answer for you, but I wanted to explicitly thank you for posting this thread. I'm in a similar situation and wouldn't have thought to ask here, but should have.

Magus (110)

[Hi! I've been lurking for a long time, and this seems like as good a reason as any to actually put something out there. Epistemic status: low confidence, but it seems low-risk, high-reward to try. This isn't intended to be a full list; I don't have the expertise for that. I'm just posting any ideas I have that I don't already see here. These probably already exist and I just don't know their names.]

1) Input masking: basically, for an oracle/task AI, you ask the AI for a program that solves a slightly more general version of your problem, without giving the AI the information necessary to narrow it down, then run the program on your actual case (plus, probably, some simple test cases you know the answers to, to make sure it solves the problem).
This lets you penalize the AI for the complexity of the output program, so it gives you something narrow instead of a general reasoner. (A rough harness for this is sketched below the list.)
(Obviously you still have to be sensible about the output program: don't post the code to GitHub or give it internet access.)

2) Reward function stability. We know we might have made mistakes specifying the reward function, but we have some example test cases we're confident in. Tell the AI to search for a bunch of different possible reward functions that give the same outputs as the existing one on those cases, and filter potential actions by whether any of those functions sees them as harmful. (Also sketched below.)
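
Since both ideas are mechanical enough to pseudocode, here are rough Python sketches. First, the input-masking harness from (1): `request_program` is a hypothetical stand-in for whatever oracle call you actually have (stubbed here so the harness runs), and the restricted `exec` is only illustrative, not a real sandbox.

```python
def request_program(general_spec: str) -> str:
    """Stub for the oracle call; a real version would query the AI with a
    spec of the *generalized* problem, never the actual instance."""
    return "def solve(xs):\n    return sorted(xs)\n"

def complexity_penalty(source: str) -> int:
    """Crude complexity measure (program length); penalizing this pushes
    the oracle toward narrow programs instead of general reasoners."""
    return len(source)

def passes_known_cases(source: str, known_cases) -> bool:
    """Run the returned program on test cases with known answers.
    NOTE: restricting __builtins__ is illustrative only, not a sandbox."""
    namespace = {}
    exec(source, {"__builtins__": {"sorted": sorted}}, namespace)
    solve = namespace["solve"]
    return all(solve(inp) == out for inp, out in known_cases)

# Ask for a solver for the *general* problem (sort any list of integers);
# the real instance we care about is never shown to the oracle.
source = request_program("Write solve(xs) that sorts any list of integers.")
known = [([3, 1, 2], [1, 2, 3]), ([], [])]

if passes_known_cases(source, known) and complexity_penalty(source) < 500:
    real_instance = [9, 4, 7]  # the case the oracle never saw
    namespace = {}
    exec(source, {"__builtins__": {"sorted": sorted}}, namespace)
    print(namespace["solve"](real_instance))  # -> [4, 7, 9]
```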
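And a sketch of the reward-ensemble filter from (2); all the names and the harm threshold here are made up for illustration.

```python
# Instead of trusting one possibly-misspecified reward function, keep an
# ensemble of functions that all agree on the trusted test cases, and
# veto any action that *any* member of the ensemble scores as harmful.

HARM_THRESHOLD = 0.0  # assumed convention: reward below this counts as harmful

def agrees_on_trusted_cases(candidate, trusted_cases, tol=1e-6):
    """Keep only reward functions matching the hand-checked cases."""
    return all(abs(candidate(state) - reward) < tol for state, reward in trusted_cases)

def filter_actions(actions, outcome_of, reward_ensemble):
    """Drop any action that at least one ensemble member flags as harmful."""
    safe = []
    for action in actions:
        outcome = outcome_of(action)
        if all(r(outcome) >= HARM_THRESHOLD for r in reward_ensemble):
            safe.append(action)
    return safe

# Toy example: states are numbers, rewards are functions of the state.
trusted_cases = [(1.0, 1.0), (2.0, 2.0)]  # (state, reward) pairs we verified
candidates = [lambda s: s, lambda s: s if s < 3 else -1.0]
ensemble = [r for r in candidates if agrees_on_trusted_cases(r, trusted_cases)]

actions = [0.5, 5.0]
print(filter_actions(actions, outcome_of=lambda a: a, reward_ensemble=ensemble))
# -> [0.5]; the second candidate scores 5.0 as harmful, so that action is vetoed
```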