Causal graphs and counterfactuals

Stuart_Armstrong

0 Causal graphs and counterfactuals

by Stuart_Armstrong

30th Aug 2016

AI Alignment Forum

1 min read

0

A problem that's come up with my definitions of stratification.

Consider a very simple causal graph:

In this setting, $A$ and $B$ are both booleans, and $A = B$ with $75 %$ probability (independently about whether $A = 0$ or $A = 1$ ).

Suppose I now want to compute the counterfactual: suppose I assume that $B = 0$ when $A = 0$ . What would happen if $A = 1$ instead?

The problem is that $P (B | A)$ seems insufficient to solve this. Let's imagine the process that outputs $B$ as a probabilistic mix of functions, that takes the value of $A$ and outputs that of $B$ . There are four natural functions here:

$f_{0} (x) = 0$
$f_{1} (x) = 1$
$f_{2} (x) = x$
$f_{3} (x) = 1 - x$

Then one way of modelling the causal graph is as a mix $0.75 f_{2} + 0.25 f_{3}$ . In that case, knowing that $B = 0$ when $A = 0$ implies that $P (f_{2}) = 1$ , so if $A = 1$ , we know that $B = 1$ .

But we could instead model the causal graph as $0.5 f_{2} + 0.25 f_{1} + 0.25 f_{1}$ . In that case, knowing that $B = 0$ when $A = 0$ implies that $P (f_{2}) = 2 / 3$ and $P (f_{0}) = 1 / 3$ . So if $A = 1$ , $B = 1$ with probability $2 / 3$ and $B = 1$ with probability $1 / 3$ .

And we can design the node $B$ , physically, to be one or another of the two distributions over functions or anything in between (the general formula is $(0.5 + x) f_{2} + x (f_{3}) + (0.25 - x) f_{1} + (0.25 - x) f_{0}$ for $0 \leq x \leq 0.25$ ). But it seems that the causal graph does not capture that.

Owain Evans has said that Pearl has papers covering these kinds of situations, but I haven't been able to find them. Does anyone know any publications on the subject?

Counterfactuals

Personal Blog

0

Causal graphs and counterfactuals

4IAFF-User-52

0Stuart_Armstrong

New Comment

2 comments, sorted by

top scoring

Click to highlight new comments since: Today at 11:50 PM

[-]IAFF-User-529yΩ240

The problem is indeed that $P (B | A)$ is insufficient to compute a unique counterfactual---additional causal information is needed. Pearl's approach is to specify each observable variable as a deterministic function of its parents in the causal graph. Any uncertainty must be represented by a set of "exogenous" variables $U$ , which can feature in the functions for the observables. (See chapter 7 of Causality, or also An Axiomatic Characterization of Causal Counterfactuals.)

For example, your first process could be represented by the following causal model:

$A (X) = X B (A, Y) = \neg (A \oplus Y) P (X) = p P (Y) = 0.75$

The other processes might have different structures, equations, and distributions $P (X, Y)$ ---it's not possible in general to distinguish these purely from the distribution $P (A, B)$ .

[-]Stuart_Armstrong9yΩ000

Thank you! That sentence is what I was looking for "Any uncertainty must be represented by a set of “exogenous” variables U".

I'd been doing that, but without any theoretical justification for it.

Moderation Log

Curated and popular this week