Michael Nielsen explains Judea Pearl's causality

gwern

The smoking node has a causal influence on the tar node, but there's also a random factor.

I don't see how this is true of either approach.

Let X_smokes and X_tar be the random variables associated with your nodes. Under the first approach, if there are no other "exogenous" Y-nodes, then there is a function f_tar such that X_tar = f_tar(X_smokes). Doesn't that mean that whether you have tar is entirely a function of whether you smoke?

Maybe I'm mistaken about what it means for one random variable to be a function of another. We can understand X_smokes and X_tar formally as functions from the sample space Ω of people* to the state space {0,1} of Boolean values, right? Usually, to say that one function f is a function of another function g is to say that, for some function F, f(x) = F(g(x)) for each element x of the domain. That is, the value of f at x is entirely determined by the value of g at x.

If this convention applies when the functions are random variables, then to say that X_tar = f_tar(X_smokes) is to say that, for each person 𝜔, X_tar(𝜔) = f_tar(X_smokes(𝜔)). Thus, for every smoker 𝜔, X_tar(𝜔) has the same value, namely f_tar(1). That is, the answer to whether a smoker has tar in their lungs is always the same. Similarly, among all nonsmokers, the answer f_tar(0) to whether they have tar in their lungs is always the same. Therefore, whether or not you smoke determines whether or not you have tar in your lungs.

Do people mean something different when they say that one random variable is a function of another? If so, what do they mean? If not, where is there room for a "random factor" when there are no exogenous Y-variables, even under the first approach described by Nielsen?

* ETA: I originally had the sample space Ω being the set of all possible worlds, which seems wrong on reflection.

30

Michael Nielsen explains Judea Pearl's causality

30

30

30

Michael Nielsen explains Judea Pearl's causality

30

30