I've written up a 2-page explanation and proof of Aumann's agreement theorem. Here is a direct link to the pdf via Dropbox.
The proof in Aumann's original paper is already very short and accessible. (Wei Dai gave an exposition closely following Aumann's in this post.) My intention here was to make the proof even more accessible by putting it in elementary Bayesian terms, stripping out the talk of meets and joins in partition posets. (Just to be clear, the proof is just a reformulation of Aumann's and not in any way original.)
I will appreciate any suggestions for improvements.
Update: I've added an abstract and made one of the conditions in the formal description of "common knowledge" explicit in the informal description.
Update: Here is a direct link to the pdf via Dropbox (ht to Vladimir Nesov).
Update: In this comment, I explain why the definition of "common knowledge" in the write-up is the same as Aumann's.
Update 2020-05-23: I fixed the Dropbox link and removed the Scribd link.
Here is a proof of the equivalence between my definition and Aumann's for "common knowledge". I'm assuming some familiarity with set partitions.
Aumann's definition is in terms of the Kolmogorov model of probability. In particular, a proposition is identified with the set of possible worlds in which the proposition is true.
Let P₁ be that partition of the possible worlds such that two worlds share the same block in P₁ if and only if I condition on the same body of knowledge when computing posterior probabilities in the two worlds*. Let P₂ be the analogous partition of the possible worlds for you. For each world w, let P₁(w) denote the block in my partition containing w, and let P₂(w) be the block in your partition containing w. Let P denote the finest common coarsening of our respective partitions**, and let P(w) be the block of P containing w.
Fix a proposition A. Let p be my posterior probability for A, and let q be yours (in the actual world). Let E be the set of worlds in which I assign posterior probability p to A, while you assign posterior probability q. Formally:
Let w₀ be the actual world. Aumann's definition says that our respective posterior probabilities are common knowledge (in the actual world) if P(w₀) ⊆ E.
On the other hand, I said that our posterior probabilities are common knowledge when there is a proposition C that is true in the actual world and satisfies the following three conditions:
For each world w in C, P₁(w) ⊆ C and P₂(w) ⊆ C.
For each world w in C, p = prob(A | P₁(w)).
For each world w in C, q = prob(A | P₂(w)).
These definitions are logically equivalent.
For, suppose that our posteriors are common knowledge in Aumann's sense. Then, by setting C = P(w₀), we get that the posteriors are also common knowledge in my sense.
On the other hand, suppose that the posteriors are common knowledge in my sense. It is given that C happened in the actual world, meaning that w₀ ∈ C. By Condition 1, C is a disjoint union of P₁-blocks and a disjoint union of P₂-blocks. This means that C is a block containing w₀ in some common coarsening of P₁ and P₂. Hence, C contains the block P(w₀) containing w₀ in the finest common coarsening of P₁ and P₂. That is, P(w₀) ⊆ C. Conditions 2 and 3 together imply that C ⊆ E. Thus we get that P(w₀) ⊆ E, so our posteriors are also common knowledge in Aumann's sense.
* That this relation induces a partition assumes, in effect, that I know what I don't know — i.e., that there are no unknown unknowns.
** Aumann calls this the meet of P₁ and P₂, because he considers coarsenings to be lower in the partial order of partitions. However, people often use the opposite convention, in which case P would be called the join.