A measure-theoretic generalization of logical induction

Vanessa Kosoy

% operators that are separated from the operand by a space

% operators that require brackets

% operators that require parentheses

% Paper specific

Logical induction is defined in terms of logical sentences and theories, but its principles are applicable in much greater generality and abstraction. Indeed, one such generalization was studied under the name "universal induction." We proposed a slightly different generalization in order to model reasoning with incomplete models. Here, we describe a formalism that includes all these cases and many more, using the language of measure theory. This provides the following advantages:

The formalism is applicable to event spaces substantially different from truth assignments or bit sequences, e.g. we can consider sequences of real numbers.
The formalism treats probabilities and expectations on the same footing, rather than constructing expectations as in section 4.8 of original paper. We consider this more convenient.
In our opinion, this language is more mathematically natural than the original formalism, at least for applications unrelated to formal logic.

On the other hand, we ignore all computational considerations. Obviously these are often important, but in the study of purely information-theoretic questions the use of numerical approximations only serves to obscure.

All proofs are in the Appendix.

##Results

Fix $X$ a compact Polish space. For example, $X$ might be $O^{ω}$ or the space of propositionally consistent truth assignments in some language or $[0, 1]^{ω}$ . The role of "pricings" is served by $P (X)$ : the space of probability measures on $X$ . A market is thus a sequence ${μ_{n} \in P (X)}_{n \in N}$ . The "deductive process" is replaced by a sequence of closed sets $X = X_{0} \supseteq X_{1} \supseteq X_{2} \supseteq \dots$ A trading strategy is a continuous function $τ : P (X) \to C (X)$ , were $P (X)$ is equipped with the weak* topology (as before) and $C (X)$ is the space of continuous functions from $X$ to $R$ equipped with the uniform convergence topology. Here, we should think of $τ (μ)$ as the share portfolio acquired by the strategy given pricing $μ$ , where the cost of the acquisition is understood to be $E_{μ} [τ (μ)]$ while the ultimate value of the portfolio is $τ (μ) (x)$ (in the following we will use the less clumsy notation $τ (μ, x)$ ; in fact, we can equivalently define $T (X)$ as the set of continuous functions from $P (X) \times X$ to $R$ ) for some $x \in ⋂_{n} X_{n}$ . We denote the set of trading strategies by $T (X)$ . A trader is a sequence ${T_{n} : P (X)^{n} \to T (X)}_{n \in N}$ , where the functions can be arbitrary (don't have to be continuous in any sense). The argument of these functions refers to the market pricings on previous days.

Analogously to Lemma 5.1.1 in "Logical Induction" (existence of "market maker"), we have:

#Proposition 1

For any $τ \in T (X)$ , there is $μ \in P (X)$ s.t.

$E_{x \sim μ} [τ (μ, x)] = max x \in X τ (μ, x)$

Analogous to Definition 5.2.1 ("budgeter"), we have:

#Proposition 2

Given $τ \in T (X)$ , define $W τ : P (X) \to C (X)$ by

$W τ (μ, x) := τ (μ, x) - E_{y \sim μ} [τ (μ, y)]$

Fix a trader $T$ . Define $Σ W T_{n} : P (X)^{n} \to C (X)$ by

$Σ W T_{n} ({μ_{m}}_{m < n}, x) := \sum m < n W T_{m} ({μ_{l}}_{l \leq m}, x)$

Define ${Σ W}_{min} T_{n} : P (X)^{n} \to R$ by

${Σ W}_{min} T_{n} ({μ_{m}}_{m < n}) := min x \in X_{n - 1} Σ W T_{n} ({μ_{m}}_{m < n}, x)$

(For $n = 0$ , the above definition is understood to mean 0)

Fix $b > 0$ . Assume $n \in N$ and ${μ_{m} \in P (X)}_{m < n}$ are s.t.

${Σ W}_{min} T_{n} ({μ_{m}}_{m < n}) > - b$

Then, we can define $N_{b} T_{n} ({μ_{m}}_{m < n}) : P (X) \to R$ by

$N_{b} T_{n} ({μ_{m}}_{m \leq n}) := max (1, max x \in X_{n} \frac{- W T_{n} ({μ_{m}}_{m \leq n}, x)}{b + Σ W T_{n} ({μ_{m}}_{m < n}, x)})^{- 1}$

Finally, define ${B_{b} T_{n} : P (X)^{n} \times P (X) \to C (X)}_{n \in N}$ by

$B_{b} T_{n} ({μ_{m}}_{m \leq n}) = {\begin{matrix} 0 if \exists m \leq n : {Σ W}_{min} T_{m} ({μ_{l}}_{l < m}) \leq - b, N_{b} T_{n} ({μ_{m}}_{m \leq n}) \cdot T_{n} ({μ_{m}}_{m \leq n}) otherwise \end{matrix}$

Then:

$B_{b} T$ is also a trader (i.e. it is continuous in the last argument).
If ${μ_{m}}_{m < n}$ is s.t. $\forall m \leq n : {Σ W}_{min} T_{m} ({μ_{l}}_{l < m}) > - b$ , then $\forall m < n : B_{b} T_{m} ({μ_{l}}_{l < m}) = T_{m} ({μ_{l}}_{l < m})$ .
${Σ W}_{min} B_{b} T_{n} \geq - b$

The analogue of Definition 5.3.2 ("trading firm") is as follows:

#Proposition 3

Consider a family of traders ${T^{k}}_{k \in N}$ and $ζ : N \times N \to [0, 1]$ s.t.

$\infty \sum k = 0 \infty \sum b = 1 ζ (b, k) b = b_{ζ} < \infty$

Define $T_{n}^{ζ}$ as follows:

$T_{n}^{ζ} = n \sum k = 0 \infty \sum b = 1 ζ (k, b) B_{b} T_{n}^{k}$

Then, the above sum is convergent and defines a trader. Moreover:

${Σ W}_{min} T_{n}^{ζ} \geq - b_{ζ}$

The analogue of Definition 5.4.1 (logical inductor construction):

#Proposition 4

Consider the setting of Proposition 3. Then, there are ${μ_{n}^{*} \in P (X_{n})}_{n \in N}$ s.t.

$E_{x \sim μ_{n}^{*}} [T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x)] = max x \in X_{n} T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x)$

Analogously to Definition 3.5.1 ("exploitation") we have:

#Definition

A market $μ$ is said to dominate a trader $T$ relatively to ${X_{n}}$ when

Denoting $i_{n} : X_{n} \to X$ the inclusion mapping, $μ_{n}$ is in the image of $i_{n *}$ .
The following set of real numbers is either unbounded from below or bounded from above:

$W (T, μ) := {Σ W T_{n + 1} ({μ_{m}}_{m \leq n}, x) ∣ n \in N, x \in X_{n}}$

Finally, the analogue of Theorem 5.4.2:

#Theorem

Consider the setting of Proposition 4, and assume that $\forall k, b : ζ (k, b) > 0$ . Then, ${i_{n *} μ_{n}^{*}}_{n \in N}$ dominates $T^{k}$ for every $k$ .

##Appendix

#Proposition A.1

Given $τ \in T (X)$ , define $E_{τ} : P (X) \times P (X) \to R$ by

$E_{τ} (ν, μ) := E_{ν} [τ (μ)]$

Then, $E_{τ}$ is continuous.

#Proof of Proposition A.1

Consider $μ_{i} \to μ$ and $ν_{i} \to ν$ . We have

$E_{ν_{i}} [τ (μ_{i})] = E_{ν_{i}} [τ (μ)] + E_{ν_{i}} [τ (μ_{i}) - τ (μ)]$

$| E_{ν_{i}} [τ (μ_{i})] - E_{ν_{i}} [τ (μ)] | \leq max | τ (μ_{i}) - τ (μ) |$

By continuity of $τ$ , $τ (μ_{i}) \to τ (μ)$ and therefore, $max | τ (μ_{i}) - τ (μ) | \to 0$ . We get

$lim i \to \infty | E_{ν_{i}} [τ (μ_{i})] - E_{ν_{i}} [τ (μ)] | = 0$

Since $ν_{i} \to ν$ and $τ (μ)$ is continuous, we have $E_{ν_{i}} [τ (μ)] \to E_{ν} [τ (μ)]$ and therefore

$lim i \to \infty E_{ν_{i}} [τ (μ_{i})] = E_{ν} [τ (μ)]$

#Proof of Proposition 1

Define $K \subseteq P (X) \times P (X)$ as follows:

$K := {(μ, ν) ∣ E_{ν} [τ (μ)] = max τ (μ)}$

For any $μ$ , denote $K (μ) := K \cap (μ \times P (X))$ . $K (μ)$ is convex due to linearity of expected value. $K (μ)$ is non-empty because given $x^{*} \in a r g m a x x \in X τ (μ, x)$ , $δ_{x} \in K (μ)$ .

Consider $μ_{i} \to μ$ and $ν_{i} \to ν$ s.t. $(μ_{i}, ν_{i}) \in K$ . We have

$E_{ν_{i}} [τ (μ_{i})] = max τ (μ_{i})$

By Proposition A.1, the left hand side converges to $E_{ν} [τ (μ)]$ . Since $τ (μ_{i}) \to τ (μ)$ , the right hand side converges to $max τ (μ)$ . We get:

$E_{ν} [τ (μ)] = max τ (μ)$

Therefore, $(μ, ν) \in K$ and $K$ is a closed set. Applying the Kakutani-Glicksberg-Fan theorem, we get the desired result.

#Proof of Proposition 2

$W τ$ is continuous by Proposition A.1. Therefore, $N_{b} T$ is continuous in the last argument and $B_{b} T$ is also continuous in the last argument.

Assume ${μ_{m} \in P (X)}_{m < n}$ is s.t.

$\forall m \leq n : {Σ W}_{min} T_{m} ({μ_{l}}_{l < m}) > - b$

Then, for any $m < n$ we are in the second case in the definition of $B_{b} T_{m} ({μ_{l}}_{l < m})$ . Moreover, we have

${Σ W}_{min} T_{m + 1} ({μ_{l}}_{l \leq m}) > - b$

$\forall x \in X_{m} : Σ W T_{m + 1} ({μ_{l}}_{l \leq m}, x) > - b$

$\forall x \in X_{m} : Σ W T_{m} ({μ_{l}}_{l < m}, x) + W T_{m} ({μ_{l}}_{l \leq m}, x) > - b$

$\forall x \in X_{m} : b + Σ W T_{m} ({μ_{l}}_{l < m}, x) > - W T_{m} ({μ_{l}}_{l \leq m}, x)$

Using the assumption again, the left hand side is positive. It follows that

$\forall x \in X_{m + 1} : 1 > \frac{- W T_{m} ({μ_{l}}_{l \leq m}, x)}{b + Σ W T_{m} ({μ_{l}}_{l < m}, x)}$

$N_{b} T_{m} ({μ_{l}}_{l \leq m}) = 1$

$B_{b} T_{m} ({μ_{l}}_{l \leq m}) = T_{m} ({μ_{l}}_{l \leq m})$

Now, fix any ${μ_{m} \in P (X)}_{m < n}$ . Let $m_{0} \in N$ be the largest number s.t. $m_{0} \leq n$ and

$\forall m \leq m_{0} : {Σ W}_{min} T_{m} ({μ_{l}}_{l < m}) > - b$

For any $m \leq m_{0}$ , we have

${Σ W}_{min} B_{b} T_{m} ({μ_{l}}_{l < m}) = {Σ W}_{min} T_{m} ({μ_{l}}_{l < m}) > - b$

(Note that the sum in the definition of ${Σ W}_{min} B_{b} T_{m}$ only involves $B_{b} T_{l}$ for $l < m \leq m_{0}$ )

For $m = m_{0} + 1$ , we have

$Σ W B_{b} T_{m_{0} + 1} ({μ_{l}}_{l \leq m_{0}}, x) = Σ W B_{b} T_{m_{0}} ({μ_{l}}_{l < m_{0}}, x) + W B_{b} T_{m_{0}} ({μ_{l}}_{l \leq m_{0}}, x)$

The first term only involves $B_{b} T_{l}$ for $l < m_{0}$ , and we are still in the first case in the definition of the second term, therefore

$Σ W B_{b} T_{m_{0} + 1} ({μ_{l}}_{l \leq m_{0}}, x) = Σ W T_{m_{0}} ({μ_{l}}_{l < m_{0}}, x) + N_{b} T_{m_{0}} ({μ_{l}}_{l \leq m_{0}}) W T_{m_{0}} ({μ_{l}}_{l \leq m_{0}}, x)$

If $x$ is s.t. $W T_{m_{0}} ({μ_{l}}_{l \leq m_{0}}, x) \geq 0$ , then

$Σ W B_{b} T_{m_{0} + 1} ({μ_{l}}_{l \leq m_{0}}, x) \geq Σ W T_{m_{0}} ({μ_{l}}_{l < m_{0}}, x)$

$Σ W B_{b} T_{m_{0} + 1} ({μ_{l}}_{l \leq m_{0}}, x) \geq {Σ W}_{min} T_{m_{0}} ({μ_{l}}_{l < m_{0}}) > - b$

If $x$ is s.t. $W T_{m_{0}} ({μ_{l}}_{l \leq m_{0}}, x) < 0$ , then

$Σ W B_{b} T_{m_{0} + 1} ({μ_{l}}_{l \leq m_{0}}, x) \geq Σ W T_{m_{0}} ({μ_{l}}_{l < m_{0}}, x) + \frac{b + Σ W T_{m_{0}} ({μ_{l}}_{l < m_{0}}, x)}{- W T_{m_{0}} ({μ_{l}}_{l \leq m_{0}}, x)} W T_{m_{0}} ({μ_{l}}_{l \leq m_{0}}, x)$

$Σ W B_{b} T_{m_{0} + 1} ({μ_{l}}_{l \leq m_{0}}) \geq - b$

We got that $Σ W B_{b} T_{m_{0} + 1} ({μ_{l}}_{l \leq m_{0}}, x) \geq - b$ for all $x \in X_{m_{0}}$ , and therefore

${Σ W}_{min} B_{b} T_{m_{0} + 1} ({μ_{l}}_{l \leq m_{0}}) = min x \in X_{m_{0}} Σ W B_{b} T_{m_{0} + 1} ({μ_{l}}_{l \leq m_{0}}, x) \geq - b$

Finally, consider $m > m_{0} + 1$ .

$Σ W B_{b} T_{m} ({μ_{l}}_{l < m}, x) = Σ W B_{b} T_{m - 1} ({μ_{l}}_{l < m - 1}, x) + W B_{b} T_{m - 1} ({μ_{l}}_{l < m}, x)$

Now we are in the first case in the definition of $W B_{b} T_{m - 1}$ , therefore the second term vanishes.

$Σ W B_{b} T_{m} ({μ_{l}}_{l < m}, x) = Σ W B_{b} T_{m - 1} ({μ_{l}}_{l < m - 1}, x)$

By induction on $m$ , we conclude:

${Σ W}_{min} B_{b} T_{m} ({μ_{l}}_{l < m}) \geq {Σ W}_{min} B_{b} T_{m - 1} ({μ_{l}}_{l < m - 1}) \geq - b$

#Proposition A.2

If $X, Y$ are compact Polish spaces and $f : X \times Y \to R$ is continuous, then $F : X \to C (Y)$ defined by $F (x) (y) := f (x, y)$ is continuous.

#Proof of Proposition A.2

We already proved an equivalent proposition: see "Proposition A.1" here.

#Proof of Proposition 3

The definition of $B_{b}$ implies that

$| B_{b} T_{n}^{k} ({μ_{m}}_{m \leq n}, x) | \leq | T_{n}^{k} ({μ_{m}}_{m \leq n}, x) |$

Observe that

$Z (k) := \infty \sum b = 1 ζ (k, b) \leq \infty \sum b = 1 ζ (k, b) b < \infty$

As a result, the definition of $T_{n}^{ζ}$ is pointwise absolutely convergent:

$\infty \sum b = 1 ζ (k, b) | B_{b} T_{n}^{k} ({μ_{m}}_{m \leq n}, x) | \leq Z (k) | T_{n}^{k} ({μ_{m}}_{m \leq n}, x) |$

Moreover, this series converges uniformly absolutely in $μ_{n}$ and $x$ :

$\infty \sum b = b_{0} ζ (k, b) | B_{b} T_{n}^{k} ({μ_{m}}_{m \leq n}, x) | \leq max \begin{matrix} ν \in P (X) y \in X \end{matrix} | T_{n}^{k} ({μ_{m}}_{m < n}, ν, y) | \infty \sum b = b_{0} ζ (k, b) \to b_{0} \to \infty 0$

By the uniform limit theorem, the series defines a continuous function of $μ_{n}$ and $x$ . By Proposition A.2, it follows that $T^{ζ}$ is a trader.

Now, let's examine $Σ W T^{ζ}$ :

$Σ W T_{n}^{ζ} ({μ_{m}}_{m \leq n}, x) = Σ W n \sum k = 0 \infty \sum b = 1 ζ (k, b) B_{b} T_{n}^{k} ({μ_{m}}_{m \leq n}, x)$

Since convergence is uniform absolute, the sum commutes with $Σ W$ .

$Σ W T_{n}^{ζ} ({μ_{m}}_{m \leq n}, x) = n \sum k = 0 \infty \sum b = 1 ζ (k, b) Σ W B_{b} T_{n}^{k} ({μ_{m}}_{m \leq n}, x)$

By Proposition 2:

$Σ W T_{n}^{ζ} ({μ_{m}}_{m \leq n}, x) \geq - n \sum k = 0 \infty \sum b = 1 ζ (k, b) b = - ζ_{b}$

#Proof of Proposition 4

We construct $μ_{n}^{*}$ recursively in $n$ . Define $τ_{n} \in T (X_{n})$ by

$τ_{n} (ν, x) := T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m < n}, i_{n *} ν, x)$

(pushforward is obviously continuous in the weak* topology)

Now construct $μ_{n}^{*}$ by applying Proposition 1 to $τ_{n}$ .

#Proof of Theorem

We have

$W T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) = T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) - E_{y \sim μ_{n}^{*}} [T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, y)]$

By definition of $μ^{*}$

$W T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) = T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) - max y \in X_{n} T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, y)$

$\forall x \in X_{n} : W T_{n}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) \leq 0$

$\forall x \in X_{n} : Σ W T_{n + 1}^{ζ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) \leq 0$

Fix $k \in N$ and assume $inf W (T^{k}, i_{*} μ^{*}) > - b$ for some $b > 0$ (otherwise $T^{k}$ is dominated). Define $ξ : N \times N \to [0, 1]$ by

$ξ (j, c) := {\begin{matrix} ζ (j, c) when (j, c) \neq (k, b) 0 when (j, c) = (k, b) \end{matrix}$

We get $T_{n}^{ζ} = T_{n}^{ξ} + [[n \geq k]] ζ (k, b) B_{b} T^{k}$ and therefore

$\forall x \in X_{n} : Σ W T_{n + 1}^{ξ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) + [[n \geq k]] ζ (k, b) Σ W B_{b} T_{n + 1}^{k} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) \leq 0$

By Proposition 2 and the definition of $b$ , we can remove $B_{b}$ in the second term.

$\forall x \in X_{n} : Σ W T_{n + 1}^{ξ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) + [[n \geq k]] ζ (k, b) Σ W T_{n + 1}^{k} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) \leq 0$

$\forall x \in X_{n} : [[n \geq k]] ζ (k, b) Σ W T_{n + 1}^{k} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) \leq - Σ W T_{n + 1}^{ξ} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x)$

By Proposition 3, the right hand side is bounded from above by $b_{ξ}$ , therefore

$\forall x \in X_{n} : [[n \geq k]] ζ (k, b) Σ W T_{n + 1}^{k} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x) \leq b_{ξ}$

$sup W (T^{k}, i_{*} μ^{*}) \leq max (\frac{b_{ξ}}{ζ (k, b)}, max \begin{matrix} n < k x \in X_{n} \end{matrix} Σ W T_{n + 1}^{k} ({i_{m *} μ_{m}^{*}}_{m \leq n}, x))$

LESSWRONG
LW

LESSWRONG
LW

6

A measure-theoretic generalization of logical induction

6

Ω 3

6

Ω 3

6

Ω 3