This is the third of three sets of fixed point exercises. The first post in this sequence is here, giving context.
Note: Questions 1-5 form a coherent sequence and questions 6-10 form a separate coherent sequence. You can jump between the sequences.
-
Let be a complete metric space. A function is called a contraction if there exists a such that for all , . Show that if is a contraction, then for any , the sequence converges. Show further that it converges exponentially quickly (i.e. the distance between the th term and the limit point is bounded above by for some )
-
(Banach contraction mapping theorem) Show that if is a complete metric space and is a contraction, then has a unique fixed point.
-
If we only require that for all , then we say is a weak contraction. Find a complete metric space and a weak contraction with no fixed points.
-
A function is convex if , for all and . A function is strongly convex if you can subtract a positive parabaloid from it and it is still convex. (i.e. is strongly convex if is convex for some .) Let be a strongly convex smooth function from to , and suppose that the magnitude of the second derivative is bounded. Show that there exists an such that the function given by is a contraction. Conclude that gradient descent with a sufficiently small constant step size converges exponentially quickly on a strongly convex smooth function.
-
A finite stationary Markov chain is a finite set of states, along with probabilistic rule for transitioning between the states, where represents the space of probability distributions on . Note that the transition rule has no memory, and depends only on the previous state. If for any pair of states , the probability of passing from to in one step is positive, then the Markov chain is ergodic. Given an ergodic finite stationary Markov chain, use the Banach contraction mapping theorem to show that there is a unique distribution over states which is fixed under application of transition rule. Show that, starting from any state , the limit distribution exists and is equal to the stationary distribution.
-
A function from a partially ordered set to another partially ordered set is called monotonic if implies that . Given a partially ordered set with finitely many elements, and a monotonic function from to itself, show that if or , then is a fixed point of for all .
-
A complete lattice is a partially ordered set in which each subset of elements has a least upper bound and greatest lower bound. Under the same hypotheses as the previous exercise, extend the notion of for natural numbers to for ordinals , and show that is a fixed point of for all with or and all ( means there is an injection from to , and means there is no such injection).
-
(Knaster-Tarski fixed point theorem) Show that the set of fixed points of a monotonic function on a complete lattice themselves form a complete lattice. (Note that since the empty set is always a subset, a complete lattice must be nonempty.)
-
Show that for any set , forms a complete lattice, and that any injective function from to defines a monotonic function from to . Given injections and , construct a subset of and a subset of of such that and .
-
(Cantor–Schröder–Bernstein theorem) Given sets and , show that if and , then . ( means there is an injection from to , and means there is a bijection)
Please use the spoilers feature - the symbol '>' followed by '!' followed by space -in your comments to hide all solutions, partial solutions, and other discussions of the math. The comments will be moderated strictly to hide spoilers!
I recommend putting all the object level points in spoilers and including metadata outside of the spoilers, like so: "I think I've solved problem #5, here's my solution <spoilers>" or "I'd like help with problem #3, here's what I understand <spoilers>" so that people can choose what to read.
Tomorrow's AI Alignment Forum Sequences post will be "Approval-directed agents: overview" by Paul Christiano in the sequence Iterated Amplification.
The next post in this sequence will be released on Saturday 24th November, and will be 'Fixed Point Discussion'.
(The second half made me realize how much more comfortable I am with abstract exercises than with regular Analysis à la Ex4.)
Ex6
If f(x)≤x, then all fn(x) are comparable to each other: we have
f(x)≤x⟹f(f(x))≤f(x)⟹f3(x)≤f2(x)
and so on. Furthermore, if n∈N is such that fn(x)≠fn−1(x), then fk(x)≠fk−1(x) for all k∈[n] as well (verify by looking at the contrapositive). Consequently (set n:=|P|), if fn(x) were not a fixed point of f, then fn+1(x)<fn(x), and hence {x,f(x),f2(x),...,fn+1(x)}⊆P, which means P would have |P|+2 elements.
If x≤f(x), we get x≤f(x)≤f2(x) and so on, leading to the same argument.
Ex7
Wlog, assume x≤f(x). Set f0(x):=x. Given any non-limit ordinal β, we find a predecessor α and set fβ(x):=f(fα(x)). Given any limit ordinal ω, we set fω(x):=sup{fα(x)|α∈ω}.
Suppose this doesn't define fα(x) for all ordinals α. Then, there is some smallest ordinal α∗ such that fα∗(x) is not defined. This immediately yields a contradiction (regardless of whether α∗ is a limit ordinal or not).
We want this construction to have the properties that f(x)=x⟹fβ(x)=x and that x≤y⟹fβ(x)≤fβ(y). Thus, let x,y∈L and β be an ordinal. If β has a predecessor, the check for both properties are easy. If not, then fβ(x)=sup{fα(x)|α∈β} and fβ(y)=sup{fα(y)|α∈β}. Then, for the first property, note that the upper-bound of a one-element set is just the element itself. For the second, note that each element in the first set is smaller than some element in the second set, so fβ(y) is an upper-bound for the first set, which implies that fβ(x)≤fβ(y) since fβ(x) is the lowest upper-bound.
Now, given an ordinal α, our construction defines a function ϕ:α→L. If f(fα(x))≠fα(x), then the chain doesn't become stationary at any earlier point either (to verify, take a smallest α such that [f(fα(x))≠fα(x) but the chain is stationary for smaller ordinals] and derive a contradiction), and hence ϕ is injective, proving that α≤|L|. (This is the generalized version of the argument from Ex6.)
Ex8
Let f:L→L be monotonic and let L′ be the set of fixed points of f. Then L′ inherits the partial order from L; what needs doing is verify the least upper-bound property. So let X⊆L′. Then, X has a least upper-bound u in L.
Let ω be some ordinal with ω>|L|. From the previous exercise, we know that f(fω(u))=fω(u). Choose the smallest α such that f(fα(u))=fα(u). Then, fα(u)∈L′ and x≤u≤fα(u), hence fα(u) is an upper-bound of X.
It remains to show that it is the least upper-bound. Thus, let u′∈L′ be another upper-bound of x. Then, u≤u′ in L, hence fα(u)≤fα(u′)=u′ (apply Ex7).
Ex9
A least upper-bound is obtained via ⋃ on all sets, and the greatest lower-bound via ⋂. (Easy checks.) Given any function f:A→B, we trivially have X⊆Y⟹f(X)⊆f(Y); injectivity is not needed.
We define
We need to verify that g(B−f(A′))=A−A′, then A′ and f(A′) are the desired sets.
x∈A(k−1)−A(k)=g(B−f(A(j)))⊆g(B−f(A′))
Ex10
Let A′ be the set constructed in Ex9. Then, we can define a bijection ϕ:A→B via
ϕ:x↦{f(x)x∈A′g−1(x)x∉A′