2024-08-18: Second meeting on Friday 2024-08-16 at 7pm in Notes, Trafalgar Square. At the first meeting, we went over the basic model theory and proof theory that's required to understand the statement of Löb's theorem, and stepped through the proof; but we haven't got close to the meat of what I want to understand yet. (More detailed notes from the first meeting.)
2024-08-14: First meeting on Friday 2024-08-16 at 7pm in Notes, Trafalgar Square. [Details].
2024-07-25: I'm organising a little reading group, undecided between in-person in London or remote. My own questions I want to get the answers to are in https://www.patrickstevens.co.uk/posts/2024-07-25-lob-theorem/ . Writing this here so we have a place to start coordinating how we do this. I'm away on holiday until next Sunday so won't be doing anything about the organisation immediately (will just be trying to understand the material myself in the meantime).
Kickstarter status: about a dozen people have expressed interest, in response to my open call for interest in London.
Prerequisites I'll be expecting of people coming in:
* Be fluent and familiar with Peano arithmetic
* Grok the difference between provability and truth
* Have been at one point capable of precisely stating Gödel's incompleteness theorems, even if you no longer could state them correctly now.
* Have at least read [the paper](https://intelligence.org/files/ProgramEquilibrium.pdf) even if you haven't understood it at all; know the crucial property of an ideal agent that it's trying to construct
* Have spent a little time musing on the nature of Löb's theorem (for which https://www.lesswrong.com/posts/ALCnqX6Xx8bpFMZq3/the-cartoon-guide-to-loeb-s-theorem may be helpful, although personally I found Wikipedia's proof much easier to follow); perhaps still have your brain twisted into pretzels trying to understand what it actually means.
I might end up doing a single session beforehand covering some of the prerequisites, if people are interested in that; comment, or find and upvote an existing comment, if so.
Notes from second meeting
Answer to exercise "What's the sentence you get from the diagonal lemma, L iff 'L has length less-than-or-equal-to 1000'?": len("replace('len(x) <= 1000", 'x' -> "replace('len(x) <= 1000', 'x' -> x)")") <= 1000. I think.
We went through the Smullyan paper and got up to the moment before "The Stability Predicament". The point of the paper turns out to be "if we interpret the box modality as 'belief', then a bunch of English sentences sound weird". The real moral of the paper is that it's awfully hard to be a knight or a knave on a knights-and-knaves island! (Making any statement about my beliefs involves extrapolating my entire future mind-state, for example, so only an extremely powerful knight or knave can make such a statement.) A bunch of the paper only requires that the reasoner believe the island to be a knight-knave island, but the reasoner is thereby making an extremely strong assumption.
Brief aside where I got confused about whether a symmetry argument meant we could never distinguish between knights and knaves: of course we can, because a knight can prove knighthood by saying "0 = 0".
After some thought, we eventually decided that Smullyan's "you will never believe I'm a knight" is equally well served by "you will eventually believe I'm a knave", although this wasn't obvious. The former is a statement about all of time; the latter is a statement only about an instant in time. The reasoner is assumed to never believe a false statement, though, so in fact once they believe something, it must be true.
General consensus that the notation of "type 1/2/3/4" is deeply annoying. I consistently refer to Type 1 as "deductively closed", and then types 2/3/4 are precisely adding one each of the Hilbert-Bernays properties.
Question about what the semantics are, if "belief" (= proof) is the syntax. Answer which I was mostly satisfied with at the time: we've got a sequence of statements from numbered knight-or-knaves, and then we freeze the world and get to reason about which speaker has what property, eventually assigning them a boolean "is-knight" label.
Identification of typo in paper: p348 "if there is a proposition q such that he believes q~(Bq implies q), then he will believe p"; that should read "… q~(Bq implies p)…".
Considerable confusion about why the self-fulfilling belief required the presence of a knight-or-knave at all, to say "If you ever believe I'm a knight, then the cure will work". Why couldn't the reasoner simply imagine this sentence to start off with? Answer: actually a later section makes explicit the condition under which the reasoner can imagine this sentence and hence perform all the reasoning of this sentence without a knight-or-knave: this is the "reflexive" property, that for every p there is q such that the reasoner believes q = (Bq implies p). This is of course true of PA, but that's a heavy-duty fact that requires the diagonal lemma or an extremely inspired quine.
We stopped before section 5 due to lack of time, but I at least had a lot of fun, and our waiter said it sounded very interesting!
Homework:
Interesting tidbits: