Tarski's truth sentences and MIRI's AI

halcyon

(Disclaimer: I have no training in or detailed understanding of these subjects. I first heard of Tarski from the Litany of Tarski, and then I Googled him.)

In his paper The Semantic Conception of Truth, Tarski says that he analyzes the claim, '"Snow is white" is true if and only if snow is white' as being expressed in two different languages. The whole claim in single quotes is expressed in a metalanguage, while "snow is white" is in another language.

For Tarski's proof to succeed, it is (if I understood him correctly) both necessary and sufficient for the metalanguage to be logically richer than the other language in certain ways. What these ways are is, according to Tarski, difficult to make general statements about without actually following his very involved technical proof.

If I remember correctly, this implies that the two languages cannot be identical. Tarski seems to be of the opinion that for a given language satisfying specific conditions, concepts of truth, synonymy, meaning, etc. can be defined for it in a metalanguage that is richer than it in logical devices, establishing a hierarchy of truth defining languages.

My main question is, since MIRI aims to mathematically prove Friendliness in recursively self-improving AI, is "essential richness" in language handling ability something we should expect to see increasing in the class of AIs MIRI is interested in, or is that unnecessary for MIRI's purposes? I understand that semantically defining truth and meaning may not be important either way. My principal motive is curiosity.

(Disclaimer: I have no training in or detailed understanding of these subjects. I first heard of Tarski from the Litany of Tarski, and then I Googled him.)

The Definability of Truth paper says that Kleene's logic makes it difficult to judge which statements are undefined because that answer also comes out as undefined. Does this mean the probabilistic approach adopted by MIRI is capable of separating cases where the truth of a statement is not infinitely certain because of purely verbal paradoxes from statements whose truth is probabilistic for other reasons? In particular, I'm interested to know whether it can discriminate between those and scientifically interesting paradoxes, but it's too soon to be asking questions like that if I'm not mistaken.

It is possible to construct probabilistic logics to normatively characterize the behavior of ideal goal-oriented agents, but the actual human brain probably strings together all sorts of partial, ad hoc, redundant and/or multiply realized implementations of abstract languages in a variety of ways. It is difficult to prove that an intelligence with an architecture like that will never do certain things in the future. In fact, it is probably a better idea to model a given brain physically than to describe the abstract mathematical reasoning followed by its workings, because the relevant wiring actually changes over time, and the same calculation could be performed in different ways.

It occurs to me that humans might learn languages with all sorts of "essential richness" by generalizing from the rules needed to achieve certain tasks. We may be born with the potential to learn some of these languages in this way, but can an AI running a pure probabilistic logic learn to generalize other abstract languages? It may not need to, mind you.

2

Tarski's truth sentences and MIRI's AI

2

2

2

Tarski's truth sentences and MIRI's AI

2

2