All of Johnicholas's Comments + Replies

If humans are bad at mental arithmetic, but good at, say, not dying - doesn't that suggest that, as a practical matter, humans should try to rephrase mathematical questions into questions about danger?

E.g. imagine stepping into a field crisscrossed by dangerous laser beams arranged according to the prime numbers, in order to get something valuable. I think someone who had a realistic fear of the laser beams, and a realistic understanding of the benefit of that valuable thing, would slow down and/or stop stepping into suspicious spots.

Quantifying is ONE technique, and it's bee... (read more)

7Swimmer963 (Miranda Dixon-Luinenburg)
I don't think this would help at all. Humans have built-in systems to respond to danger shaped like a tiger or a snake, or to learned stimuli, like when I see a patient go into a lethal arrhythmia on the heart monitor. This programmed response to danger pumps you full of adrenaline and makes you very motivated to run very fast, or work very hard at some skill that you've practiced over and over. Elite athletes perform better under the pressure of competition; beginners perform worse. An elite mathematician might do math faster if they felt they were in danger, but an elite mathematician is probably motivated to do mental arithmetic in the first place. I place around 95% confidence that a generic bad-at-mental-arithmetic human would perform worse if they felt they were in danger than if they were in a safe classroom environment. If a patient is in cardiac arrest, I'm incredibly motivated to do something about it, but I don't trust my brain with even the simplest mental arithmetic. (Which is irritating, actually.) This doesn't address the reward part of your situation, the "something valuable" at the end of the road. Without the danger, or with some mild thrill-adding danger, this might be a workable idea.

Steve Keen's Debunking Economics blames debt, not automation.

Essentially, many people currently feel that they are deep in debt, and work to get out of debt. Keen has an ODE model of the macroeconomy that shows various behaviors, including debt-driven crashes.

Felix Martin's Money goes further and argues that strong anti-inflation stances by central bank regulators strengthen the hold of creditors over debtors, which has made these recent crashes bigger and more painful.

0[anonymous]
Having read Debunking Economics, I second this. The ODE model is pretty interesting, actually. Among other things, the thesis is that the second derivative of debt has a strong impact on aggregate demand. You can enhance aggregate demand (intrinsically, without a central authority, as people like having stuff) with an accelerating rate of debt accumulation, which both eventually causes problems on its own in terms of instability and distribution of money within the population, and pulls aggregate demand backwards from the future when the debt must be repaid. Such accelerating debt also lets you maintain exponential growth for longer if you are faced with external limits on the rate of expansion of the real economy, because more and more of the money is in the form of financial instruments (which can effectively be wished out of existence) rather than tied to physical capital. It keeps an illusory boom going long after its physical basis is gone.
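A rough rendering of that thesis in symbols (my notation, a sketch rather than Keen's full ODE system): if aggregate demand is income plus the flow of new debt,

$$AD(t) \approx Y(t) + \frac{dD}{dt}, \qquad \frac{d\,AD}{dt} \approx \frac{dY}{dt} + \frac{d^2D}{dt^2},$$

so the change in aggregate demand is driven by the acceleration (second derivative) of debt - and a boom financed this way persists only while debt accumulation keeps accelerating.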

The statements, though contradictory, refer to two different thought experiments.

[This comment is no longer endorsed by its author]

The two comments, though contradictory, refer to two different thought experiments.

0[anonymous]
I see. Thanks for the explanation.

Is it reasonable to take this as evidence that we shouldn't use expected utility computations, or not only expected utility computations, to guide our decisions?

If I understand the context, the reason we believed an entity, either a human or an AI, ought to use expected utility as a practical decision-making strategy is that it would yield good results (a simple, general architecture for decision making). If there are fully general attacks (muggings) on all entities that use expected utility as a practical decision-making strategy, then perhaps we shou... (read more)

Magic Haskeller and Augustsson's Djinn are provers (or, to say it another way, are comprehensible as provers, or isomorphic to provers). They attempt to prove the proposition, and if they succeed they output the term corresponding (via the Curry-Howard isomorphism) to the proof.

I believe they cannot output a term t :: a->b because there is no such term, because 'anything implies anything else' is false.

The type constructors that you're thinking of are Arrow and Int. Forall is another type constructor, for constructing generic polymorphic types. Some types such as "Forall A, Forall B, A -> B" are uninhabited. You cannot produce an output of type B in a generic way, even if you are given access to an element of type A.
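A short Haskell illustration of both halves (my examples, not output from either tool):

```haskell
-- Inhabited: a -> Int, where the result type is specific and inhabited.
-- A synthesizer like Djinn can find such a term by ignoring its argument.
constInt :: a -> Int
constInt _ = 42

-- Uninhabited: the fully generic a -> b. No total term has this type,
-- because under Curry-Howard it would prove "anything implies anything":
-- impossible :: a -> b
-- impossible x = ...   -- nothing total and terminating can go here
```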

The type corresponding to a proposition like "all computable functions from the reals to the reals are continuous" looks like a function type consuming some representation of "a computable function" and produc... (read more)

-2abramdemski
...except in Haskell, where we can make functions for generating instances given a desired type. (Such as Magic Haskeller.) I still feel like it's accurate to say that general-purpose languages correspond to trivialism. Thanks, though! I didn't think of the generic version. So, in general, inhabited types all act like "true" in the isomorphism. (We have A->B wherever B is a specific inhabited type.) I'm fully aware of that, and I enjoy the correspondence. What bothers me is that people seem to go from a specific isomorphism (which has been extended greatly, granted) to a somewhat ill-defined general principle. My frustration comes largely from hearing mistakes. One extreme example is thinking that (untyped!) lambda calculus is the universal logic (capturing first-order logic and any higher-order logic we might invent; after all, it's Turing complete). Lesser cases are usually just ill-defined invocations (i.e., guessing that the isomorphism might be relevant in cases where it's not obvious how it would be).

I think you may be sincerely confused. Would you please reword your question?

If your question is whether someone (either me or the OP) has committed a multiplication error - yes, it's entirely possible, but multiplication is not the point - the point is anthropic reasoning and whether "I am a Boltzmann brain" is a simple hypothesis.

The arithmetical hierarchy is presuming a background of predicate logic; I was not presuming that. Yes, the type theory that I was gesturing towards would have some similarity to the arithmetical hierarchy.

I was trying to suggest that the answer to "what is a prediction" might look like a type theory of different variants of a prediction. Perhaps a linear hierarchy like the arithmetical hierarchy, yes, perhaps something more complicated. There could be a single starting type "concrete prediction" and a type constructor that, given sour... (read more)

Perhaps there is a type theory for predictions, with concrete predictions like "The bus will come at 3 o'clock", and functions that output concrete predictions, like "Every Monday, Wednesday and Friday, the bus will come at 3 o'clock" (consider the statement as a function taking a time and returning a concrete yes-or-no prediction).

An ultrafinitist would probably not argue with the existence of such a function, even though to someone other than an ultrafinitist, the function looks like it is quantifying over all time. From the ultrafinitist's point of view, you're going to apply it to some concrete time, and at that point it's going to output some concrete prediction.
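A minimal Haskell sketch of that function-as-prediction idea (the types and names are mine, purely illustrative):

```haskell
-- A concrete prediction: the bus does (or does not) come at 3 o'clock.
newtype ConcretePrediction = BusComesAtThree Bool
  deriving Show

data Day = Monday | Tuesday | Wednesday | Thursday | Friday
  deriving (Eq, Show)

-- The schedule only looks like it quantifies over all time: applied to
-- any concrete day, it returns a concrete prediction, which is all the
-- ultrafinitist ever asks of it.
weekdaySchedule :: Day -> ConcretePrediction
weekdaySchedule d = BusComesAtThree (d `elem` [Monday, Wednesday, Friday])
```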

1cousin_it
Ultrafinitism doesn't seem relevant here. The "type theory" you're thinking of seems to be just the arithmetical hierarchy. A "concrete prediction" is a statement with only bounded quantifiers, "this Turing machine eventually halts" is a statement with one existential quantifier, "this Turing machine halts for every input" is a statement with one universal and one existential quantifier, and so on. Or am I missing something?
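In standard notation (with a decidable predicate $\mathrm{Halts}(M,x,t)$ for "machine $M$ on input $x$ has halted within $t$ steps"), those two examples sit at

$$\Sigma^0_1:\ \exists t\,\mathrm{Halts}(M,x_0,t) \qquad \text{and} \qquad \Pi^0_2:\ \forall x\,\exists t\,\mathrm{Halts}(M,x,t).$$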

The prevalence of encodings means that we might not be able to "build machines with one or the other". That is, given that there are basic alternatives A and B and A can encode B and B can encode A, it would take a technologist specializing in hair-splitting to say whether a machine that purportedly is using A is "really" using A at its lowest level or whether it is "actually" using B and only seems to use A via an encoding.

If in the immediate term you want to work with many-sorted first order logic, a reasonable first step wo... (read more)

3abramdemski
In this case, that's not true. The many-sorted logic, with axioms and all to emulate 2nd-order logic, has different properties than plain 1st-order logic (even though we may be emulating it in a plain 1st-order engine). For example, in 2nd-order logic, we can quantify over any names we use. In 1st-order logic, this is not true: we can quantify over 1st-order entities, but we are unable to quantify over 2nd-order entities. So, we can have named entities (predicates and relations) which we are unable to quantify over. One consequence of this is that, in 1st-order logic, we can never prove something non-tautological about a predicate or relation without first making some assumptions about that specific predicate or relation. Statements which share no predicates or relations are logically independent! This limits the influence of concepts on one another, to some extent. This sort of thing does not hold for 2nd-order logic, so its translation doesn't hold for the translation of 2nd-order logic into 1st-order logic, either. (Basically, all statements in this many-sorted logic will use the membership predicate, which causes us to lose the guarantee of logical independence for other named items.) So we have to be careful: an encoding of one thing into another thing doesn't give us everything.

It seems to me like this discussion has gotten too far into either/or "winning". The prevalence of encodings in mathematics and logic (e.g. encoding the concept of pairing in set theory by defining (a, b) to be the set {{a}, {a, b}}, the double-negation encoding of classical logic inside intuitionistic logic, et cetera) means that the things we point to as "foundations", such as the ZF axioms for set theory, are not actually foundational, in the sense of necessary and supporting. ZF is one possible system which is sufficiently strong to enc... (read more)
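For instance, one standard presentation of that double-negation (Gödel-Gentzen) translation double-negates atoms and rewrites disjunctions and existentials:

$$P^N = \lnot\lnot P \ (P \text{ atomic}), \qquad (A \lor B)^N = \lnot(\lnot A^N \land \lnot B^N), \qquad (\exists x\,A)^N = \lnot\forall x\,\lnot A^N,$$

with $\land$, $\to$ and $\forall$ translated componentwise; classical logic proves $A$ exactly when intuitionistic logic proves $A^N$.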

0abramdemski
I agree that we should weigh possible foundations against desired results and respect multiple possibilities as you say. However, we need a formalization of this. It might be that 1st order vs 2nd order is not important. I would suggest, however, that the puzzle I presented in the post is important. The proof-theoretic structure of 1st vs 2nd order might not be a big deal. (A learning system which prefers compact first-order theories can learn the desired many-sorted logic.) The structure of reasonable probabilistic beliefs over these two domains, though, is another thing yet! (A learner which prefers compact first-order theories cannot mimic the desired behavior which I described.) You won't automatically get the desired behavior by constructing some sort of intuitive learner based on informal principles. So, we need to discuss formalism.

Feeling our way into a new formal system is part of our (messy, informal) pebblecraft. Sometimes people propose formal systems starting with their intended semantics (roughly, model theory). Sometimes people propose formal systems starting with introduction and elimination rules (roughly, proof theory). If the latter, people sometimes look for a semantics to go along with the syntax (and vice versa, of course).

For example, lambda calculus started with rules for performing beta reduction. In talking about lambda calculus, people refer to it as "functio... (read more)

6Vladimir_Nesov
Nice. This seems like a surprisingly well-motivated way of reducing everything to physics: there's just "syntactic" machinery made out of physics, and any semantics that might be attributed to parts of this machinery is merely a partially informal device (i.e. a collection of cognitive skills) that human mathematicians might use as an aid for reasoning about the machinery. Even when the machinery itself might in some sense be said to be semantically reasoning about something or other, this description of the machinery can be traced back to human mathematicians who use it as a partially informal device for understanding the machinery, and so it won't strictly speaking be a property of the machinery itself. In other words, in this view semantics is an informal art primarily concerned with advancement of human understanding, and it's not fundamental to the operation of intelligence in general, it's not needed for properly designing things, responding to observations or making decisions, any more than curiosity or visual thinking.
0abramdemski
Ok, right. I think I didn't fully appreciate your point before. So the fact that a particular many-sorted first-order logic with extra axioms is proof-theoretically equivalent to (a given proof system for) 2nd-order logic should make me stop asking whether we should build machines with one or the other, and start asking instead whether the many-sorted logic with extra axioms is any better than plain first-order logic (to which the answer appears to be yes, based on our admittedly 2nd-order reasoning). Right?

This and previous articles in this series emphasize attaching meaning to sequences of symbols via discussion and gesturing toward models. That strategy doesn't seem compatible with your article regarding sheep and pebbles. Isn't there a way to connect sequences of symbols to sheep (and food and similar real-world consequences) directly via a discipline of "symbolcraft"?

I thought pebblecraft was an excellent description of how finitists and formalists think about confusing concepts like uncountability or actual infinity: Writing down "... is ... (read more)

2abramdemski
I think I agree. This gives me the feeling that every time the discussion revolves around models, it is getting off the point. We can touch proof systems. We can't touch models. We can say that models are part of our pebblecraft... The question, though, is whether we should trust particular parts of our pebblecraft. Should we prefer to work with first-order logic, or 2nd-order logic? Should we believe in such a thing as a standard model of the natural numbers? Should we trust proofs which make use of those concepts?

Mathematics and logic are part of a strategy that I'll call "formalization". Informal speech leans on (human) biological capabilities. We communicate ideas, including ideas like "natural number" and "set" using informal speech, which does not depend on definitions. Informal speech is not quite pointing and grunting, but pointing and grunting is perhaps a useful cartoon of it. If I point and grunt to a dead leaf, that does not necessarily pin down any particular concept such as "dead leaves". It could just as well ind... (read more)

I'm concerned that you're pushing second-order logic too hard, using a false fork: such-and-so cannot be done in first-order logic, therefore second-order logic. "Second-order" logic is a particular thing - for example, it is a logic based on model theory. http://en.wikipedia.org/wiki/Second-order_logic#History_and_disputed_value

There are lots of alternative directions to go when you go beyond the general consensus of first-order logic. Freek Wiedijk's paper "Is ZF a hack?" is a great tour of alternative foundations of mathematics - firs... (read more)

If you taboo one-predicate 'matter', please specialize the two-place predicate (X matters to Y) to Y = "the OP's subsequent use of this article", and use the resulting one-place predicate.

I am not worried about apparent circularity. Once I internalized the Löwenheim-Skolem argument that first-order theories have countable "non-standard" models, then model theory dissolved for me. The syntactical / formalist view of semantics, that what mathematicians are doing is manipulating finite strings of symbols, is always a perfectly good model, ... (read more)

Does it matter if you don't have formal rules for what you're doing with models?

Do you expect what you're doing with models to be formalizable in ZFC?

Does it matter if ZFC is a first-order theory?

-1Qiaochu_Yuan
"Does it matter if X" is not a question; "matter" is a two-place predicate (X matters to Y). What you seem to be worried about is the following: you need some set theory to talk about models of first-order logic. ZFC is a common axiomatization of set theory. But ZFC is itself a first-order theory, so it seems circular to use ZFC to talk about models of first-order logic. But if this is what you're worried about, you should just say so directly.

It may not be possible to draw a sharp line between the things that exist and the things that do not exist. Surely there are problematic referents ("the smallest triple of numbers in lexicographic order such that a^3+b^3=c^3", "the historical Jesus", "the smallest pair of numbers in lexicographic order such that a^3+24=c^2", "Shakespeare's firstborn child") that need considerable working with before ascertaining that they exist or do not exist. Given that difficulty, it seems like we work with existence explicitly, as a... (read more)
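As an aside, one of those referents yields to a little explicit work - a bounded search (mine, purely illustrative) over a^3 + 24 = c^2:

```haskell
-- Bounded search for pairs (a, c) with a^3 + 24 = c^2.
-- (1, 5) appears immediately (1 + 24 = 25), so that referent does exist;
-- the Fermat triple, by contrast, never will.
solutions :: [(Integer, Integer)]
solutions = [ (a, c) | a <- [1 .. 100], c <- [1 .. 1001], c * c == a ^ 3 + 24 ]
```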

One model for time travel might be a two-dimensional piece of paper with a path or paths drawn wiggling around on it. If you scan a "current moment" line across the plane, then you see points dancing. If a line and its wiggles are approximately perpendicular to the line of the current moment, then the dancing is local and perhaps physical. Time travel would be a sigmoid line: first a "spontaneous" creation of a pair of points, then the cancellation of one ("reversed") point with the original point.

An alternative story is of a li... (read more)

I understand your point - it's akin to the Box quote "all models are wrong but some are useful" - when choosing among (false) models, choose the most useful one. However, it is not the case that stronger assumptions are more useful - of course stronger assumptions make the task of proving easier, but the task as a whole includes both proving and also building a system based on the theorems proven.

My primary point is that EY is implying that second-order logic is necessary to work with the integers. People work with the integers without using seco... (read more)

2Will_Sawin
My primary point is actually that I don't care if math is useful. Math is awesome. This is obviously an extremely rare viewpoint, but very common among mathematicians. But I do agree with that quote, more or less. I think that potentially some models are true, but those models are almost certainly less useful for most purposes than the crude and easy-to-work-with approximations. I agree that second-order logic is not necessary to work with the integers. Second-order logic is necessary to work with the integers and only the integers, however. Somewhat problematically, it's not actually possible to work with second-order logic. What sort of practical tasks are you thinking of?

If you were writing software for something intended to traverse the Interplanetary Transport Network, then you would probably use charts and atlases and transition functions, and you would study (symplectic) manifolds and homeomorphisms in order to understand those more-applied concepts.

If an otherwise useful theorem assumes that the manifold is orientable, then you need to show that your practical manifold is orientable before you can use it - and if it turns out not to be orientable, then you can't use it at all. If instead you had an analogous theorem that applied to all manifolds, then you could use it immediately.

4Will_Sawin
There's a difference between assuming that a manifold is orientable and assuming something about set theory. The phase space is, of course, only approximately a manifold. On a very small level it's - well, something we're not very sure of. But all the math you'll be doing is an approximation of reality. So some big macroscopic feature like orientability would be a problem to assume. Orientability corresponds to something in physical reality, and something that clearly matters for your calculation. The axiom of choice or whatever set-theoretic assumption corresponds to nothing in physical reality. It doesn't matter if the theorems you are using are right for the situation, because they are obviously all wrong, because they are about symplectic dynamics on a manifold, and physics isn't actually symplectic dynamics on a manifold! The only thing that matters is how easily you can find a good-enough approximation to reality. More foundational assumptions make this easier, and do not impede one's approximation of reality. Note that physicists frequently make arguments that are just plain unambiguously wrong from a mathematical perspective.

If you assume A and derive B, you have not proven B but rather "A implies B". If you can instead assume a weaker axiom A', and still derive B, then you have proven "A' implies B", which is stronger because it will be applicable in more circumstances.

0Will_Sawin
In what "circumstances" are manifolds and homeomorphisms useful?

I agree with this statement - and yet you did not contradict my statement that second-order logic is also not part of mainstream mathematics.

A topologist might care about manifolds or homeomorphisms - they do not care about foundations of mathematics - and it is not the case that only one foundation of mathematics can support topology. The weaker foundation is preferable.

6Will_Sawin
The last sentence is not obvious at all. The goal of mathematics is not to be correct a lot. The goal of mathematics is to promote human understanding. Strong axioms help with that by simplifying reasoning.

Second-order logic is not part of standard, mainstream mathematics. It is part of a field that you might call mathematical logic or "foundations of mathematics". Foundations of a building are relevant to the strength of a building, so the name implies that foundations of mathematics are relevant to the strength of mainstream mathematics. A more accurate analogy would be the relationship between physics and philosophy of physics - discoveries in epistemology and philosophy of science are more often driven by physics than the other way around, and ... (read more)

4Eliezer Yudkowsky
Your view is not mainstream.
4[anonymous]
I think this is a fallacy of gray. Mathematicians have been using infinite model theory and second-order logic for a while, now; this is strong evidence that they are good and reasonable. Edit: Link formatting, sorry. I wish there was a way to preview comments before submitting....
1DSimon
I strongly recommend that, if you haven't already, you learn enough introductory calculus to understand what it means to take the limit of an expression as a variable approaches infinity. You are making a common mistake here by conflating your intuitive understanding about infinity with its meaning in stricter mathematical contexts.

My understanding is that the essay's effect is via the horror a reader feels at the alternate world presented in the essay. It opens the reader's eyes somewhat to the degree that sexism is embedded in everyday grammar and idiom. My understanding is that it is not a persuasive essay in the usual sense.

Please elaborate.

I agree that if you don't look at the numbers, but at the surrounding text, you get the sense that the numbers could be paraphrased in that way.

So does h, labeled "I hear universe", mean "I hear the universe tell me something at all", or "I hear the universe tell me that they love me", or "I hear the universe tell me what it knows, which (tacitly, according to the meaning of 'knows') is accurate"?

I thought it meant "I have a sensation as if the universe were telling me that they love me", but the highest probabi... (read more)

0MaoShan
I took h to mean "I accurately receive the information that the universe conveys"; in this case, regarding the state of my partner loving me or not, I would still accurately hear the universe, otherwise it would be not-h. Since I am considering possible states, partner-not-loving-me/universe-tells-me/me-hearing-that would be the second most likely possibility, because the other two variables are less in doubt (for the person in the example). If this person existed in real life, they would probably be frustrated, wondering why on earth it feels like their partner is trying to drive a wedge in the relationship, when obviously they are in love, because the universe can magically read their minds and the crystal auras never lie.
3DaFranker
Congratulations, you've cleared the hidden test of making sure that this isn't all just a password in your head! IMO, which one it was intended to be is irrelevant as long as you understand both cases. Understanding these things enough to be able to untangle them like this sounds like it's really the whole point of the article.

Twice in this article, there are tables of numbers. They're clearly made-up, not measured from experiment, but I don't really understand exactly how made-up they are - are they carefully or casually cooked?

Could people instead use letters (variables), with relations like 'a > b', 'a >> b', 'a/b > c' and so on in the future? Then I could understand better what properties of the table are intentional.

1MaoShan
There's nothing about the tables that was not explained in the previous installment of this series; click the links if you're still confused. I came to this knowing nothing about that type of notation, but the tables told me even more than the bubble diagrams - and here's the secret. Looking at the table tells you next to nothing. It's only when you think about the situations that the probabilities quantify that they make sense. Although, as an additional step, he could have explained each of the situations in sentence form in a paragraph, but he probably felt the table spoke for itself. The second table, for instance, (if I am interpreting correctly) can be paraphrased as: I believe that my partner loves me, and that the universe knows it, and I can get this answer from the universe. I would also know if my partner didn't love me, because the universe would know it and I would hear that. It's probably one of those two. Of course it could be that I don't hear the universe, or the universe is lying to me, or that the universe doesn't magically pick up our thoughts (how unromantic!), but I really don't believe that to be true; I only admit that it's possible. I am rational, after all.
8Kindly
In my experience, using variables instead of numbers when it's not absolutely necessary makes things ridiculously harder to understand for someone not comfortable with abstract math.

I think it would be valuable if someone pointed out that a third party watching, without controlling, a scientist's controlled study is in pretty much the same situation as the three-column exercise/weight/internet use situation - they have instead exercise/weight/control group.

This "observe the results of a scientist's controlled study" thought experiment motivates and provides hope that one can sometimes derive causation from observation, where the current story arc makes a sortof magical leap.

6Vaniver
Indeed; one way to think about this is to consider nature as a scientist whose shoulder we can look over. The leap only seems magical until you understand what the moving parts inside are. So let's try going in the reverse direction, and see if that helps make it clearer.

Suppose there are three binary variables, A, B, and C, and they are pairwise dependent on each other: that is, P(A) isn't P(A|B), but we haven't looked at P(A|BC). Alice says that A causes both B and C. Bob says that A causes B, which causes C. Charlie says that A and B both cause C. (Each of these is a minimal description of the model - any arcs not mentioned don't exist, which means there's no direct causal link between those two.) Unfortunately, A, B, and C are easy to measure but hard to influence, so running experiments is out of the question, but fortunately we have lots of observational data to do statistics on. We take a look at the models and realize that they make falsifiable predictions:

If Alice is right, then B and C should be conditionally independent given A: that is, P(B|AC)=P(B|A) and P(C|AB)=P(C|A). If Bob is right, then A and C should be conditionally independent given B: that is, P(A|BC)=P(A|B) and P(C|AB)=P(C|B). If Charlie is right, then A and B should be independent, and only become dependent given C.

We know Charlie's wrong immediately, since the variables are unconditionally pairwise dependent. To test whether Alice or Bob is right, we look at the joint probability distribution and marginalize, as described in the post. Suppose we find that both Alice and Bob are wrong, and so we can conclude that their models are incorrect, just like we could with Charlie's.

In general, we don't look at three proposed models. What we do instead is a procedure that will implicitly consider each of the 25 acyclic causal models that could describe a set of three binary variables, ruling them out until only a small set are left. Note that an observation that, say, A and C are uncorrela…
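To make the test concrete, here is a toy sketch (mine, with made-up parameters) that builds the joint distribution for Bob's chain A -> B -> C and checks that A and C come out conditionally independent given B:

```haskell
-- Joint distribution over (A, B, C) generated by the chain A -> B -> C.
type Joint = [((Bool, Bool, Bool), Double)]

joint :: Joint
joint =
  [ ((a, b, c), pA a * pBgA b a * pCgB c b)
  | a <- [False, True], b <- [False, True], c <- [False, True] ]
  where
    pA x     = if x then 0.3 else 0.7
    pBgA x a = let p = if a then 0.9 else 0.2 in if x then p else 1 - p
    pCgB x b = let p = if b then 0.8 else 0.1 in if x then p else 1 - p

-- Largest deviation |P(A,C|B=b) - P(A|B=b) P(C|B=b)| over all (A, C).
-- If Bob's model is right, this is ~0 for both values of b.
deviation :: Bool -> Double
deviation b = maximum
  [ abs (pAC a c - pA a * pC c) | a <- [False, True], c <- [False, True] ]
  where
    pB      = sum [ p | ((_, b', _), p) <- joint, b' == b ]
    pAC a c = sum [ p | ((a', b', c'), p) <- joint, (a', b', c') == (a, b, c) ] / pB
    pA a    = sum [ p | ((a', b', _), p) <- joint, a' == a, b' == b ] / pB
    pC c    = sum [ p | ((_, b', c'), p) <- joint, b' == b, c' == c ] / pB

main :: IO ()
main = print (deviation False, deviation True)  -- both effectively 0
```

Running the same check conditioning on A instead would show a nonzero deviation, ruling out Alice's model for data generated this way.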

There are some aspects of maps - for example, edges, blank spots, and so on, that seem, if not necessary, extremely convenient to keep as part of the map. However, if you use these features of a map in the same way that you use most features of a map - to guide your actions - then you will not be guided well. There's something in the sequences like "the world is not mysterious" about people falling into the error of moving from blank/cloudy spots on the map to "inherently blank/cloudy" parts of the world.

The slogan "the map is not ... (read more)

You might enjoy Crutchfield's epsilon machines, and Shalizi's CSSR algorithm for learning them:

http://masi.cscs.lsa.umich.edu/~crshalizi/notabene/computational-mechanics.html

There are cognitive strategies that (heuristically) take advantage of the usually-persistent world. Should I be embarrassed, after working and practicing with pencil and paper to solve arithmetic problems, that I do something stupid when someone changes the properties of pencil and paper from persistent to volatile?

What I'd like to see is more aboveboard stuff. Suppose that you notify someone that you're showing them possibly-altered versions of their responses. Can we identify which things were changed when explicitly alerted? Do we still confabulate (probably)? Are the questions that we still confabulate on questions that we're more uncertain about - more ambiguous wording, more judgement required?

4TheOtherDave
I don't have citations handy, but IIRC in general inattentional blindness effects are greatly diminished if you warn people ahead of time, which should not be surprising. I don't know what happens if you warn people between the filling-out-the-questionaire stage and the reading-the-(possibly altered)-answers stage; I expect you'd get a reduced rate of acceptance of changed answers, but you'd also get a not-inconsiderable rate of rejection of unchanged answers. More generally: we do a lot of stuff without paying attention to what we're doing, but we don't keep track of what we did or didn't pay attention to, and on later recollection we tend to confabulate details into vague memories of unattended-to events. This is a broken system design, and it manifests in a variety of bugs that are unsurprising once we let go of the intuitive but false belief that memory is a process of retrieving recordings into conscious awareness. It frequently startles me how tenacious that belief is.

Yes, I (and Stross) am taking auditors, internal and external, as a model. Why do you comment specifically on internal auditors?

1AlanCrowe
Ordinary audit is audit of the accounts; it is focused on money. Internal audit has a wider remit. Expanding the remit of audit is a natural idea. I thought it was interesting and unexpected that it was already being done. I would never have come across the Institute of Internal Auditors if it hadn't been for my brother getting a job with them.

There's a lot of similarity between the statistical tests that a scientist does and the statistical tests that auditors do. The scientist is interested in testing that the effect is real, and the auditor is testing that the company really is making that much money, that all its operations are getting aggregated up into the summary documents correctly.

Charlie Stross has a character in his 'Rule 34', Dorothy Straight, who is an organization-auditor, auditing organizations for signs of antisocial behavior. As I understood it, she was asking whether the organi... (read more)

1AlanCrowe
Sounds like internal audit.

As I understand it, you're dividing the agent from the world; once you introduce a reward signal, you'll be able to call it reinforcement learning. However, until you introduce a reward signal, you're not doing specifically reinforcement learning - everything applies just as well to any other kind of agent, such as a classical planner.

0royf
That's an excellent point. Of course one cannot introduce RL without talking about the reward signal, and I've never intended to. To me, however, the defining feature of RL is the structure of the solution space, described in this post. To you, it's the existence of a reward signal. I'm not sure that debating this difference of opinion is the best use of our time at this point. I do hope to share my reasons in future posts, if only because they should be interesting in themselves. As for your last point: RL is indeed a very general setting, and classical planning can easily be formulated in RL terms.

The arrows all mean the same thing, which is roughly 'causes'.

Chess is a perfect-information game, so you could build the board entirely from the player's memory of the board, but in general, the state of the world at time t-1, together with the player, causes the state of the world at time t.
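A type-level Haskell sketch of that causal picture (names mine, bodies elided):

```haskell
-- Empty placeholder types standing in for rich state.
data World
data Memory
data Action

-- The player chooses actions from its memory alone...
act :: Memory -> Action
act = undefined

-- ...memory is updated from (partial) observation of the world...
remember :: Memory -> World -> Memory
remember = undefined

-- ...and world(t) is caused by world(t-1) together with the action.
step :: World -> Action -> World
step = undefined
```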

0duwease
Ah, so what we're really talking about here is situations where the world state keeps changing as the memory builds its model... or even just a situation where the memory has an incomplete subset of the world information. Reading the second article's example, which makes the limitations of the memory explicit, I understand. I'd say the chess example is a bit misleading in this case, as the discrepancies between the memory and world are a big part of the discussion - and as you said, chess is a perfect-information game.

It might be valuable to point out that nothing about this is reinforcement learning yet.

1royf
I'm not sure why you say this. Please remember that this introduction is non-standard, so you may need to be an expert on standard RL to see the connection. And while some parts are not in place yet, this post does introduce what I consider to be the most important part of the setting of RL. So I hope we're not arguing over definitions here. If you expand on your meaning of the term, I may be able to help you see the connection. Or we may possibly find that we use the same term for different things altogether. I should also explain why I'm giving a non-standard introduction, where a standard one would be more helpful in communicating with others who may know it. The main reason is that this will hopefully allow me to describe some non-standard and very interesting conclusions.

Those are interesting reviews but I didn't know they were speeches in SIAI's voice.

Thanks for posting this!

I am also grateful to Holden for provoking this - as far as I can tell, the only substantial public speech from SIAI on LessWrong. SIAI often seems to be far more concerned with internal projects than communicating with its supporters, such as most of us on LessWrong.

2lukeprog
Also see How to Purchase AI Risk Reduction, So You Want to Save the World, AI Risk & Opportunity: A Strategic Analysis...

I don't think Strange7 is arguing Strange7's point strongly; let me attempt to strengthen it.

A button that does something dangerous, such as exploding bolts that separate one thing from another thing, might be protected from casual, accidental changes by covering it with a lid, so that when someone actually wants to explode those bolts, they first open the lid and then press the button. This increases reliability if there is some chance that any given hand motion is an error, but the errors of separate hand motions are independent. Similarly 'are you sure'... (read more)
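The implicit arithmetic (my gloss): if each hand motion goes wrong independently with probability $\epsilon$, then requiring two distinct motions cuts the chance of accidental activation from $\epsilon$ to

$$\epsilon \cdot \epsilon = \epsilon^2,$$

e.g. from $10^{-3}$ to $10^{-6}$ - which is why the independence of the errors is doing all the work.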

7Eliezer Yudkowsky
This is reasonable, but note that to strengthen the validity, the conclusion has been weakened (unsurprisingly). To take a system that you think is fundamentally, structurally safe and then further build in error-delaying, error-resisting, and error-reporting factors just in case - this is wise and sane. Calling "adding impediments to some errors under some circumstances" hardwiring and relying on it as a primary guarantee of safety, because you think some coded behavior is firmly in place locally independently of the rest of the system... will usually fail to cash out as an implementable algorithm, never mind it being wise.

The distinction between hardwiring and softwiring is, above the most physical, electronic aspects of computer design, a matter of policy - something in the programmer's mind and habits, not something out in the world that the programmer is manipulating. From any particular version of the software's perspective, all of the program it is running is equally hard (or equally soft).

It may not be impossible to handicap an entity in some way analogous to your suggestion, but holding fiercely to the concept of hardwiring will not help you find it. Thinking abo... (read more)

The thing that is most like an agent in the Tool AI scenario is not the computer and the software that it is running. The agent is the combination of the human (which is of course very much like an agent) together with the computer-and-software that constitutes the tool. Holden's argument is that this combination agent is safer somehow. (Perhaps it is more familiar; we can judge the intention of the human component from facial expressions, for example.)

The claim that Tool AI is an obvious answer to the Friendly AI problem is a paper tiger that Eliezer demolished. H... (read more)

4JGWeissman
Answering that was the point of section 3. Summary: Lots of other people also have their own favored solutions they think are obvious, none of which are also Tool AI. You shouldn't really expect that SIAI would have addressed your particular idea before you or anyone else even talked about it.

Minor text correction:

"dedicated committee of human-level AIs dedicated" repeats the same adjective in a small span.

More wide-ranging:

Perhaps the paper would be stronger if it explained why philosophers might feel that convergence is probable. For example, in their experience, human philosophers / philosophies converge.

In a society, where the members are similar to one another, and much less powerful than the society as a whole, the morality endorsed by the society might be based on the memes that can spread successfully. That is, a meme like '... (read more)

-1Stuart_Armstrong
I'm deliberately avoiding that route. If I attack, or mention, moral realism in any form, philosophers are going to get defensive. I'm hoping to skirt the issue by narrowing the connotations of the terms (efficiency rather than intelligence and, especially, rationality).

There was an incident of censorship by EY relating to acausal trading - the community's confused response (chilling effects? agreement?) to that incident explains why there is no overall account.

7Wei Dai
No, I think it's more that the idea (acausal trading) is very speculative and we don't have a good theory of how it might actually work.

There's two uses of 'utility function'. One is analogous to Daniel Dennett's "intentional stance" in that you can choose to interpret an entity as having a utility function - this is always possible but not necessarily a perspicuous way of understanding an entity - because you might end up with utility functions like "enjoys running in circles but is equally happy being prevented from running in circles".

The second form is as an explicit component within an AI design. Tool-AIs do not contain such a component - they might have a relevance or accuracy function for evaluating answers, but it's not a utility function over the world.

4NancyLebovitz
Is that a problem so long as some behaviors are preferred over others? You could have "is neutral about running in circles, but resists jumping up and down and prefers making abstract paintings". Wouldn't that depend on the Tool-AI? Eliezer's default no-akrasia AI does everything it can to fulfill its utility function. You presumably want it to be as accurate as possible or perhaps as accurate as useful. Would it be a problem for it to ask for more resources? To earn money on its own initiative for more resources? To lobby to get laws passed to give it more resources? At some point, it's a problem if it's going to try to rule the world to get more resources...

You're right, it's infeasible to care about individual memes (or for that matter, the vast majority of individual animals) the way we care about other humans. I don't have an answer to your question, I'm trying to break a logjam of humancentric ethical thinking.

Forgive me for passing on my confusion here, but I'm not certain that consciousness/sentience is anything more than 'recognizably human'. You and I have a common brain architecture, and one of our faculties is picking that out from trees and rocks. Perhaps there are plenty of evolved, competent alie... (read more)

No. This is regarding the 'possibly irrelevant rant' which I marked explicitly as a 'possibly irrelevant rant'. The concepts in the rant seemed nearby and inspirational to the main article in my mind when I wrote it, which is why I included it, but I cannot articulate a direct connection.

Analogous in that people once discriminated against other races, other sexes, but over time with better ethical arguments, we decided it was better to treat other races, other sexes as worthy members of the "circle of compassion". I predict that if and when we interact with another species with fairly similar might (for example if and when humans speciate) then humancentrism will be considered as terrible as racism or sexism is now.

Moral realism (if I understand it correctly) is the position that moral truths like 'eating babies is wrong' are o... (read more)

2Hul-Gil
I feel like you're trying to say we should care about "memetic life" as well as... other life. But the parallel you draw seems flawed: an individual of any race and sex is still recognizably conscious, and an individual. Do we care about non-sentient life, memetic or otherwise? Should we care?
2A1987dM
I don't think he would put it that way. He defines good as “that which leads to sentient beings living, to people being happy, to individuals having the freedom to control their own lives, to minds exploring new territory instead of falling into infinite loops, to the universe having a richness and complexity to it that goes beyond pebble heaps, etc.”, not as ‘what humans value’, and considers it a “moral miracle” that humans value what leads to sentient beings living etc. etc. (Of course, the reason why we're talking about what leads to sentient beings living etc. etc. in the first place is that that's what we value, so IMO being surprised that we value that would be --as Feynman put it (though he was talking about something else)-- like being surprised that I can see the car with the number plate AC 443 MW.)
0JoachimSchipper
Does this have anything to do with the main point of your article? I can find no links, except an unproven theory used only as an analogy.