Comment author: Quill_McGee 02 October 2014 04:45:38AM 2 points [-]

I assume you either linked to this in the post, or it has been mentioned in the comments, but I did not catch it in either location if it was present, so I'm linking to it anyway: http://intelligence.org/files/Non-Omniscience.pdf contains a not merely computable but tractable algorithm for assigning probabilities to a given set of first-order sentences.

Comment author: Quill_McGee 08 June 2014 03:29:50PM 0 points [-]

"S proves that A()=1 ⇒ U()=42. But S also proves that A()=1 ⇒ U()=1000000, therefore S proves that A()≠1" I don't see how this follows. Perhaps it is because, if the system was sound, it would never prove more than one value for U() for a given a, therefore by the principle of explosion it proves A()≠1? But that doesn't seem to actually follow. I'm aware that this is an old post, but on the off chance that anyone ever actually sees this comment, help would be appreciated.

Comment author: Gunnar_Zarncke 05 June 2014 07:13:20AM 4 points [-]

Which of the proposed solutions do you prefer?

Submitting...

Comment author: Quill_McGee 06 June 2014 01:20:11AM *  0 points [-]

Personally, I fall on the 'all of the above(except idea A)' side of the fence. I primarily use LessWrong for the Main board, as it is an excellent source of well-edited, well-considered articles, containing interesting or useful ideas. I want the remainder of the site to thrive because if there is not a large, active userbase and new users being attracted, then I would expect to see the types of content I want to see become less frequent. All of these ideas seem like good things to do, keeping in mind that if these do not actually support the goal of making good Main articles more frequent, then they are not good things, and it seems possible that some of these could backfire.

Comment author: Quill_McGee 31 May 2014 07:37:17AM 1 point [-]

Well, this comes up different ways under different interpretations. If there is a chance that I am being simulated, that is this is part of his determining my choice, then I give him $100. If the coin is quantum, that is there will exist other mes getting the money, I give him $100. If there is a chance that I will encounter similar situations again, I give him $100. If I were informed of the deal beforehand, I give him $100. Given that I am not simulated, given that the coin is deterministic, and given that I will never again encounter Omega, I don't think I give him $100. Seeing as I can treat this entirely in isolation due to these conditions, I have the choice between -$100 and $0, of which two options the second is better. Now, this runs into some problems. If I were informed of it beforehand, I should have precommitted. Seeing as my choices given all information shouldn't change, this presents difficulty. However, due to the uniqueness of this deal, there really does seem to be no benefit to any mes from giving him the money, and so it is purely a loss.

In response to The Allais Paradox
Comment author: Quill_McGee 14 April 2014 01:20:46AM *  0 points [-]

My resolution to this, without changing my intuitions to pick things that I currently perceive as 'simply wrong', would be that I value certainty. A 9/10 chance of winning x dollars is worth much less to me than a 10/10 chance of winning 9x/10 dollars. However, a 2/10 chance of winning x dollars is worth only barely less than a 4/10 chance of winning x/2 dollars, because as far as I can tell the added utility of the lack of worrying increases massively as the more certain option approaches 100%. Now, this becomes less powerful the closer the odds, are, but slower than the dollar difference between the two change. So a 99% chance of x is barely effected by this compared to a 100% chance of .99x, but still by a greater value than .01x, and the more likely option still dominates. I might take a 99% chance of x over a 100% chance of .9x, however, and I would definitely prefer a 99% chance of x over a 100% chance of 0.8x.

EDIT: Upon further consideration, this is wrong. If presented with the actual choice, I would still prefer 1A to 1B, but to maintain consistency I will now choose 2A > 2B.

Comment author: [deleted] 08 April 2014 03:29:54PM *  2 points [-]

Unfortunately "bias" in statistics is completely unrelated to what we're aiming for here.

In ugly, muddy words, what we're thinking is that we give the value-learning algorithm some sample of observations or world-states as "good", and possibly some as "bad", and "good versus bad" might be any kind of indicator value (boolean, reinforcement score, whatever). It's a 100% guarantee that the physical correlates of having given the algorithm a sample apply to every single sample, but we want the algorithm to learn the underlying causal structure of why those correlates themselves occurred (that is, to model our intentions as a VNM utility function) rather than learn the physical correlates themselves (because that leads to the agent wireheading itself).

Here's a thought: how would we build a learning algorithm that treats its samples/input as evidence of an optimization process occurring and attempts to learn the goal of that optimization process? Since physical correlates like reward buttons don't actually behave as optimization processes themselves, this would ferret out the intentionality exhibited by the value-learner's operator from the mere physical effects of that intentionality (provided we first conjecture that human intentions behave detectably like optimization).

Has that whole "optimization process" and "intentional stance" bit from the LW Sequences been formalized enough for a learning treatment?

Comment author: Quill_McGee 09 April 2014 06:08:43AM *  2 points [-]

http://www.fungible.com/respect/index.html This looks to be very related to the idea of "Observe someone's actions. Assume they are trying to accomplish something. Work out what they are trying to accomplish." Which seems to be what you are talking about.

Comment author: pwno 17 March 2010 06:04:28PM *  0 points [-]

Here's a good one: Why do you procrastinate?

The more correct answer is normally hidden by cognitive dissonance or just difficult to discover for inexperienced introspecters.

Comment author: Quill_McGee 03 October 2012 09:28:15AM 1 point [-]

(aware that this is 2 years late, just decided to post) I find that I work, on average,somewhere between 2-3 times as fast when I am right up next to a deadline,than when I have plenty of time.

In response to The Crackpot Offer
Comment author: Douglas_Knight2 09 September 2007 02:41:27PM 7 points [-]

While many people have mentioned similar disappointments, no one has echoed "I'll get that theorem eventually...even though my first try failed!" That was what seemed like a really bad sign when I read the essay before the comments. But I think we're really bad at communicating feelings, so I don't know how the feelings relate, how strong they were, and especially, how the commenters see the parallels with their reactions.

Comment author: Quill_McGee 03 October 2012 08:11:38AM 1 point [-]

Does it count if the state of trying lasted for a long(but now ended) time? because if so, I kept on trying to create a bijection between the reals and the wholes, until I was about 13 and found an actual number that I could actually write down that none of my obvious ideas could reach, and find an equivalent for all the non obvious ones.( 0.21111111..., by the way)

View more: Prev