All of Nebu's Comments + Replies

Nebu10

Imagine someone named Omega offers to play a game with you. Omega has a bag, and they swear on their life that exactly one of the following statements is true:

  1. They put a single piece of paper in the bag, and it has "1" written on it.
  2. They put 10 trillion pieces of paper in the bag, numbered "1", "2", "3", etc. up to ten trillion.

Omega then has an independent neutral third party reach into the bag and pull out a random piece of paper which they then hand to you. You look at the piece of paper and it says "1" on it. Omega doesn't get to look at the piece of p... (read more)
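For concreteness, a minimal sketch of the update that drawing a "1" sets up, assuming a uniform draw and (purely for illustration) even prior odds between Omega's two claims:

```python
# Likelihood of the neutral party drawing a slip labelled "1" under each claim.
p_draw_1_if_one_slip = 1.0           # the only slip in the bag says "1"
p_draw_1_if_ten_trillion = 1 / 1e13  # one slip out of ten trillion

prior_odds = 1.0  # illustrative assumption: even odds before the draw
bayes_factor = p_draw_1_if_one_slip / p_draw_1_if_ten_trillion
posterior_odds = prior_odds * bayes_factor
print(f"{posterior_odds:.0e} : 1 in favour of the single-slip bag")  # 1e+13 : 1
```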

Nebu50

as they code I notice nested for loops that could have been one matrix multiplication.

 

This seems like an odd choice for your primary example.

  • Is the primary concern that a sufficiently smart compiler could take your matrix multiplication and turn it into a vectorized instruction?
    • Is it only applicable in certain languages then? E.g. do JVM languages typically enable vectorized instruction optimizations?
  • Is the primary concern that a single matrix multiplication is more maintainable than nested for loops?
    • Is it only applicable in certain domains then (e.g
... (read more)
1Phil Scadden
I had a similar thought, but then most of the languages I use don't support matrix operations directly in the language anyway. I'm a great believer in "tried and true" numerical analysis libraries, though.
5Gurkenglas
It's not my one trick, of course, but it illustrates my usefulness. It's more maintainable not just because it is shorter but also because it has decades of theory behind it. Drawing the connection unlocks inspiration from entire branches of math. And the speedups from standing on the shoulders of giants go far beyond the constant factors from vectorized instructions.
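For concreteness, a minimal sketch of the kind of rewrite being discussed (Python with NumPy is an assumption here; the thread doesn't pin down a language):

```python
import numpy as np

def matmul_loops(A, B):
    """The nested-for-loop pattern: explicit index bookkeeping, no library help."""
    n, k = A.shape
    k2, m = B.shape
    assert k == k2
    C = np.zeros((n, m))
    for i in range(n):
        for j in range(m):
            for t in range(k):
                C[i, j] += A[i, t] * B[t, j]
    return C

A = np.random.rand(50, 40)
B = np.random.rand(40, 30)
# The same computation expressed as one matrix multiplication.
assert np.allclose(matmul_loops(A, B), A @ B)
```

Whichever concern dominates (vectorized instructions or maintainability), the two versions compute the same thing; the question in the comment is which benefit is the primary one.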
Answer by Nebu40

For something to experience pain, some information needs to exist (e.g. in the mind of the sufferer, informing them that they are experiencing pain). There are known information limits, e.g. https://en.wikipedia.org/wiki/Bekenstein_bound or https://en.wikipedia.org/wiki/Landauer%27s_principle

These limits are related to entropy, space, energy, etc., so if you further assume the universe is finite (or perhaps equivalently, that the malicious agent can only access a finite portion of the universe due to e.g. speed-of-light limits), then there is an upper b... (read more)
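As a rough illustration of the kind of bound in play, here is the Bekenstein bound evaluated for a brain-sized system (the radius and mass below are illustrative assumptions, not claims from the comment):

```python
import math

# Bekenstein bound on the information content of a region of radius R and energy E:
#   I <= 2 * pi * R * E / (hbar * c * ln 2)   [bits]
hbar = 1.054571817e-34   # reduced Planck constant, J*s
c = 2.99792458e8         # speed of light, m/s

def bekenstein_bits(radius_m, mass_kg):
    energy_j = mass_kg * c**2
    return 2 * math.pi * radius_m * energy_j / (hbar * c * math.log(2))

print(f"{bekenstein_bits(0.1, 1.5):.1e} bits")  # roughly 4e42 bits for R = 0.1 m, m = 1.5 kg
```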

Nebu10

Yeah, which I interpret to mean you'd "lose" (where getting $10 is losing and getting $200 is winning). Hence this is not a good strategy to adopt.

Nebu30
99% of the time for me, or for other people?

99% for you (see https://wiki.lesswrong.com/wiki/Least_convenient_possible_world )

More importantly, when the fiction diverges by that much from the actual universe, it takes a LOT more work to show that any lessons are valid or useful in the real universe.

I believe the goal of these thought experiments is not to figure out whether you should, in practice, sit in the waiting room or not (honestly, nobody cares what some rando on the internet would do in some rando waiting room).

Instead, the goal is to provide unit... (read more)

Nebu30

Any recommendations for companies that can print and ship the calendar to me?

2Stuart_Armstrong
No idea, sorry - I got my copies printed locally.
Nebu10

Okay, but then what would you actually do? Would you leave before the 10 minutes is up?

2avturchin
I will toss a coin to decide whether I should go or stay.
Nebu40
why do I believe that its accuracy for other people (probably mostly psych students) applies to my actions?

Because historically, in this fictional world we're imagining, when psychologists have said that a device's accuracy was X%, it turned out to be within 1% of X%, 99% of the time.

-4Dagon
99% of the time for me, or for other people? I may not be correct in all cases, but I have evidence that I _am_ an outlier on at least some dimensions of behavior and thought. There are numerous topics where I'll make a different choice than 99% of people. More importantly, when the fiction diverges by that much from the actual universe, it takes a LOT more work to show that any lessons are valid or useful in the real universe.
Nebu10

I really should get around to signing up for this, but...

Nebu10

Seems like the survey is now closed, so I cannot take it at the moment I see the post.

Nebu00
suppose Bob is trying to decide to go left or right at an intersection. In the moments where he is deciding to go either left or right, many nearly identical copies in nearly identical scenarios are created. They are almost entirely all the same, and if one Bob decides to go left, one can assume that 99%+ of Bobs made the same decision.

I don't think this assumption is true (and thus perhaps you need to put more effort into checking/arguing it's true, if the rest of your argument relies on this assumption). In the moments where Bob is trying to decide ... (read more)

1Evan Ward
There are trillions of quantum operations occurring in one's brain all the time. Comparatively, we make very few executive-level decisions. Further, these high-level decisions are often based off a relatively small set of information & are predictable given that set of information. I believe this implies that a person in the majority of recently created worlds makes the same high-level decisions. It's hard to imagine numerous different decisions we could make in any given circumstance given the relatively ingrained decision procedures we seem to walk the Earth with. I know that your Occam's prior for Bob in a binary decision is .5. That is a separate matter from how many Bobs make the same decision given the same set of external information and the same high-level decision procedures inherited from the Bob in a recent (perhaps just seconds ago) common parent world. This decision procedure does plausibly affect recently created Everett worlds and allow people in them to coordinate amongst 'copies' of themselves. I am not saying I can coordinate w/ far past sister worlds. I can, I am theorizing, coordinate with myselves in my soon-to-be-generated child worlds because there is no reason to think quantum operations would randomly delete this decision procedure from my brain over a period of seconds or minutes.
Nebu10

I see some comments hinting towards this pseudo-argument, but I don't think I saw anyone make it explicitly:

Say I replace one neuron in my brain with a little chip that replicates what that neuron would have done. Say I replace two, three, and so on, until my brain is now completely artificial. Am I still conscious, or not? If not, was there a sudden cut-off point where I switched from conscious to not-conscious, or is there a spectrum and I was gradually moving towards less and less conscious as this transformation occurred?

If I am still conscious... (read more)

Nebu10

I played the game "blind" (i.e. I avoided reading the comments before playing) and was able to figure it out and beat the game without ever losing my ship. I really enjoyed it. The one part that I felt could have been made a lot clearer was that the "shape" of the minds signals how quickly they move towards your ship; I think I only figured that out around level 3 or so.

Nebu30
I'm not saying this should be discussed on LessWrong or anywhere else.

You might want to lead with that, because there have been some arguments in the last few days that people should repeal the "Don't talk about politics" rule on a rationality-focused Facebook group, and I thought you were trying to argue in favor of repealing those rules.

But I'm saying that the impact of this article and broader norm within the rationalsphere made me think in these terms more broadly. There's a part of me that wishes I'd never read
... (read more)
Nebu20

From a brief skim (e.g. "A Democratic candidate other than Yang to propose UBI before the second debate", "Maduro ousted before end of 2019", "Donald Trump and Xi Jinping to meet in March 2019", etc.), this seems to be focused on "non-personal" (i.e. global) events, whereas my understanding is the OP is interested in tracking predictions for personal events.

Nebu20

Spreadsheet sounds "good enough" if you're not sure you even want to commit to doing this.

That said, I'm "mildly interested" in doing this, but I don't really have inspiration for questions I'd like to make predictions on. I'm not particularly interested in doing predictions about global events and would rather make predictions about personal events. I would like a site that lets me see other people's personal predictions (really, just the questions they're predicting an answer to -- I don't car... (read more)

1[anonymous]
Yes, exactly. I dropped the idea of writing my own software because I realised that would be overcommitting too early. You might like to search by tag "Personal" on the PredictionBook linked below by habryka: https://www.google.com/search?sitesearch=predictionbook.com&q=personal&x=0&y=0
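If the spreadsheet route is taken, here is a minimal sketch of scoring the recorded predictions afterwards (the file name and the probability,outcome column layout are assumptions):

```python
import csv

def brier_score(pairs):
    """Mean squared error between stated probabilities and 0/1 outcomes; lower is better."""
    return sum((p - outcome) ** 2 for p, outcome in pairs) / len(pairs)

# Assumed layout: one "probability,outcome" row per resolved personal prediction, e.g. "0.8,1".
with open("predictions.csv") as f:
    pairs = [(float(p), int(o)) for p, o in csv.reader(f)]

print(f"Brier score over {len(pairs)} predictions: {brier_score(pairs):.3f}")
```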
Nebu30

I think this technique only works for one-on-one (or a small group), live interactions. I.e. it doesn't work well for online writings.

The two components that are important for ensuring this technique is successful are:

1. You should tailor the confusion to the specific person you're trying to teach.

2. You have to be able to detect when the confusion is doing more damage than good, and abort it if necessary.

NebuΩ250
Note: I'm not sure if at the beginning of the game, one of the agents [of AlphaStar] is chosen according to the Nash probabilities, or if at each timestep an action is chosen according to the Nash probabilities.

It's the former. During the video demonstration, the pro player remarked how after losing game 1, in game 2 he went for a strategy that would counter the strategy AlphaStar used in game 1, only to find AlphaStar had used a completely different strategy. The AlphaStar representatives responded saying there's actually 5 AlphaStar agen... (read more)

2Rohin Shah
Thanks!
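A toy sketch of the distinction being asked about; the strategy names and weights are made up, and only the two sampling structures matter:

```python
import random

agents = ["rush", "turtle", "air", "harass", "macro"]  # hypothetical agent pool
nash_weights = [0.3, 0.25, 0.2, 0.15, 0.1]             # made-up Nash probabilities

def per_game_sampling(num_steps):
    """Former reading (what the demo suggested): pick one agent for the whole game."""
    agent = random.choices(agents, weights=nash_weights)[0]
    return [agent] * num_steps

def per_timestep_sampling(num_steps):
    """Latter reading: re-draw an agent from the Nash mix at every timestep."""
    return [random.choices(agents, weights=nash_weights)[0] for _ in range(num_steps)]

print(per_game_sampling(5))      # e.g. ['air', 'air', 'air', 'air', 'air']
print(per_timestep_sampling(5))  # e.g. ['rush', 'macro', 'air', 'rush', 'turtle']
```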
Nebu10

I'm assuming you think wireheading is a disastrous outcome for a super intelligent AI to impose on humans. I'm also assuming you think if bacteria somehow became as intelligent as humans, they would also agree that wireheading would be a disastrous outcome for them, despite the fact that wireheading is probably the best solution that can be done given how unsophisticated their brains are. I.e. the best solution for their simple brains would be considered disastrous by our more complex brains.

This suggests the possibility that maybe the best solution that can be applied to human brains would be considered disastrous by a more complex brain, imagining that humans somehow became as intelligent as it.

2Vakus Drake
While I consider wireheading only marginally better than oblivion, the more general issue is the extent to which you can really call something alignment if it leads to behavior that the overwhelming majority of people consider egregious and terrible in every way. It really doesn't make sense to talk about there being a "best" solution here anyway because that basically begs the question with regards to certain moral philosophy. >I'm also assuming you think if bacteria somehow became as intelligent as humans, they would also agree that wireheading would be a disastrous outcome for them, despite the fact that wireheading is probably the best solution that can be done given how unsophisticated their brains are. I.e. the best solution for their simple brains would be considered disastrous by our more complex brains. This assumption doesn't hold and somewhat misses my point entirely. As I talked about in my comment bacteria don't seem like they meaningfully have thoughts or preferences so the idea of making a super smart bacteria is rather like making a superintelligent rock. I can remove those surface level issues if I just replace say "bacteria" with say "mice" in which case there's a different misunderstanding involved here. The main issue here is that it seems like you are massively anthropomorphizing animals. If a species of animal doesn't have a certain degree of intelligence it's unlikely to have a value system that actually cares about the external world. However it would be a form of anthropocentrism to expect that an "uplifted" version of an animal would necessarily start gaining certain terminal human values just because it's smarter. So my point more generally is that you seem to need (in natural life at least) a degree of intelligence and socialness to both be able to and have evolved a mind design that cares about the external world. So most animals can have their values easily and completely encompassed by wireheading so there's no reason no
Nebu10

I feel like this game has the opposite problem of 2-4-6. In 2-4-6, it's very easy to come up with a hypothesis that appears to work with every set of test cases you come up with, and thus become overconfident in your hypothesis.

In your game, I had trouble coming up with any hypothesis that would fit the test cases.

Nebu140

Yeah, but which way is the arrow of causality here? Like, was he already a geeky intellectual, and that's why he's both good at calculus/programming and he reads SSC/OB/LW? Or was he "pretty average", started reading SSC/OB/LW, and then that made him become good at calculus/programming?

Yes, genetics + randomness determines most variation in human behavior, but the SSC/LW stuff has helped provide some direction and motivation.

Nebu40

Would any "participants in nuclear war" (for lack of a better term) be interested in killing escaping rich westerners?

Nebu10
Just don't ask your AI system to optimize for general and long-term preferences without a way for you to say "actually, stop, I changed my mind".

I believe that reduces to "solve the Friendly AI problem".

3Rohin Shah
(Pedantic note: the right way to say that is "the Friendly AI problem reduces to that".) I'm replying to the quote from the first comment: What I'm trying to say is that once you have a "do what we mean" system, then don't explicitly ask your AI system to optimize for general and long-term preferences without a way for you to say "actually, stop, I changed my mind". I claim that the hard part there is in building a "do what we mean" system, not in the "don't explicitly ask for a bad thing" part.
Nebu10

It's not clear to me that for all observers in our universe, there'd be a distinction between "a surgeon from a parallel universe suddenly appears in our universe, and that surgeon has memories of existing in a universe parallel to the one he now finds himself in." vs "a surgeon, via random quantum fluctuations, suddenly appears in our universe, and that surgeon has memories of existing in a universe parallel to the one he now finds himself in."

In your example, rather than consider all infinitely many parallel universes, you c... (read more)

3Roko Jelavić
Perhaps I should have been more specific: I'm talking about a scenario where there is an actual machine (like a time machine but instead of travelling in time you travel between universes) in which you step and press a button, and then you appear in a parallel universe. It's not a question of who claims anything, nor is it a question of random fluctuations, it's a question of whether that kind of machine can be built or not. If it can be built, then increasing quantum diversification reduces xrisk, because then the travelers can travel around and repopulate other universes. It is simplest to imagine a scenario where all 10 universes have such machines and you can only travel from one machine to another, so you step into the machine in your universe and you step out of the machine in another universe. There is also no point in talking about the exact number of such-and-such universes, all that matters is the proportion of the universes in which something happens, there is an infinite number of every possible universe. I talked about 10 of them to simplify the principle, which holds for any n of universes.
Nebu00

So does that mean a GLUT in the zombie world cannot be conscious, but a GLUT in our world (assuming infinite storage space, since apparently we were able to assume that for the zombie world) can be conscious?

Nebu10

suppose that we (or Omega, since we're going to assume nigh omniscience) asked the person whether JFK was murdered by Lee Harvey Oswald or not, and if they get it wrong, then they are killed/tortured/dust-specked into oblivion/whatever.

Okay, but what is the utility function Omega is trying to optimize?

Let's say you walk up to Omega and tell it "was JFK murdered by Lee Harvey Oswald or not? And by the way, if you get this wrong, I am going to kill you/torture you/dust-speck you."

Unless we've figured out how to build safe oracles, with very high pro... (read more)

1jollybard
That wasn't really my point, but I see what you mean. The point was that it is possible to have a situation where the 0 prior does have specific consequences, not that it's likely, but you're right that my example was a bit off, since obviously the person getting interrogated should just lie about it.
Nebu40

I also inferred rape from the story. It was the part about how in desperation, he reached out and grabbed at her ankle. And then he was imprisoned in response to that.

Nebu00

But what then makes it recommend a policy that we will actually want to implement?

First of all, I'm assuming that we're taking as axiomatic that the tool "wants" to improve itself (or else why would it have even bothered to consider recommending that it be modified to improve itself?); i.e. improving itself is favorable according to its utility function.

Then: It will recommend a policy that we will actually want to implement, because its model of the universe includes our minds, and it can see that recommending a policy we will actually want to implement leads it to a higher-ranked state in its utility function.
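A toy sketch of that argument; the policies, probabilities, and utilities below are made up, and the point is only that the tool's ranking already prices in whether we will implement the recommendation:

```python
# name: (probability humans implement it, utility to the tool if implemented)
policies = {
    "aggressive_plan_we_would_reject": (0.05, 100.0),
    "modest_plan_we_would_accept": (0.90, 10.0),
}
utility_of_status_quo = 0.0

def expected_utility(name):
    p_adopt, u_if_adopted = policies[name]
    return p_adopt * u_if_adopted + (1 - p_adopt) * utility_of_status_quo

best = max(policies, key=expected_utility)
print(best)  # 'modest_plan_we_would_accept' (expected utility 9.0 vs 5.0)
```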

Nebu00

To steelman the parent argument a bit, a simple policy can be dangerous, but if an agent proposed a simple and dangerous policy to us, we probably would not implement it (since we could see that it was dangerous), and thus the agent itself would not be dangerous to us.

If the agent were to propose a policy that, as far as we could tell, appears safe, but was in fact dangerous, then simultaneously:

  1. We didn't understand the policy.
  2. The agent was dangerous to us.
Nebu10

Can you be a bit more specific in your interpretation of AIXI here?

Here are my assumptions, let me know where you have different assumptions:

  • Traditional-AIXI is assumed to exists in the same universe as the human who wants to use AIXI to solve some problem.
  • Traditional-AIXI has a fixed input channel (e.g. it's connected to a webcam, and/or it receives keyboard signals from the human, etc.)
  • Traditional-AIXI has a fixed output channel (e.g. it's connected to a LCD monitor, or it can control a robot servo arm, or whatever).
  • The human has somehow pre-provide
... (read more)
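For reference, the standard AIXI action-selection rule these assumptions are pointing at (as I recall Hutter's formulation, with observation–reward pairs $o_i r_i$ arriving on the fixed input channel, actions $a_i$ leaving on the fixed output channel, and horizon $m$):

$$
a_k \;=\; \arg\max_{a_k} \sum_{o_k r_k} \cdots \max_{a_m} \sum_{o_m r_m} \big(r_k + \cdots + r_m\big) \sum_{q \,:\, U(q,\, a_1 \ldots a_m) \,=\, o_1 r_1 \ldots o_m r_m} 2^{-\ell(q)}
$$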
Nebu00

I think LearnFun might be informative here. https://www.youtube.com/watch?v=xOCurBYI_gY

LearnFun watches a human play an arbitrary NES game. It is hardcoded to assume that as time progresses, the game is moving towards a "better and better" state (i.e. it assumes the player's trying to win and is at least somewhat effective at achieving their goals). The key point here is that LearnFun does not know ahead of time what the objective of the game is. It infers what the objective of the game is from watching humans play. (More technically, it observes ... (read more)
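A rough sketch of the objective-inference idea (a paraphrase of the approach, not LearnFun's actual code): keep the memory locations that (mostly) increase while the human plays, and treat "make those go up" as the inferred goal.

```python
def infer_objective(ram_snapshots, tolerance=0.95):
    """ram_snapshots: one sequence of RAM byte values per frame of the human's playthrough."""
    num_locations = len(ram_snapshots[0])
    objective_addrs = []
    for addr in range(num_locations):
        series = [frame[addr] for frame in ram_snapshots]
        steps = list(zip(series, series[1:]))
        non_decreasing = sum(b >= a for a, b in steps)
        if steps and non_decreasing / len(steps) >= tolerance:
            objective_addrs.append(addr)  # e.g. score digits, level counter, x-position
    return objective_addrs
```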

Nebu20

The analogy with cryptography is an interesting one, because...

In cryptography, even after you've proven that a given encryption scheme is secure, and that proof has been centuply (100 times) checked by different researchers at different institutions, it might still end up being insecure, for many reasons.

Examples of reasons include:

  • The proof assumed mathematical integers/reals, of which computer integers/floating point numbers are just an approximation.
  • The proof assumed that the hardware the algorithm would be running on was reliable (e.g. a reliable s
... (read more)
Nebu00

Also, if you're going to measure information content, you really need to fix a formal language first, or else "the number of bits needed to express X" is ill-defined.

Basically, learn model theory before trying to wield it.

I don't know model theory, but isn't the crucial detail here whether or not the number of bits needed to express X is finite or infinite? If so, then it seems we can handwave the specific formal language we're using to describe X, in the same way that we can generally handwave which encoding for Turing Machines we use when talking ab... (read more)
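The Turing-machine fact being leaned on there is the invariance theorem: for any two universal machines $U$ and $V$ there is a constant $c_{UV}$, independent of $x$, such that

$$
K_U(x) \;\le\; K_V(x) + c_{UV},
$$

so on the Turing-machine side the choice of encoding only shifts description lengths by an additive constant.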

0[anonymous]
Let's say I have a specific model of the real numbers in mind, and let's pretend "number of bits needed to describe X" means "log2 of the length of the shortest theory that proves the existence of X." Fix a language L1 whose constants are the rational numbers and which otherwise is the language of linear orders. Then it takes a countable number of propositions to prove the existence of any given irrational number (i.e., exists x[1] such that x[1] < u[1], ..., exists y[1] such that y[1] > v[1], ..., x[1] = y[1], ... x[1] = x[2], ..., where the sequences u[n] and v[n] are strict upper and lower bounding sequences on the real number in question). Now fix a language L2 whose constants are the real numbers. It now requires one proposition to prove the existence of any given irrational number (i.e., exists x such that x = c). The difference between this ill-defined measure of information and Kolmogorov complexity is that Turing Machines are inherently countable, and the languages and models of model theory need not be. (Disclaimer: paper-machine2011 knew far more about mathematical logic than paper-machine2016 does.)
Nebu20

I can't directly observe Eliezer winning or losing, but I can make (perhaps very weak) inferences about how often he wins/loses given his writing.

As an analogy, I might not have the opportunity to play a given videogame ABC against a given blogger XYZ that I've never met and will never meet. But if I read his blog posts on ABC strategies, and try to apply them when I play ABC, and find that my win-rate vastly improves, I can infer that XYZ also probably wins often (and probably wins more often than I do).

Nebu00

I guess I'm asking "Why would a finite-universe necessarily dictate a finite utility score?"

In other words, why can't my utility function be:

  • 0 if you give me the entire universe minus all the ice cream.
  • 1 if you give me the entire universe minus all the chocolate ice cream.
  • infinity if I get chocolate ice cream, regardless of how much chocolate ice cream I receive, and regardless of whether the rest of the universe is included with it.
Nebu10

I suspect that if we're willing to say human minds are Turing Complete[1], then we should also be willing to say that an ant's mind is Turing Complete. So when imagining a human with a lot of patience and a very large notebook interacting with a billion year old alien, consider an ant with a lot of patience and a very large surface area to record ant-pheromones upon, interacting with a human. Consider how likely it is that human would be interested in telling the ant things it didn't yet know. Consider what topics the human would focus on telling the ant, ... (read more)

Nebu00

Why not link to the books or give their ISBNs or something?

There are at least two books on model theory by Hodges: ISBN:9780521587136 and ISBN:9780511551574

Nebu00

Why would we give the AI a utility function that assigns 0 utility to an outcome where we get everything we want but it never turns itself off?

The designer of that AI might have (naively?) thought this was a clever way of solving the friendliness problem. Do the thing I want, and then make sure to never do anything again. Surely that won't lead to the whole universe being tiled with paperclips, etc.

Nebu10

Alternately, letting "utility" back in, in a universe of finite time, matter, and energy, there does exist a maximum finite utility which is the sum total of the time, matter, and energy in the universe.

Why can't my utility function be:

  • 0 if I don't get ice cream
  • 1 if I get vanilla ice cream
  • infinity if I get chocolate ice cream

?

I.e. why should we forbid a utility function that returns infinity for certain scenarios, except insofar that it may lead to the types of problems that the OP is worrying about?
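As a toy illustration of the kind of problem such a function invites (the lotteries and numbers are made up), any nonzero chance of the infinitely-valued outcome swamps every other consideration in an expected-utility comparison:

```python
INF = float("inf")

def utility(outcome):
    if outcome == "chocolate":
        return INF
    if outcome == "vanilla":
        return 1.0
    return 0.0

def expected_utility(lottery):
    """lottery: mapping from outcome to probability."""
    return sum(p * utility(o) for o, p in lottery.items())

# A one-in-a-billion shot at chocolate "beats" a guaranteed vanilla cone:
print(expected_utility({"chocolate": 1e-9, "none": 1 - 1e-9}))  # inf
print(expected_utility({"vanilla": 1.0}))                        # 1.0
```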

0Usul
I was bringing the example into the presumed finite universe in which we live, where Maximum Utility = The Entire Universe. If we are discussing a finite-quantity problem, then infinite quantity is ipso facto ruled out.
Nebu10

But what about prediction markets?

Nebu81

Yes, this is a parable about AI safety research, with the humans in the story acting as the AI, and the aliens acting as us.

Nebu20

Right, I suspect just having heard about someone's accomplishments would be an extremely noisy indicator. You'd want to know what they were thinking, for example by reading their blog posts.

Eliezer seems pretty rational, given his writings. But if he repeatedly lost in situations where other people tend to win, I'd update accordingly.

0Lumifer
If you define rationality as winning, why does it matter what his writings seem like?
1ChristianKl
But what about the other case? People who don't seem rational given their writings but who repeatedly win?
Nebu00

I assume that you accept the claim that it is possible to define what a fair coin is, and thus what an unfair coin is.

If we observe some coin, at first, it may be difficult to tell if it's a fair coin or not. Perhaps the coin comes from a very trustworthy friend who assures you that it's fair. Maybe it's specifically being sold in a novelty store and labelled as an "unfair coin" and you've made many purchases from this store in the past and have never been disappointed. In other words, you have some "prior" probability belief that the c... (read more)
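A minimal sketch of that prior-plus-evidence picture, using a Beta prior over the coin's heads-probability (the prior pseudo-counts and the flip sequence are illustrative assumptions):

```python
# Beta(heads, tails) prior over P(heads); a trustworthy friend's coin might start
# near-symmetric and fairly confident, e.g. Beta(50, 50).
prior_heads, prior_tails = 50, 50
flips = "HHTHHHHTHH"  # made-up observations

post_heads = prior_heads + flips.count("H")
post_tails = prior_tails + flips.count("T")
posterior_mean = post_heads / (post_heads + post_tails)
print(f"Posterior mean P(heads) = {posterior_mean:.3f}")  # 0.527: nudged away from "fair"
```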

2Lumifer
Hm, OK. So you are saying that the degree of rationalism is an unobservable (hidden) variable and what we can observe (winning or losing) is contaminated by noise (luck). That's a fair way of framing it. The interesting question then becomes what kind of accuracy can you achieve in the real world given that the noise levels are high, the information available to you is limited, and your perception is imperfect (e.g. it's not uncommon to interpret non-obvious high skill as luck).
Nebu00

People who win are not necessarily rationalists. A person who is a rationalist is more likely to win than a person who is not.

Consider someone who just happens to win the lottery vs someone who figures out what actions have the highest expected net profit.

Edit: That said, careful not to succumb to the argument from consequences (http://rationalwiki.org/wiki/Argument_from_consequences): maybe Genghis Khan really was one of the greatest rationalists ever. I've never met the guy nor read any of his writings, so I wouldn't know.

2Lumifer
Even ignoring the issue that "rationalist" is not a binary variable, I don't know how in practice will you be able to tell whether someone is a rationalist or not. Your definition depends on counterfactuals and without them you can't disentangle rationalism and luck.
Nebu00

Actually, I think "Rationalists should WIN" regardless of what their goals are, even if that includes social wrestling matches.

The "should" here is not intended to be moral prescriptivism. I'm not saying in an morally/ethically ideal world, rationalists would win. Instead, I'm using "should" to help define what the word "Rationalist" means. If some person is a rationalist, then given equal opportunity, resources, difficult-of-goal, etc., they will on average, probabilistically win more often than someone who was not ... (read more)

2Lumifer
There is a bit of a problem here in that the list of the greatest rationalists ever will be headed by people like Genghis Khan and Prophet Muhammad.
Nebu20

rationalists as people who make optimal plays versus rationalists as people who love truth and hate lies

It's only possible for us to systematically make optimal plays IF we have a sufficient grasp of truth. There's only an equivocation in the minds of people who don't understand that one goal is a necessary precursor for the other.

No, I think there is an equivocation here, though that's probably because of the term "people who love truth and hate lies" instead of "epistemic rationalist".

An epistemic rationalist wants to know truth... (read more)

Nebu00

The problem with the horses of one color problem is that you are using sloppy verbal reasoning that hides an unjustified assumption that n > 1.

I'm not sure what you mean. I thought I stated it each time I was assuming n=1 and n=2.

In the induction step, we reason "The first horse is the same colour as the horses in the middle, and the horses in the middle have the same colour as the last horse. Therefore, all n+1 horses must be of the same colour". This reasoning only works if n > 1, because if n = 1, then there are no "horses in the middle", and so "the first horse is the same colour as the horses in the middle" is not true.

Nebu00

I think this argument is misleading.

Re "for game theoretical reasons", the paperclipper might take revenge if it predicted that doing so would be a signalling-disincentive for other office-supply-maximizers from stealing paperclips. In other words, the paperclip-maximizer is spending paperclips to take revenge solely because in its calculation, this actually leads to the expected total number of paperclips going up.

1Armok_GoB
That assumes the scenario is iterated; I'm saying it'd precommit to do so even in a one-off scenario. The rest of your argument was my point, that the same reasoning goes for anger.
Nebu00

What does it mean for a program to have intelligence if it does not have a goal?

This is a very interesting question, thanks for making me think about it.

(Based on your other comments elsewhere in this thread), it seems like you and I are in agreement that intelligence is about having the capability to make better choices. That is, given two agents with an identical problem and identical resources to work with, the more intelligent agent is more likely to make the "better" choice.

What does "better" mean here? We need to define some... (read more)

Nebu20

Feedback:

Need an example? Sure! I have two dice, and they can each land on any number, 1-6. I’m assuming they are fair, so each has probability of 1/6, and the logarithm (base 2) of 1/6 is about -2.585. There are 6 states, so the total is 6* (1/6) * 2.585 = 2.585. (With two dice, I have 36 possible combinations, each with probability 1/36, log(1/36) is -5.17, so the entropy is 5.17. You may have noticed that I doubled the number of dice involved, and the entropy doubled – because there is exactly twice as much that can happen, but the average entropy is

... (read more)
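For what it's worth, a minimal sketch that reproduces the quoted numbers:

```python
import math

def entropy_bits(probs):
    """Shannon entropy in bits: -sum(p * log2(p))."""
    return -sum(p * math.log2(p) for p in probs)

print(entropy_bits([1 / 6] * 6))    # ~2.585 bits for one fair die
print(entropy_bits([1 / 36] * 36))  # ~5.170 bits for two dice: exactly double
```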