All of AntonTimmer's Comments + Replies

Here is an example which I believe is directionally correct, it took me roughly 20 minutes to come up with it. The prompt is "how do living systems create meaning "?:

  1. My life feels like it has meaning (sensory-motor behavior and conceptual intentional aspects). Looking at it through an evolutionary perspective, it is highly likely that meaning assignment is the way through which living systems survived. Thus, there has to be some base biological level at which meaning is created through cell-cell communication/ bioelectricity/ biochemistry /biosensoring e
... (read more)

Maybe a different framework to look at it:

  1. The map tries to represent the territory faithfully.
  2. The map consciously misrepresent the territory. But you can still infer through the malevolent map some things about the territory.
  3. The map does not represent the territory at all but pretends to be 1. Difference to 2 is that 2 is still taking the territory as base case and changing it while 3 is not at all trying to look at the territory.
  4. The map is the territory. Any reference on the map is just a reference to another part of the map. Claiming that the map mi
... (read more)
2Nathan Young
Do you find this an intuitive framework? I find the implication that conversation fits neatly into these boxes or that these are the relevant boxes a little doubtful. Are you able to quickly give examples in any setting of what 1,2,3 and 4 would be?

I wouldn't ascribe human morality to the process of evolution. Morality is a bunch of if..., then statements. Morality seems to be more of a cultural thing and helps coordination. Morality is obviously influenced by our emotions such as disgust, love etc but these can be influenced heavily by culture, upbringing and just genes. Now let's assume the AI is getting killed if it behaves "unmoral", how can you be sure that it does not evolve to be deceptive ?

Today I though about how it is weird that so many people go into soft sciences (social sciences etc.) instead of STEM fields. I think one of the reasons may be that feedback loops are way bigger. In STEM fields most of the time you will be shown that you are wrong. However in soft sciences you can go on without ever noticing that you made a wrong judgement (outside view). Maybe alignment should look more into how people came up with theories in soft sciences ? Since it seems like the feedback loops are bigger.

3Dagon
It's possible that those that choose the social sciences do so because the questions are more important, or at least more salient to their social and moral universe.  How do people behave, and how do we make them happier, more productive, less violent, whatever certainly SOUNDS more useful than why does the thin-surface of drying fluids avoid micro-wrinkles (yes, I know someone with a Ph.D. in watching paint dry). It's also possible that the longer (or often, nonexistent) feedback cycles are a feature that are sought by some - it's kind of brutal finding out you're wrong on little details over and over, especially when that feedback doesn't really tell you whether you're right or wrong about the bigger picture.

I misused the definition of a pivotal act which makes it confusing. My bad! 

I understood the phrase pivotal act more in the spirit of out-off distribution effort. To rephrase it more clearly: Do "you" think an out-off distribution effort is needed right now ? For example sacrificing the long term (20 years) for the short term (5 years) or going for high risk-high reward strategies. 

Or should we stay on our current trajectory, since it maximizes our chances of winning ? (which as far as I can tell is "your" opinion)

7Rohin Shah
To the extent I understand you (which at this point I think I do), yes, "we" think we should stay on our current trajectory.

As far as I can tell the major disagreements are about us having a plan and taking a pivotal act. There seems to be general "consensus" (Unclear, Mostly Agree, Agree) about what the problems are and how an AGI might look. Since no pivotal acts is needed either you think that we will be able to tackle this problem with the resources we have and will have, you have (way) longer timelines (let's assume Eliezer timeline is 2032 for argument's sake) or you expect the world to make a major shift in priorities concerning AGI.

Am I correct in assuming this or am I missing some alternatives ?

(I'm on the DeepMind alignment team)

There's a fair amount of disagreement within the team as well. I'll try to say some things that I think almost everyone on the team would agree with but I could easily be wrong about that.

you think that we will be able to tackle this problem with the resources we have and will have

Presumably even on a pivotal act framing, we also have to execute a pivotal act with the resources we have and will have, so I'm not really understanding what the distinction is here? But I'm guessing that this is closest to "our" belief of the... (read more)

This seems to boil down to the "AI in the box" problem. People are convinced that keeping an AI trapped is not possible. There is a tag which you can look up (AI Boxing) or you can just read up here.

Reading this 13 years later is quite interesting when you think about how far the LW community and EA community have come. 

"If AGI systems can become as smart as humans, imagine what one human/organization could do by just replicating this AGI."

I feel the same. I think there are just a lot of problems which one could try to solve/solve which are increasing the good in the world. The difference between alignment and the rest seems to be the probability at which humans will go extinct is much higher.

Curtis Yarvin might be intresting. There are two post I would recommend mainly general theory of collaboration and one about Monarchism. He gives an interesting perspective.