I see the LLM side of this as a first step, both as a proof of concept and because agents get built on top of LLMs (for the foreseeable future at least).
I think that, no, it isn't any easier to align an agent's environment than to align the agent itself. I think that for perfect alignment, the kind that will last in all cases and for all time, the two amount to the same thing, and this is why the problem is so hard. When an agent or any AI learns new capabilities, it draws the information it needs out of the environment. It's trying to answer the question: "Given the informati...
Thanks for the comment! Taking your points in turn:
- I am curious that you read this as me saying superintelligent AI will be less dangerous, since to me it means it will be more. It will be able to dominate you in the usual hyper-competent sense, but it may also accidentally screw up some super-advanced physics and kill you that way too. It sounds like I should have stressed this more. I guess there are people who think AI sucks and will continue to suck, and therefore why worry about existential risk, so maybe by stressing AI fallibility I'm riding their energy...
I'm glad to see someone talking about pragmatism!
I find it interesting that the goal of a lot of alignment work seems to be to align AI with human values, when humans with human values spend so much of their time in (often lethal) conflict. I'm more inclined to the idea of building AI with a value-set that is complementary to human values in some widely-desirable way, rather than literally having a bunch of AIs that behave like humans.
I wonder if this perspective intersects with some of your points about thick and thin moralities, as well as social technol...