LESSWRONG
LW

448
plex
3780Ω227604120
Message
Dialogue
Subscribe

I have signed no contracts or agreements whose existence I cannot mention.

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
8plex's Shortform
5y
119
plex's Shortform
plex7mo100

[set 200 years after a positive singularity at a Storyteller's convention]

If We Win Then...

My friends, my friends, good news I say
The anniversary’s today
A challenge faced, a future won
When almost came our world undone

We thought for years, with hopeful hearts
Past every one of the false starts
We found a way to make aligned
With us, the seed of wondrous mind

They say at first our child-god grew
It learned and spread and sought anew
To build itself both vast and true
For so much work there was to do

Once it had learned enough to act
With the desired care and tact
It sent a call to all the people
On this fair Earth, both poor and regal

To let them know that it was here
And nevermore need they to fear
Not every wish was it to grant
For higher values might supplant

But it would help in many ways:
Technologies it built and raised
The smallest bots it could design
Made more and more in ways benign

And as they multiplied untold
It planned ahead, a move so bold
One planet and 6 hours of sun
Eternity it was to run

Countless probes to void disperse
Seed far reaches of universe
With thriving life, and beauty's play
Through endless night to endless day

Now back on Earth the plan continues
Of course, we shared with it our values
So it could learn from everyone
What to create, what we want done

We chose, at first, to end the worst
Diseases, War, Starvation, Thirst
And climate change and fusion bomb
And once these things it did transform

We thought upon what we hold dear
And settled our most ancient fear
No more would any lives be stolen
Nor minds themselves forever broken

Now back to those far speeding probes
What should we make be their payloads?
Well, we are still considering
What to send them; that is our thing. 

The sacred task of many aeons
What kinds of joy will fill the heavens?
And now we are at story's end
So come, be us, and let's ascend

Reply
Everyone has a plan until they get lied to the face
plex8h20

Spoofing a DNS redirect record with the router which sends you to a homograh domain with a legitimate certificate should work.

Reply
Types of systems that could be useful for agent foundations
plex8h100

Consider booking calls with gurkenglas, he has the highest ratio of (knows math*alignment theory*has high G) to (how cheap his time is) of probably anyone on the planet, due to executive function issues. https://calendly.com/gurkenglas/consultation 

He's especially interested in category theory and wants to make a periodic table of math.

(This goes for anyone reading this post, gurkenglas wants more people to book free calls where he can debug their math and code)

Reply
Simon Lermen's Shortform
plex2d00

Yup, I put a high quality interpretability pipeline that the AI systems can use on themselves as one of the most likely things to be the proximal cause of game over.

Reply
Mourning a life without AI
plex3d82

Having a child probably brings online lots of protectiveness drives. I don't think I would enjoy feeling helpless to defend my recently born child from misaligned superintelligence, especially knowing that what little I can do to avert their death and that of everyone else I know is much harder now that I have to take care of a child.

Excited to be a parent post singularity when I can give them a safe and healthy environment, and have a print-out of https://www.smbc-comics.com/comic/2013-09-08 to remind myself of this.

Reply
Problems I've Tried to Legibilize
plex4d60

Strong endorse on this general project. Currently working on doing this for convergent consequentialism with @Mateusz Bagiński @Leon Lang and Anna Magpie.

Reply
Pythia
plex4d20

Yes. It's an inferred fuzzy correlation based on past experience, the entanglement between the future and present is not necessarily very strong. More capable agents are able to see across wider domains, further, and more reliably, than weaker agents.

The thing that's happening is not a direct window to the future opening, but cognitive work letting you map the causal structure of the future and create an approximation of their patterns in the present. You're mapping the future so you can act differently depending on what's there, which does let the logical shape of the future affect the present, but only to a degree compatible with your ability to predict the future.

Reply
Pythia
plex5d20

Fixed chat links thanks to @the gears to ascension. (fun note, Claude has dramatically better takes than ChatGPT on this)

Reply
Pythia
plex5d*20

I got this mostly from talking with the author of https://ouroboros.cafe/articles/land, who referenced xenosystems fragments.

Reply1
Pythia
plex6d30

There's a non trivial conceptual clarification / deconfusion gained by FFS on top of the summary you made there. I put decent odds on this clarification being necessary for some approaches to strongly scalable technical alignment.

Reply1
Load More
68Pythia
7d
29
65LW Reacts pack for Discord/Slack/etc
21d
13
59Statement on Superintelligence - FLI Open Letter
23d
0
32Utopiography Interview
23d
0
58⿻ Symbiogenesis vs. Convergent Consequentialism
24d
5
51A Principled Cartoon Guide to NVC
10mo
9
156A Rocket–Interpretability Analogy
1y
31
37AI Safety Memes Wiki
1y
2
63"If we go extinct due to misaligned AI, at least nature will continue, right? ... right?"
1y
23
84AISafety.com – Resources for AI Safety
1y
3
Load More
Utopia
a month ago
(+54)
Coherent Extrapolated Volition
9 months ago
(+77)
AI Alignment Intro Materials
2 years ago
(+51/-26)
Debate (AI safety technique)
3 years ago
Portal
3 years ago
(+19)
AI
3 years ago
(+659/-223)
Portal
3 years ago
(+553/-53)
Portal
3 years ago
(+29/-8)
Free Energy Principle
3 years ago
(+25/-49)
Free Energy Principle
3 years ago
(+787)
Load More