plex — LessWrong

I have signed no contracts or agreements whose existence I cannot mention.

[set 200 years after a positive singularity at a Storyteller's convention]

If We Win Then...

My friends, my friends, good news I say
The anniversary’s today
A challenge faced, a future won
When almost came our world undone

We thought for years, with hopeful hearts
Past every one of the false starts
We found a way to make aligned
With us, the seed of wondrous mind

They say at first our child-god grew
It learned and spread and sought anew
To build itself both vast and true
For so much work there was to do

Once it had learned enough to act
With the desired care and tact
It sent a call to all the people
On this fair Earth, both poor and regal

To let them know that it was here
And nevermore need they to fear
Not every wish was it to grant
For higher values might supplant

But it would help in many ways:
Technologies it built and raised
The smallest bots it could design
Made more and more in ways benign

And as they multiplied untold
It planned ahead, a move so bold
One planet and 6 hours of sun
Eternity it was to run

Countless probes to void disperse
Seed far reaches of universe
With thriving life, and beauty's play
Through endless night to endless day

Now back on Earth the plan continues
Of course, we shared with it our values
So it could learn from everyone
What to create, what we want done

We chose, at first, to end the worst
Diseases, War, Starvation, Thirst
And climate change and fusion bomb
And once these things it did transform

We thought upon what we hold dear
And settled our most ancient fear
No more would any lives be stolen
Nor minds themselves forever broken

Now back to those far speeding probes
What should we make be their payloads?
Well, we are still considering
What to send them; that is our thing.

The sacred task of many aeons
What kinds of joy will fill the heavens?
And now we are at story's end
So come, be us, and let's ascend

Reading groups on AI safety fundamentals

Please make sure the course materials are actually good. The courses often have glaring issues, though they do seem receptive and did say they'd update both times I pointed this out. Not sure if the latest updates have gone through yet.

I'm working on a course that will reliably cover the core concepts.

Hum, maybe single quotes feel better? Yeah, triple is weird to me.

Eg 'scare quotes'

Preference: Logical > Illogical > Hyperlogical

Finally, a cause area that LLMs can just solve. Looking forward to the new version of readthesequences, except with style transfer.

Ah, that's what I get for trusting Claude to check my first pass idea, and not poking it more extensively.

Re: 2, not much, it looks like, but I've got an upcoming post that bluedot asked for after I gave a pitch on a call for how to relate to it. Basically, either enjoy life while you can, or rage against the dying of the light.

Spoofing a DNS redirect record with the router, which sends you to a homograph domain with a legitimate certificate, should work.

Consider booking calls with gurkenglas. He has the highest ratio of (knows math*alignment theory*has high G) to (cost of his time) of probably anyone on the planet, due to executive function issues. https://calendly.com/gurkenglas/consultation

He's especially interested in category theory and wants to make a periodic table of math.

(This goes for anyone reading this post, gurkenglas wants more people to book free calls where he can debug their math and code)

Yup, I put a high quality interpretability pipeline that the AI systems can use on themselves as one of the most likely things to be the proximal cause of game over.
