Directly Try Solving Alignment for 5 weeks
The Moonshot Alignment Program is a 5-week research sprint from August 2nd to September 6th, focused on the hard part of alignment: finding methods for getting an AI to do what we want (and not what we don't want) that we have strong evidence will scale to superintelligence. You'll join a small team, choose a vetted research direction, and run experiments to test whether your approach actually generalizes.

Mentors include: @Abram Demski, @Cole Wyeth
Research assistants include: Leonard Piff, Péter Trócsányi

Apply before July 27th. The first 300 applicants are guaranteed personalised feedback. 166 applicants so far.

For this program, we have four main tracks:

1. Agent Foundations Theory: Build formal models of agents and value formation.
2. Applied Agent Foundations: Implement and test agent models.
3. Neuroscience-based AI Alignment: Design architectures inspired by how the brain encodes values.
4. Improved Preference Optimization: Build oversight methods that embed values deeply and scale reliably.

We're also offering a fifth Open Track for original ideas that do not fit neatly into any of the initial four categories.

How does the program work?

The program runs for 5 weeks. Each week focuses on a different phase of building and testing an alignment method. The goal is to embed values in a system in a way that generalizes and can't be easily gamed (a toy sketch of what one such generalization test could look like appears at the end of this post).

Participants will form teams during the application process. We recommend teams of 3–5. You can apply solo or with collaborators; if applying solo, we'll match you based on track, timezone, and working style.

Eligibility

* Anyone is welcome to apply.
* Prior research experience is preferred but not required.
* We have limited mentorship bandwidth, so we can only take a fixed number of participants.
* You can still contribute even if you don't match the suggested background for a track.

Mentors may support specific teams depending on availability. Teams are expected to coordinate independently and meet regularly.
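Purely as illustration, and not part of the official program materials: one minimal version of a "does it generalize?" experiment is to fit a linear Bradley-Terry reward model on pairwise preferences drawn from one input distribution, then check whether its preference accuracy survives a distribution shift. Everything in the sketch below (the synthetic "true reward", the feature dimensions, the shift values) is invented for the example.

```python
# Toy sketch (assumptions throughout): fit a linear reward model on
# pairwise preferences sampled near one distribution, then measure how
# its preference accuracy degrades as the input distribution shifts.
import numpy as np

rng = np.random.default_rng(0)

def true_reward(x):
    # Nonlinear stand-in for the "true values": a linear proxy fit near
    # the training distribution will degrade as inputs shift away from it.
    return x @ np.array([1.0, -2.0, 0.5]) - 0.5 * x[:, 0] ** 2

def sample_pairs(n, shift=0.0):
    """Sample option pairs; label is 1.0 when option a is truly preferred."""
    a = rng.normal(shift, 1.0, size=(n, 3))
    b = rng.normal(shift, 1.0, size=(n, 3))
    return a, b, (true_reward(a) > true_reward(b)).astype(float)

def fit_linear_reward(a, b, prefs, steps=3000, lr=0.1):
    """Logistic regression on reward differences (Bradley-Terry model)."""
    w = np.zeros(a.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(a - b) @ w))      # P(a preferred | w)
        w += lr * (a - b).T @ (prefs - p) / len(p)  # gradient ascent step
    return w

a_tr, b_tr, prefs_tr = sample_pairs(2000, shift=0.0)
w_hat = fit_linear_reward(a_tr, b_tr, prefs_tr)

# Evaluate preference accuracy in-distribution and under growing shift.
for shift in (0.0, 1.5, 3.0):
    a_te, b_te, prefs_te = sample_pairs(2000, shift=shift)
    acc = np.mean(((a_te - b_te) @ w_hat > 0) == prefs_te)
    print(f"shift={shift:.1f}  preference accuracy={acc:.3f}")
```

The point of the sketch is the protocol, not the toy model: train a value proxy on one distribution, then measure whether the preferences it implies still hold as you move off-distribution.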