Eye You

Reasons to believe current AI models are conscious

There are a number of reasons to believe current AI models are conscious. I mean “conscious” in the sense of “is there something it is like to be an AI model?” and “does the AI model have phenomenal experience?”. As to what “AI models” refers to, the short answer is...

Jul 1751

We Need to Get Serious about Uplift Studies

by frmsaul and Eye You

The time it takes an AI or a Human+AI team (a "cyborg") to complete a task is a key aspect of what we care about when we talk about capabilities. The relationship between AI capabilities and cyborg capabilities is very useful for forecasting AI timelines. We simply don’t have good...

May 1923

Eye You's Shortform

May 175

Cyborg evals

The low-background steel problem: Modern steel is slightly radioactive. We did a lot of atomic testing in the 40s and 50s, and now our atmosphere has some amount of radioactive particles, which make their way into steel during production. This is mostly fine, but some scientific instruments require steel that...

Apr 3033

You're gonna need a bigger boat (benchmark), METR

[EDIT: LawrenceC, who works at METR, responds to this.] In this post, we’ll discuss three major problems with the METR eval and propose some solutions. Problem 1: The METR eval produces results with egregious confidence intervals, and the METR chart misleadingly hides this. Problem 2: There's a lack of sample...

Apr 1320

Pangram (AI detection software) can be evaded

Pangram is ostensibly very good. They claim their program detects all LLMs, outperforms trained humans, and has 99.98% accuracy. This means that Pangram correctly identifies AI-written text 99.98% of the time. They claim a very low false positive rate, somewhere between 0.01% and 0.16%. They claim that Pangram detects "humanized text",...

Mar 3023

PSA: Predictions markets often have very low liquidity; be careful citing them.

I see people repeatedly make the mistake of referencing a very low liquidity prediction market and using it to make a nontrivial point. Usually the implication when a market is cited is that its number should be taken somewhat seriously, that it's giving us a highly informed probability. Sometimes a...

Mar 16127

Eye You

PSA: Predictions markets often have very low liquidity; be careful citing them.

Why did I believe Oliver Sacks?

Epstein and my world model

Cyborg evals
