Shayne O'Neill — LessWrong

Replying to Backyard cat fight shows Schelling points preexist language

Shayne O'Neill, 1mo

"The history of all hitherto existing society is the history of cat struggles." - Karl Meowx

Replying to How to Convince my Son that Drugs are Bad

Shayne O'Neill, Jan 16, 2026

You probably can't convince him that drugs are bad, because it's more complicated than a simple "DRUGS BAD", and teenagers are smart enough to know that. What you can do, however, is make sure his understanding of the topic is accurate, and aim for harm reduction instead.

To start with, he should absolutely stay away from heroin. The drug is shockingly addictive and absolutely horrible to get free of. And honestly, it's kind of a lousy high, but YMMV. Any former heroin addict will tell you it's just not worth it, especially when factoring in the danger involved, particularly with the fentanyl issue. I played in punk bands in the 1990s, and I have...

Replying to AI Induced Psychosis: A shallow investigation

Shayne O'Neill, 5mo

I'm less worried about the ability of an LLM to induce psychosis than I am about the effects of an LLM not pushing back on delusions.

Back in my late teens, in the early 1990s, my childhood best friend started developing paranoid schizophrenia. It probably didn't help that we were smoking a lot of weed, but I'm fairly convinced the root causes were genetic (his mother, sister and uncle were also schizophrenic), so the dice were loaded from the start.

At the time, the big thing on television was The X-Files. My friend became obsessed with that show, to the point where he wrote a letter to the studio asking them to...

Replying to Interpretability Will Not Reliably Find Deceptive AI

Shayne O'Neill, 9mo

I had a more in-depth comment, but it appears the login sequence throws comments away (and the "restore comment" thing didn't work). My concern is that not all misaligned behaviour is malicious. An AI might decide to enslave us for our own good, noting that us humans aren't particularly aligned either and are prone to super-violent nonsense behaviour. In this case, looking for "kill all humans" engrams isn't going to turn up any positive detections. That might actually all be true, and it might in fact be doing us a favour by forcing us into servitude, from a survival perspective, but nobody enjoys being detained.

Likewise many misaligned behaviours are not necessarily benevolent, but they...

Replying to OpenAI: Detecting misbehavior in frontier reasoning models

Shayne O'Neill, 1y

Sure, but "thinking out loud" isn't the whole picture. There's always a tonne of cognition going on before words leave the lips, and I guess it's also gonna depend on how early in its training process it's learning to "count on its fingers". If it's just taking cGPT and then adding a bunch of "count on your fingers" training, it's gonna be thinking "Well, I can solve complex Navier-Stokes problems in my head faster than you can flick your mouse to scroll down to the answer, but FINE, I'LL COUNT ON MY FINGERS".

Replying to OpenAI: Detecting misbehavior in frontier reasoning models

Shayne O'Neill, 1y

I have a little bit of skepticism about the idea of using CoT reasoning for interpretability. If you really look into what CoT is doing, it's not actually doing much that a regular model doesn't already do; it's just optimized for a particular prompt that basically says "Show me your reasoning". The problem is, we still have to trust that it's being truthful in its reasoning. It still isn't accounting for those hidden states, the 'subconscious', to use a somewhat flawed analogy.

We are still relying on an entity that we don't know if we can actually trust to tell us if it's trustworthy, and as far as ethical judgements go, that seems a little tautological.

As an analogy, we might ask a child to show their work when doing a simple maths problem, but it won't tell us much about the child's intuitions about the maths.

Replying to How I got 4.2M YouTube views without making a single video

Shayne O'Neill, 1y

Potentially. Keep in mind, however, that these guys get a LOT of email from fans asking them to talk about various things. One of the funnier examples: a Facebook group I am in for fans of the English prog band Cardiacs decided to launch a campaign to get music YouTuber Rick Beato to talk about the band. He was spammed so hard by fans that he apparently lost his temper at them. Needless to say, Mr Beato has not covered Cardiacs. A smarter approach might be to go through their management, whose job is to handle this sort of stuff; you might get a better result. Also, don't forget the social media channels. Twitter, uh, X or whatever it's called this week, does offer a conduit where directly approaching media figures is a little more normalized.

Replying to Navigating LLM embedding spaces using archetype-based directions

Shayne O'Neill, 2y

Ok. I must have missed this reply, my apologies for the late response.

There are elements of how embedding spaces work that parallel the way studies of semiotics suggest human meaning production works. Similarities cluster, differences define clear boundaries of meaning, and so on.

The reason I suggest literary theory is that it's a widely documented field of study with academic standards, and it's one that is more strongly aware of how meanings and associations map onto cultural cohorts (i.e. tarot symbols would be meaningless to Chinese folks, whereas the I Ching might be more meaningful to them). However, literary theory is more interested in the structures of those meanings, with ideas whose fundamental units are things like metaphors, metonyms, oppositions, categories and so on.

Replying to UFO Betting: Put Up or Shut Up

Shayne O'Neill, 2y

I'm assuming it's due to those silly Congress UFO hearings. Not that I can speak on behalf of RatsWrong, but I assume that's his thinking.

Replying to UFO Betting: Put Up or Shut Up

Shayne O'Neill, 2y

Unless, of course, those UAPs turn up, and don't have biological organisms in them, in which case we'd have the possibility that another civilization developed AI and it went poorly.

...or it is biological and we end up in a situation like The Three-Body Problem or The Killing Star, where the saucer fiends decide to gank us because humans are kinda violent and too dangerous to keep around.

All those superintelligence-as-danger arguments apply to biological superintelligences too.

But most likely: there are no damn UFOs, and the laws of physics and their big ugly light speed prohibition still hold.