Gianluca Calcagni

All the Following are Distinct

In an artificial being, all the following: * Consciousness * Emotionality * Intelligence * Personality * Creativity * Volition are distinct properties that, in theory, may be activated independently. Let me explain. What I Hope to Achieve By publishing this post, I hope to develop and standardise some useful terms and concepts often found in many other posts. I also wish to explain in which sense some properties are linked to others - and, importantly, when they are not. I hope this post will be of help and provide a common context for future further discussions. What I am not trying to achieve is some universal definition of consciousness/intelligence/… simply because that’s too hard! To avoid controversies, I will focus on operational facets and nothing more[1]. I am still aware that the final result will be very opinionated, so feel free to challenge me and open my mind. The Basics Let’s suppose you walk around a new planet and you find some kind of system (artificial or natural - it doesn’t matter) that is able to “autonomously” execute general computations (digital, analog, quantum - it doesn’t matter, and it doesn’t matter which ones and why). Think of it as a universal Turing machine[2]: we’ll call such a system a computational being. Of course, I could just call it a computer - but the problem is that most people don’t recognise themselves as “computers”, hence the term has some semantic bias; also, would you consider an ecosystem as a “computer”? And yet an ecosystem can run computations, if you are inclined to interpret in that way the lifecycle of its inhabitants while they maintain (unknowingly) a stable equilibrium. So, let me use “computational being” as an umbrella term and as the foundation for the rest of the discussion. * Natural examples of computational beings: any living being is also a computational being. Selfish genes favour, by means of natural selection, computations that support their own reproduction. The competition betwee

16Aug 2, 2024

Gianluca Calcagni

Message

How Business Solved (?) the Human Alignment Problem

When I looked at mesa-optimization for the first time, my mind immediately associated it with a familiar problem in business: human individuals may not be “mesa optimizers” in a strict sense, but they can act as optimizers and they are expected to do so when a manager delegates (=base optimization)...

Dec 31, 2024-2

I Recommend More Training Rationales

Some time ago I happened to read the concept of training rationale described by Evan Hubinger, and I really liked it. In case you are not aware: training rationales are a bunch of questions that ML developers / ML teams should ask themselves in order to self-assess pros and cons...

Dec 31, 20242

Can AI Quantity beat AI Quality?

AI definitely poses an existential risk, in the sense that it can generate models with the hidden (possibly undetectable?) intention of competing against humanity for resources. The more intelligent the model, the higher its chance of success! The thought of an AI takeover is so scary that I won’t even...

Oct 2, 20242

An Opinionated Look at Inference Rules

If you ask around what are the typical ways to infer information, most people will answer: Deductions, Inductions, and Abductions. Of course, there are more ways than that, but there is no unified approach in their classification. I want to challenge that. The reason why I am unhappy with the...

Sep 3, 2024-5

All the Following are Distinct

Aug 2, 202416

Control Vectors as Dispositional Traits

I have been reading recently about a technique that can be used to partially control the behaviour of Large Language Models: the technique is exploiting control vectors[1] to alter the activation patterns of the LLMs and trigger some desired behaviour. While the technique does not provide guarantees, it gives high...

Jun 23, 202411

LESSWRONG
LW

LESSWRONG
LW

Gianluca Calcagni

Gianluca Calcagni

Gianluca Calcagni

All the Following are Distinct

Control Vectors as Dispositional Traits

Can AI Quantity beat AI Quality?

I Recommend More Training Rationales

Gianluca Calcagni

How Business Solved (?) the Human Alignment Problem

I Recommend More Training Rationales

Can AI Quantity beat AI Quality?

An Opinionated Look at Inference Rules

All the Following are Distinct

Control Vectors as Dispositional Traits

How Business Solved (?) the Human Alignment Problem

I Recommend More Training Rationales

Can AI Quantity beat AI Quality?

An Opinionated Look at Inference Rules

All the Following are Distinct

Control Vectors as Dispositional Traits

All the Following are Distinct

Control Vectors as Dispositional Traits

Can AI Quantity beat AI Quality?

I Recommend More Training Rationales