Planning to build a cryptographic box with perfect secrecy
Summary

Since September 2023, I have been learning a lot of math and programming in order to develop the safest cryptographic box in the world (and yes, I am aiming high). In these four months, I have learned some important things you may want to know:

* Fully Homomorphic Encryption (FHE) schemes with perfect secrecy do exist.
* These FHE schemes do not need any computational assumption.
* These FHE schemes are tractable (in the worst case, running a program under encryption makes it only three times slower).
* We can therefore run infinitely dangerous programs without obtaining any information about them or their outputs. This may be useful in order to run a superintelligence without destroying the world.
* However, these schemes work only on quantum computers.

In this post, I will first explain how I learned about this FHE scheme, then present my plan for building this cryptographic box, and finally mention some ethical concerns about it.

Before reading this post, I recommend you read this post by Paul Christiano, along with the comments that go with it. They are very informative, and they sharpened my views for this project. Paul Christiano presents a way to extract a friendly AI from an unfriendly one. Since this is only one example of what can be done with a cryptographic box, I will mostly consider cryptographic boxes as a solution to a problem that I call the malign computation problem.

Introduction

In August 2022, I started reading AGI Safety Literature Review. At one point, the authors write:

> One way to box an AGI is to homomorphically encrypt it. Trask (2017) shows how to train homomorphically encrypted neural networks. By homomorphically encrypting an AGI, its predictions and actions also come out encrypted. A human operator with the secret key can choose to decrypt them only when he wants to.

When I read this for the first time, I told myself that I should check out this work, because it seemed important. And