sen

Replying toSlack matters more than any outcome

I think you're missing an important edge case where all of your resolved subsystems are in agreement that their collective desires are simultaneously compatible and unattainable without enormous amounts of motivation, which is something that an arms race can provide. Adaptation isn't just about spinning cycles and causing stress. It does have actual tangible outcomes, and not all of those outcomes are bad. Though I think for most people, your advice is probably close enough to the right advice.

Replying toCognitive Emulation: A Naive AI Safety Proposal

sen3y

Cognitive Emulation: A Naive AI Safety Proposal

Thank you. You phrased the concerns about "integrating with a bigger picture" better than I could. To temper the negatives, I see at least two workable approaches, plus a framing for identifying more workable approaches.

Enable other safety groups to use and reproduce Conjecture's research on CogEms so those groups can address more parts of the "bigger picture" using Conjecture's findings. Under this approach, Conjecture becomes a safety research group, and the integration work of turning that research into actionable safety efforts becomes someone else's task.
Understand the societal motivations for taking short-term steps toward creating dangerous AI, and demonstrate that CogEms are better suited for addressing those motivations, not just the motivations of

... (read 353 more words →)

Replying toCognitive Emulation: A Naive AI Safety Proposal

sen3y*

Cognitive Emulation: A Naive AI Safety Proposal

What interfaces are you planning to provide that other AI safety efforts can use? Blog posts? Research papers? Code? Models? APIs? Consulting? Advertisements?

A foundation model approach to value inference

sen

Epistemic status: shower thoughts.

I'm going to write this out as a pseudo-proof. Please pardon the lack of narrative structure. Conceptually, I'm splitting the problem of value inference into three sub-problems:

Finding a "covering set" of all causal implications of a person's values. The goal here is to describe a concrete "values" dataset. Modeling that dataset should be sufficient to model values.
Creating a model of that covering set. The goal here is to show that it is feasible to model values, along with a bunch of other stuff that we eventually want to separate out.
Factoring the model to separate the effects of values from the effects of other variables. The goal is to show

... (read 851 more words →)

Replying to[Link] Wavefunctions: from Linear Algebra to Spinors

sen3y

[Link] Wavefunctions: from Linear Algebra to Spinors

Ah. Thank you, that is perfectly clear. The Wikipedia page for Scalar Field makes sense with that too. A scalar field is a function that takes values in some canonical units, and so it transforms only on the right of f under a perspective shift. A vector field (effectively) takes values both on and in the same space, and so it transforms both on the left and right of v under a perspective shift.

I updated my first reply to point to yours.

Replying to[Link] Wavefunctions: from Linear Algebra to Spinors

sen3y

[Link] Wavefunctions: from Linear Algebra to Spinors

Reading the wikipedia page on scalar field, I think I understand the confusion here. Scalar fields are supposed to be invariant under changes in reference frame assuming a canonical coordinate system for space.

Take two reference frames P(x) and G(x). A scalar field S(x) needs to satisfy:

S(x) = P'(x)S(x)P(x) = G'(x)S(x)G(x)
Where P'(x) is the inverse of P(x) and G'(x) is the inverse of G(x).

Meaning the inference of S(x) should not change with reference frame. A scalar field is a vector field that commutes with perspective transformations. Maybe that's what you meant?

I wouldn't use the phrase "transforms trivially" here since a "trivial transformation" usually refers to the identity transformation. I wouldn't use a head... (read more)

Replying to[Link] Wavefunctions: from Linear Algebra to Spinors

sen3y

[Link] Wavefunctions: from Linear Algebra to Spinors

Interesting. That seems to contradict the explanation for Lie Algebras, and it seems incompatible with commutators in general, since with commutators all operators involved need to be compatible with both composition and precomposition (otherwise AB - BA is undefined). I guess scalar fields are not meant to be operators? That doesn't quite work since they're supposed used to describe energy, which is often represented as an operator. In any case, I'll have to keep that in mind when reading about these things.

Replying to[Link] Wavefunctions: from Linear Algebra to Spinors

sen3y

[Link] Wavefunctions: from Linear Algebra to Spinors

Thanks for the explanation. I found this post that connects your explanation to an explanation of the "double cover." I believe this is how it works:

Consider a point on the surface of a 3D sphere. Call it the "origin".
From the perspective of this origin point, you can map every point of the sphere to a 2D coordinate. The mapping works like this: Imagine a 2D plane going through the middle of the sphere. Draw a straight line (in the full 3D space) from the selected origin to any other point on the sphere. Where the line crosses the plane, that's your 2D vector representation of the other point. Under this visualization, the

... (read 379 more words →)

Replying to[Link] Wavefunctions: from Linear Algebra to Spinors

sen3y*

[Link] Wavefunctions: from Linear Algebra to Spinors

EDIT: This post is incorrect. See the reply chain below. After correcting my misunderstanding, I agree with your explanation.

The difference you're describing between vector fields and scalar fields, mathematically, is the difference between composition and precomposition. Here it is more precisely:

Pick a change-of-perspective function P(x). The output of P(x) is a matrix that changes vectors from the old perspective to the new perspective.
You can apply the change-of-perspective function either before a vector field V(x) or after a vector field. The result is either V(x)P(x) or P(x)V(x).
If you apply P(x) before, the vector field applies a flow in the new perspective, and so its arrows "tilt with your head."
If you apply P(x) after,

... (read more)

-2

Replying to[Link] Wavefunctions: from Linear Algebra to Spinors

sen3y

[Link] Wavefunctions: from Linear Algebra to Spinors

In the 2D matrix representation, the basis element corresponding to the real part of a quaternion is the identity matrix. So scaling the real part results in scaling the (real part of the) diagonal of the 2D matrix, which corresponds to a scaling operation on the spinor. It incidentally plays the same role on 3D objects: it scales them. Plus, it plays a direct role in rotations when it's -1 (180 degree rotation) or 1 (0 degree rotation). Same as with i, j, and k, the exact effect of changing the real part of the quaternion isn't obvious from inspection when it's summed with other non-zero components. For example, it's hard to... (read more)

[Link] Wavefunctions: from Linear Algebra to Spinors

sen

I wrote this blogpost because I thought it took an excessive amount of digging to understand what a spinor was. My original motivation was to understand wavefunctions more concretely since I recently discovered that wavefunctions are spinor-valued, not (necessarily) complex-valued. That took me down a rabbit hole of gamma matrices, geometric algebra, quaternions, and about a dozen other topics.

I think physics is taught very badly. Modern physical theories are built on some very heavy and very powerful mathematical machinery. That machinery is absolutely worth learning, but expositions on physical phenomena seem to have no middle ground between "breadth-first" (require all the background before being able to understand anything), "assembly-level" (discuss the raw... (read more)

Replying toAll AGI Safety questions welcome (especially basic ones) [~monthly thread]

sen3y

All AGI Safety questions welcome (especially basic ones) [~monthly thread]

I don't know why other people say it, but I can explain why it's nice to say it.

log P(x) behaves nicely in comparison to P(x) when it comes to placing iterated bets. When you maximize P(x), you're susceptible to high risk high reward scenarios, even when they lead to failure with probability arbitrarily close to 1. The same is not true when maximizing log P(x). I'm cheating here since this only really makes sense when big-P refers to "principal" (i.e., the thing growing or shrinking with each bet) rather than "probability".
p(x) doesn't vary linearly with the controls we typically have, so calculus intuition tends to break down when used to optimize p(x).

sen

It's about the dangers of unrestrained escalation. It's a failure of humanity that we even got to the point where a few people's small and difficult decisions decided the fate of so much. This post is a friendly reminder to...

Please avoid escalating socio-economic disagreements to nuclear levels.
Please avoid escalating market competitions to Moloch levels.
Please avoid escalating paperclippers to superintelligent levels.

Learn to recognize signs of unrestrained escalation. You may be participating in unrestrained escalation...

If your information landscape becomes increasingly narrowed by a centralized source or by recommender algorithms.
If you attempt to fight over-escalated issues in a way that encourages opposing parties to fight back. For example, if you try to overtly starve the

... (read more)

Canonical forms

sen

This post in the link gives an intuitive connection between canonical forms in mathematics and easy-to-understand examples in day-to-day conversations. It was motivated by the following question I posed to myself:

Why do mathematicians value canonical forms?

I think this addresses an important class of communication problems that people experience, and that's especially true of intelligent people that see the world through the lens of specialized knowledge. It's easy to trap yourself inside your head when you learn to (usefully!) pile on the complexity. Canonical forms offer one strategy for escape.

LESSWRONG
LW

LESSWRONG
LW

Petrov Day is not about unilateral action

[Link] Wavefunctions: from Linear Algebra to Spinors

Canonical forms

A foundation model approach to value inference

sen

A foundation model approach to value inference

[Link] Wavefunctions: from Linear Algebra to Spinors

Petrov Day is not about unilateral action

Canonical forms

sen

Petrov Day is not about unilateral action

[Link] Wavefunctions: from Linear Algebra to Spinors

Canonical forms

A foundation model approach to value inference

sen

A foundation model approach to value inference

[Link] Wavefunctions: from Linear Algebra to Spinors

Petrov Day is not about unilateral action

Canonical forms