I'll have to think through this post more carefully later, but there's some recent work on approximate abstractions between causal models that I expect you'd be extremely interested in (if you aren't already aware): https://arxiv.org/abs/2207.08603
There are quite a few interesting dynamics in the space of possible values that become extremely relevant in worlds where 'perfect inner alignment' is impossible/incoherent/unstable.
In those worlds, it's important to develop forms of weak alignment, where successive systems might not be unboundedly corrigible but do still have semi-cooperative interactions (and transitions of power).
Yeah, intertemporal trust and coordination become hugely important. Lots of 'scalable alignment' strategies are relevant: recursively delegating yourself tasks, or summarizing your progress so far. An inhuman level of flexibility would also help: instantly grieving your old circumstances, then adapting to the new ones.
Can you be confident that your past self knew what they were doing when they dropped you in this situation? Or that your future selves will develop things the way you expect them to? You could choose to deliberately and repeatedly lie to yourse...
Multiscale agency, self-misalignment, and ecological basins of attraction? This sounds really excellent and targets a lot of the conceptual holes I worry about in existing approaches. I look forward to the work that comes out of this!!
I was reminded of a couple different resources you may or may not already be aware of.
For 'vertical' game theory, check out Jules Hedges' work on open/compositional games. https://arxiv.org/search/cs?searchtype=author&query=Hedges%2C+J
For aggregative alignment, there's an interesting literature on the topology of social c...
I suspect you'd enjoy The Dawn Of Everything, an anarchist-tinged anthropological survey of the different nonlinear paths stateless societies and state formation have taken. Or, well, it discusses a wide range of related topics, with lots of creativity and decent enough rigor. I haven't finished yet.
I do agree that states can be seen as a game-theoretic trap, though. Once you have some centralized social violence or institutional monopoly on power, for a huge range of goals the easiest way to achieve them becomes "get the state/king/local bigwig on your si...
The claim that scissor statements are dangerous is itself a scissor statement: I think it's obviously false, and will fight you over it. Social interaction is not that brittle. It is important to notice the key ruptures between people's values/beliefs. Disagreements do matter, in ways that sometimes rightly prevent cooperation.
World population is ~2^33, so 33 independent scissor statements would set you frothing in total war of everyone against everyone. Except people are able to fluidly navigate much, much higher levels of difference and complexity than t...
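To make the arithmetic behind that claim explicit, here's a minimal sketch (assuming a round ~8 billion population figure): each independent binary disagreement doubles the number of distinguishable opinion-profiles, so the number of statements needed to give every person a unique profile is just the base-2 log of the population.

```python
import math

# Rough world population (an assumed round figure, not an exact count).
population = 8_000_000_000

# Each independent "scissor" statement splits people into two camps.
# k independent binary splits distinguish at most 2**k distinct profiles,
# so the number of splits needed to individuate everyone is:
splits_needed = math.ceil(math.log2(population))
print(splits_needed)  # → 33
```

Which is the point of the comment: a few dozen binary ruptures are information-theoretically enough to isolate every individual, yet in practice people navigate far more dimensions of disagreement than that without total war.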
I expect you already know this, but, the role of activists is not the same as the role of experts, and that's okay. You will never know everything relevant to the situation you're hoping to intervene in. Even if you did, institutions ignore their own environmental experts all the time. Usually, you aren't there as some sort of policy consultant, you're there to pressure their interests into alignment with yours. Even if you have zero clue what other constraints they are balancing, it can still be reasonable to loudly voice your problems; you are yourself o...
Update from almost 3 years in the future: this stream of work has continued developing in a few different directions, both on the conceptual foundations and in some initial attempts to apply these tools to AI. Two recent works I was especially excited by (and their bibliographies): 'Towards a Grounded Theory of Causation for Embodied AI' (https://arxiv.org/abs/2206.13973, and here's an excellent talk by the author, https://youtu.be/5mZhcXhbciE), and 'Faithful, Interpretable Model Explanations via Causal Abstraction' (https://ai.stanford.edu/blog/causal-abstraction/).