LESSWRONG
LW

All of Lorxus's Comments + Replies

‘AI for societal uplift’ as a path to victory

To redteam, and in brief - what's the tale of why this won't have lead to a few very coordinated, very internally peaceful, mostly epistemically clean factions, each of which is kind of an echo chamber and almost all of which are wrong about something (or even just importantly mutually disagree on frames) in some crucial way, and which are at each other's throats?

Orienting Toward Wizard Power

Lorxus6d30

Reporting back in after having laid hands on some bupropion. I'm two days in and:

it's definitely doing anything helpful with respect to my senses of opportunity, pointfulness, allowedness/permission, and task initiation/switching/completion, among other subtle senses
it meshes just fine with my focus meds so far
the increase in anxiety has been real but thus far manageable
but also there's some weird effect where it's making me way more forgetful/absent-minded by way of that massively improved power at task-initiation and task-switching and sharpness of task-

... (read more)

Lighthaven Sequences Reading Group #38 (Tuesday 6/17)

Lorxus13d10

I am going to signpost the fact that in the absence of a scheduled event I would totally just jump in and pick some stuff for us to read and discuss tomorrow and post an event but I don't have a list of everything you've read so far.

3Garrett Baker13d

A post is coming! I will get future posts out earlier in the future, sorry.

Lighthaven Sequences Reading Group #37 (Tuesday 6/10)

Lorxus21d30

Got it. See you Tuesday!

Lighthaven Sequences Reading Group #37 (Tuesday 6/10)

Lorxus21d10

There is in fact a next event, right? I assume yes because event-chain description, but that still leaves me without knowledge of the reading.

2Garrett Baker21d

There will be! I haven’t decided on the readings yet though, sorry.

Truth or Dare

Lorxus1mo362

Thank you for writing this! A lot of it felt "obviously true" to me in a way that I find pleasant and at this point consider a mark of quality explication of everyday topics which are easy to misconstrue badly. There were maybe half a dozen points where I went "yup that absolutely tracks with my own experience and what I know of others' ".

Most notably, the way in which some behaviors/frames/flinches propagate themselves by bending people along their creases; I saw the gears-level punchline for "why do people develop and dissimulate like this" coming a coup... (read more)

Orienting Toward Wizard Power

Lorxus1mo30

Fabricate suggests otherwise. What if you just feel like maxing out mundane economicmagic of which Prestidigitation is the example par excellence?

Orienting Toward Wizard Power

Lorxus2mo50

I read this pretty soon after you posted it and have been thinking about it a lot about it in snatches ever since. I think it's pointing at something important. Here's a few things adjacent to it, or important components of it, or something like that. I think you will find them useful, @johnswentworth .

A belief in a Tomorrow which is worth living in, such that set-up actions are worth taking that wouldn't otherwise be worth at all.
A belief that you can, in fact, just do things - do new things, do things that play off of old things, do things for yourself t

... (read more)

johnswentworth's Shortform

Lorxus3mo10

Say more about "ray-tracing"? What does that look like? And do you have a bullshit-but-useful PCA-flavored breakdown of those few dimensions of variation?

Some Rules for an Algebra of Bayes Nets

Lorxus3moΩ010

What are "latential" string diagrams?

Just the word I use to describe "thing that has something to do with (natural) latents". More specifically for this case: a string diagram over a Markov category equipped with the extra stuff, structure, and properties that a Markov category needs to have in order to faithfully depict everything we care about when we want to write down statements or proofs about Bayes nets which might transform in some known restricted ways.

What does it it mean that you can't derive them all from a "fixed" set? Do you imagine some stron

... (read more)

4Vanessa Kosoy3mo

I think that there are two different questions which might be getting mixed up here: Question 1: Can we fully classify all rules for which sets of Bayes nets imply other Bayes nets over the same variables? Naturally, this is not a fully rigorous question, since "classify" is not a well-defined notion. One possible operationalization of this question is: What is the computational complexity of determining whether a given Bayes net follows from a set of other Bayes nets? For example, if there is a set of basic moves that generate all such inferences then the problem is probably in NP (at least if the number of required moved can also be bounded). Question 2: What if we replace "Bayes nets" by something like "string diagrams in a Markov category"? Then there might be less rules (because maybe some rules hold for Bayes nets but not in the more abstract setting).

Some Rules for an Algebra of Bayes Nets

Lorxus3moΩ130

that's not a thing that really happens?
What is the thing that doesn't happen? Reading the rest of the paragraph only left me more confused.

Somehow being able to derive all relevant string diagram rewriting rules for latential string diagrams, starting with some fixed set of equivalences? Maybe this is just a framing thing, or not answering your question, but I would expect to need to explicitly assume/include things like the Frankenstein rule - more generally, picking which additional rewriting rules you want to use/include/allow is something you do in ord... (read more)

2Vanessa Kosoy3mo

What are "latential" string diagrams? What does it it mean that you can't derive them all from a "fixed" set? Do you imagine some strong claim e.g. that the set of rewriting rules being undecidable, or something else? Okay, so this is not what you care about? Maybe you are saying the following: Given two diagrams X,Y, we want to ask whether any distribution compatible with X is compatible with Y. We don't ask whether the converse also holds. This is a certain asymmetric relation, rather than an equivalence.

Some Rules for an Algebra of Bayes Nets

Lorxus3mo*Ω230

Hi! I spent some time working on exactly this approach last summer at MATS, and since then have kept trying to work out the details. It's been going slowly but parts of it have been going at all.

My take regarding your and @Alexander Gietelink Oldenziel's comments below is - that's not a thing that really happens? You pick the rules and your desired semantics first, and then you write down your manipulation rules to reflect those. Maybe it turns out that you got it wrong and there's more (or fewer!) rules of derivation to write down, but as long as th... (read more)

2Vanessa Kosoy3mo

I found the above comment difficult to parse. What is the thing that doesn't happen? Reading the rest of the paragraph only left me more confused. What do you mean by "Markov equivalence class"?

LoganStrohl's Shortform

Lorxus3mo10

Did you ever get back to reading this? I think I got some very different things out of it when I read through! (And @whatstruekittycat will talk your ear off about it, among other topics.)

Steelmanning Divination

Lorxus3mo52

I think you maybe miss an entire branch of the tech-tree here I consider important - the bit about the Lindy case of divination with a coin-flip and checking your gut. It doesn't stop at a single bit in my experience; it's something you can use more generally to get your own read on some situation much less filtered by masking-type self-delusion. At the absolute least, you can get a "yes/no/it's complicated" out of it pretty easily with a bit more focusing!

I claim that divination^[1] is specifically a good way for routing around the worried self-... (read more)

Proselytizing

Lorxus3mo30

Got it, thanks. I'll see if I can figure out who that was or where to find that claim. Cheers.

Proselytizing

Lorxus3mo10

Maybe this is the right place to ask/discuss this, and maybe not - if it's not; say so and I'll stop.

IIRC you (or maybe someone once mentioned hearing about people who try to [experience the first jhana]^[1] and then feeling pain as a result, and that you didn't really understand why that happened. There was maybe also a comment about "don't do that, that sounds like you were doing it wrong".

After some time spent prodding at myself and pulling threads and seeing where they lead... I am not convinced that they were doing it wrong at all. There's a kind ... (read more)

2lsusr3mo

I feel like this is the wrong place for your comment. Your comment is a response to a claim someone (maybe me) made at a place on the Internet other than this blog post. I believe that other place is where your comment should go.

Lorxus's Shortform

Lorxus3mo41

Here's a game-theory game I don't think I've ever seen explicitly described before: Vicious Stag Hunt, a two-player non-zero-sum game elaborating on both Stag Hunt and Prisoner's Dilemma. (Or maybe Chicken? It depends on the obvious dials to turn. This is frankly probably a whole family of possible games.)

The two players can pick from among 3 moves: Stag, Hare, and Attack.

Hunting stag is great, if you can coordinate on it. Playing Stag costs you 5 coins, but if the other player also played Stag, you make your 5 coins back plus another 10.

Hunting hare is fi... (read more)

Lorxus's Shortform

Lorxus3mo252

A snowclone summarizing a handful of baseline important questions-to-self: "What is the state of your X, and why is that what your X's state is?" Obviously also versions that are less generally and more naturally phrased, that's just the most obviously parametrized form of the snowclone.

Classic(?) examples:
"What do you (think you) know, and why do you (think you) know it?" (X = knowledge/belief)
"What are you doing, and why are you doing it?" (X = action(-direction?)/motivation?)

Less classic examples that I recognized or just made up:
"How do you feel, and w... (read more)

Richard_Kennaway3mo*101

"What is the state and progress of your soul, and what is the path upon which your feet are set?" (X = alignment with yourself) I affected a quasi-religious vocabulary, but I think this has general application.

"What are you trying not to know, and why are you trying not to know it?" (X = self-deceptions)

The principle of genomic liberty

Lorxus4mo105

I think this post is pretty cool, and represents good groundwork on sticky questions of bioethics and the principles that should underpin them that most people don't think about very hard. Thanks for writing it.

Lorxus's Shortform

Lorxus4mo10

The phrasing I got from the mentor/research partner I'm working with is pretty close to the former but closer in attitude and effective result to the latter. Really, the major issue is that string diagrams for a flavor of category and commutative diagrams for the same flavor of category are straight-up equivalent, but explicitly showing this is very very messy, and even explicitly describing Markov categories - the flavor of category I picked as likely the right one to use, between good modelling of Markov kernels and their role doing just that for causal ... (read more)

Lorxus's Shortform

Lorxus4mo10

I guess? I mean, there's three separate degrees of "should really be kept contained"-ness here:

Category theory -> string diagrams, which pretty much everyone keeps contained, including people who know the actual category theory
String diagrams -> Bayes nets, which is pretty straightforward if you sit and think for a bit about the semantics you accept/are given for string diagrams generally and maybe also look at a picture of generators and rules - not something anyone needs to wrap up nicely but it's also a pretty thin
[Causal theory/Bayes net] string

... (read more)

2Gurkenglas4mo

I just meant the "guts of the category theory" part. I'm concerned that anyone says that it should be contained (aka used but not shown), and hope it's merely that you'd expect to lose half the readers if you showed it. I didn't mean to add to your pile of work and if there is no available action like snapping a photo that takes less time than writing the reply I'm replying to did, then disregard me.

Equations Mean Things

Lorxus4mo30

Not much to add apart from "this is clean and really good, thanks!".

Lorxus's Shortform

Lorxus4mo40

I promise I am still working on working out all the consequences of the string diagram notation for latential Bayes nets, since the guts of the category theory are all fixed (and can, as a mentor advises me, be kept out of the public eye as they should be). Things can be kept (basically) purely in terms of string diagrams. In whatever post I write, they certainly will be.

I want to be able to show that isomorphism of natural latents is the categorical property I'm ~97% sure it is (and likewise for minimal and maximal latents). I need to sit myself down and ... (read more)

6Gurkenglas4mo

give me the guts!!1 don't polish them, just take a picture of your notes or something.

Lorxus's Shortform

Lorxus4mo10

Because RLHF works, we shouldn't be surprised when AI models output wrong answers which are specifically hard for humans to distinguish from a right answer.

This observably (seems like it) generalizes to all humans, instead of (say) it being totally trivial somehow to train an AI on feedback only from some strict and distinguished subset of humanity such that any wrong answers it produced could be easily spotted by the excluded humans.

Such wrong answers which look right (on first glance) also observably exist, and we should thus expect that if there's anyth... (read more)

Report & retrospective on the Dovetail fellowship

Lorxus4mo10

Sent you an admonymous.

Lorxus's Shortform

Lorxus4mo30

(Random thought I had and figured this was the right place to set it down:) Given how centally important token-based word embeddings as to the current LLM paradigm, how plausible is it that (put loosely) "doing it all in Chinese" (instead of English) is actually just plain a more powerful/less error-prone/generally better background assumption?

Associated helpful intuition pump: LLM word tokenization is like a logographic writing system, where each word corresponds to a character of the logography. There need be no particular correspondence between the form... (read more)

johnswentworth's Shortform

Lorxus4mo30

As someone who does both data analysis and algebraic topology, my take is that TDA showed promise but ultimately there's something missing such that it's not at full capacity. Either the formalism isn't developed enough or it's being consistently used on the wrong kinds of datasets. Which is kind of a shame, because it's the kind of thing that should work beautifully and in some cases even does!

johnswentworth's Shortform

Lorxus4mo30

I imagine it's something like "look for things that are notably absent, when you would expect them to have been found if there"?

Internal music player: phenomenology of earworms

Lorxus8mo10

Do you know when you started experiencing having an internal music player? I recall that that started for me when I was about 6. Also, do you know whether you can deliberately pick a piece of music, or other nonmusical sonic experiences, to playback internally? Can you make them start up from internal silence? Under what conditions can you make them stop? Do you ever experience long stretches where you have no internal music at all?

1dkl97mo

The post answers most of that, except for the first question, for which my memories of childhood are too vague anyway, but it was surely before when I was 14.

Ayn Rand’s model of “living money”; and an upside of burnout

Lorxus8mo70

Sure - I can believe that that's one way a person's internal quorum can be set up. In other cases, or for other reasons, they might be instead set up to demand results, and evaluate primarily based on results. And that's not great or necessarily psychologically healthy, but then the question becomes "why do some people end up one way and other people the other way?" Also, there's the question of just how big/significant the effort was, and thus how big of an effective risk the one predictor took. Be it internal to one person or relevant to a group of humans, a sufficiently grand-scale noble failure will not generally be seen as all that noble (IME).

2Chris Lakin8mo

Why might it be set up like that? Seems potentially quite irrational. Veering into motivated reasoning territory here imo

Ayn Rand’s model of “living money”; and an upside of burnout

Lorxus8mo*102

This makes some interesting predictions re: some types of trauma: namely, that they can happen when someone was (probably even correctly!) pushing very hard towards some important goal, and then either they ran out of fuel just before finishing and collapsed, or they achieved that goal and then - because of circumstances, just plain bad luck, or something else - that goal failed to pay off in the way that it usually does, societally speaking. In either case, the predictor/pusher that burned down lots of savings in investment doesn't get paid off. This is maybe part of why "if trauma, and help, you get stronger; if trauma, and no help, you get weaker".

3Chris Lakin8mo

Maybe, but that also requires that the other group members were (irrationally) failing to consider that the “attempt could've been good even if the luck was bad”. In human groups, people often do gain (some) reputation for noble failures (is this wrong?)

D&D.Sci Coliseum: Arena of Data Evaluation and Ruleset

Lorxus8mo10

I didn't enjoy this one as much, but that's likely down to not having had the time/energy to spend on thinking this through deeply. That said... I did not in fact enjoy it as much and I mostly feel like garbage for having done literally worse than chance, and I feel like it probably would have been better if I hadn't participated at all.

4aphyer8mo

I don't think you should feel bad about that! This scenario was pretty complicated and difficult, and even if you didn't solve it I think "tried to solve it but didn't quite manage it" is more impressive than "didn't try at all"!

Some Rules for an Algebra of Bayes Nets

Lorxus8mo30

Let me see if I've understood point 3 correctly here. (I am not convinced I have actually found a flaw, I'm just trying to reconcile two things in my head here that look to conflict, so I can write down a clean definition elsewhere of something that matters to me.)

$P$ factors over $G$ . In $G$ , $X_{j}, X_{k}$ were conditionally independent of each other, given $X_{p a^{G} (k)}$ . Because $P$ factors over $G$ and because in $G$ , $X_{j}, X_{k}$ were conditionally independent of each other, given $X_{p a^{G} (k)}$ , we can very straightforwar... (read more)

Some Rules for an Algebra of Bayes Nets

Lorxus8mo30

We’ll refer to these as “Bookkeeping Rules”, since they feel pretty minor if you’re already comfortable working with Bayes nets. Some examples:
We can always add an arrow to a diagram (assuming it doesn’t introduce a loop), and the approximation will get no worse.

Here's something that's kept bothering me on and off for the last few months: This graphical rule immediately breaks Markov equivalence. Specifically, two DAGs are Markov-equivalent only if they share an (undirected) skeleton. (Lemma 6.1 at the link.)

If the major/only thing we care about here regar... (read more)

8DanielFilan8mo

A way I'd phrase John's sibling comment, at least for the exact case: adding arrows to a DAG increases the set of probability distributions it can represent. This is because the fundamental rule of a Bayes net is that d-separation has to imply conditional independence - but you can have conditional independences in a distribution that aren't represented by a network. When you add arrows, you can remove instances of d-separation, but you can't add any (because nodes are d-separated when all paths between them satisfy some property, and (a) adding arrows can only increase the number of paths you have to worry about and (b) if you look at the definition of d-separation the relevant properties for paths get harder to satisfy when you have more arrows). Therefore, the more arrows a graph G has, the fewer constraints distribution P has to satisfy for P to be represented by G.

3johnswentworth8mo

Proof that the quoted bookkeeping rule works, for the exact case: * The original DAG G asserts P[X]=∏iP[Xi|XpaG(i)] * If G′ just adds an edge from j to k, then G′ says P[X]=P[Xk|XpaG(k),Xj]∏i≠kP[Xi|XpaG(i)] * The original DAG's assertion P[X]=∏iP[Xi|XpaG(i)] also implies P[Xk|XpaG(k),Xj]=P[Xk|XpaG(k)], and therefore implies G′'s assertion P[X]=P[Xk|XpaG(k),Xj]∏i≠kP[Xi|XpaG(i)]. The approximate case then follows by the new-and-improved Bookkeeping Theorem. Not sure where the disconnect/confusion is.

D&D Sci Coliseum: Arena of Data

Lorxus8mo50

I'm going to start by attacking this a little on my own before I even look much at what other people have done.

Some initial observations from the SQL+Python practice this gave me a good excuse to do:

Adelon looks to have rough matchups against Elf Monks. Which we don't have. They are however soft to even level 3-4 challengers sometimes. Maybe Monks and/or Fencers have an edge on Warriors?
Bauchard seems to have particularly strong matchups against other Knights, so we don't send Velaya there. They seem a little soft to Monks and to Dwarf Ninjas and especiall

... (read more)

Why Large Bureaucratic Organizations?

Lorxus10mo30

I'm gonna leave my thoughts on the ramifications for academia, where a major career step is to repeatedly join and leave different large bureaucratic organizations for a decade, as an exercise to the reader.

Like, in a world where the median person is John Wentworth (“Wentworld”), I’m pretty sure there just aren’t large organizations of the sort our world has.

I have numerous thoughts on how Lorxusverse Polity handles this problem but none of it is well-worked out enough to share. In sum though: Probably cybernetics (in the Beer sense) got discovered way... (read more)

Shifting Headspaces - Transitional Beast-Mode

Lorxus10mo10

Sure, but you obviously don't (and can't even in principle) turn that up all the way! The key is to make sure that that mode still exists and that you don't simply amputate and cauterize it.

1M. Y. Zuo10mo

A ‘beast mode’ that no reader of LW will likely ever experience for even a full hour continuously is hardly a ‘mode’ is it? There are other terms for such phenomena.

Wei Dai's Shortform

Lorxus10mo10

[2.] maybe one could go faster by trying to more directly cleave to the core philosophical problems.
...
An underemphasized point that I should maybe elaborate more on: a main claim is that there's untapped guidance to be gotten from our partial understanding--at the philosophical level and for the philosophical level. In other words, our preliminary concepts and intuitions and propositions are, I think, already enough that there's a lot of progress to be made by having them talk to each other, so to speak.

OK but what would this even look like?\gen

Toss away ... (read more)

2TsviBT10mo

From scratch but not from scratch. https://www.lesswrong.com/posts/noxHoo3XKkzPG6s7E/most-smart-and-skilled-people-are-outside-of-the-ea?commentId=DNvmP9BAR3eNPWGBa https://tsvibt.blogspot.com/2023/09/a-hermeneutic-net-for-agency.html

Wei Dai's Shortform

Lorxus10mo10

Clearly academia has some blind spots, but how big? Do I just have a knack for finding ideas that academia hates, or are the blind spots actually enormous?

From someone who left a corner of it: the blindspots could be arbitrarily large as far as I know, because there seemed to me to be no real explicit culture of Hamming questions/metalooking for anything neglected. You worked on something vaguely similar/related to your advisor's work, because otherwise you can't get connections to people who know how to attack the problem.

Wei Dai's Shortform

Lorxus10mo10

As my reacts hopefully implied, this is exactly the kind of clarification I needed - thanks!

Like, bro, I'm saying it can't think. That's the tweet. What thinking is, isn't clear, but That thinking is should be presumed, pending a forceful philosophical conceptual replacement!

Sure, but you're not preaching to the choir at that point. So surely the next step in that particular dance is to stick a knife in the crack and twist?

That is -

"OK, buddy:
Here's property P (and if you're good, Q and R and...) that [would have to]/[is/are obviously natural and des

Lorxus10mo10

> https://www.lesswrong.com/posts/r7nBaKy5Ry3JWhnJT/announcing-iliad-theoretical-ai-alignment-conference#whqf4oJoYbz5szxWc

you didn't invite me so you don't get to have all the nice things, but I did leave several good artifacts and books I recommend lying around. I invite you to make good use of them!

2Alexander Gietelink Oldenziel10mo

Thank you Lorxus, that's appreciated. I'm sure we can make good use of them. Unfortunately, we get many more applications than we have spots so we have to make some tough choices. Better luck next time!

Dialogue on What It Means For Something to Have A Function/Purpose

Lorxus11mo32

(Minor quibble: I’d be careful about using “should” here, as in “the heart should pump blood”, because “should” is often used in a moral sense. For instance, the COVID-19 spike protein presumably has some function involving sneaking into cells, it “should” do that in the teleological sense, but in the moral sense COVID-19 “should” just die out. I think that ambiguity makes a sentence like “but it might be another thing to say, that the heart should pump blood” sound deeper/more substantive than it is, in this context.

This puts me in mind of what ... (read more)

Shifting Headspaces - Transitional Beast-Mode

Lorxus11mo60

To paraphrase:

Want and have. See and take. Run and chase. Thirst and slake. And if you're thwarted in pursuit of your desire… so what? That's just the way of things, not always getting what you hunger for. The desire itself is still yours, still pure, still real, so long as you don't deny it or seek to snuff it out.

-3M. Y. Zuo11mo

In this sense, no one who is alive in a modern city for longer than a day could possibly be in ‘beast mode’. Because they would have stepped in front of a bus/truck chasing something, and gotten wrecked and therefore would no longer exist. Nor could anyone enter ‘beast mode’ for any sustained period of time, and still remain alive.

tlevin's Shortform

Lorxus11mo10

@habryka Forgot to comment on the changes you implemented for soundscape at LH during the mixer - possibly you may want to put a speaker in the Bayes window overlooking the courtyard firepit. People started congregating/pooling there (and notably not at the other firepit next to it!) because it was the locally-quietest location, and then the usual failure modes of an attempted 12-person conversation ensued.

A rough and incomplete review of some of John Wentworth's research

Lorxus11mo90

any finite-entropy function $f (X)$

Uh...

$\forall x \in X,$ $P [X = x] > 0$ .
$\sum_{X} P [X = x] = 1.$
By "oh, no, the $(σ)$ s have to be non-repeating", $| X | \geq | N | .$ Thus by the nth term test, $\forall ϵ > 0,$ $\exists x \in X : P [X = x] < ϵ$ .
By properties of logarithms, $- log (P [X = x])$ has no upper bound over $x \in X$ . In particular, $- P [X = x] log (P [X = x])$ has no upper bound over $x \in X$ .
I'm not quite clear on how @johnswentworth defines a "finite-entropy function", but whichever reasonable way he does that, I'm pretty sure that the above means that the set of

... (read more)

We’re not as 3-Dimensional as We Think

Lorxus11mo20

most of them are small and probably don’t have the mental complexity required to really grasp three dimensions

Foxes and ferrets strike me as two obvious exceptions here, and indeed, we see both being incredibly good at getting into, out of, and around spaces, sometimes in ways that humans might find unexpected.

Davidad's Bold Plan for Alignment: An In-Depth Explanation

Lorxus1y10

, and here]

This overleaf link appears to be restricted-access-only?

tlevin's Shortform

Lorxus1y30

As someone who's spent meaningful amounts of time at LH during parties, absolutely yes. You successfully made it architecturally awkward to have large conversations, but that's often cashed out as "there's a giant conversation group in and totally blocking [the Entry Hallway Room of Aumann]/[the lawn between A&B]/[one or another firepit and its surrounding walkways]; that conversation group is suffering from the obvious described failure modes, but no one in it is sufficiently confident or agentic or charismatic to successfully break out into a subgrou... (read more)

1Lorxus11mo

Whiteboard Pen Magazines are Useful

Lorxus1y10

I liked this post so much that I made my own better Lesser Scribing Artifact and I'm preparing a post meant to highlight the differences between my standard and yours. Cheers!

Koan: divining alien datastructures from RAM activations

Lorxus1y10

Why do you need to be certain? Say there's a screen showing a nice "high-level" interface that provides substantial functionality (without directly revealing the inner workings, e.g. there's no shell). Something like that should be practically convincing.

Then whatever that's doing is a constraint in itself, and I can start off by going looking for patterns of activation that correspond to e.g. simple-but-specific mathematical operations that I can actuate in the computer.

I'm unsure about that, but the more pertinent questions are along the lines of "i

... (read more)

2TsviBT1y

It's an interesting different strategy, but I think it's a bad strategy. I think in the analogy this corresponds to doing something like psychophysics, or studying the algorithms involved in grammatically parsing a sentence; which is useful and interesting in a general sense, but isn't a good way to get at the core of how minds work. (I don't understand the basic logic here--probably easier to chat about it later, if it's a live question later.)