All of Ben Goldhaber's Comments + Replies

What are the bottlenecks preventing 10x-100x scaling of Control Evaluations?

  • I'm not confident in the estimates of the safety margin we get from internal-only evaluations - eliciting strong subversion performance from models seems very hard, which makes it difficult to get satisfactory estimates of their subversion capability against control protocols.
  • I'd feel more confident if we had thousands of people trying to create red-team models, while thousands of blue teams proposed different monitoring methods and control protocols.
  • The type of experiments describe
... (read more)

I think more leaders of orgs should be trying to shape their organizations' incentives and cultures around the challenges of "crunch time". Examples of this include:

  • What does pay look like in a world where cognitive labor is automated in the next 5 to 15 years? Are there incentive structures (impact equity, actual equity, bespoke deals for specific scenarios) that can help team members survive, thrive, and stay on target?
  • What cultural norms should the team have around AI-assisted work? On the one hand it seems necessary to accelerate safety progress, on the oth
... (read more)
4Guive
Some kind of payment for training data from applications like MSFT rewind does seem fair. I wonder if there will be a lot of growth in jobs where your main task is providing or annotating training data. 

This post was one of my first introductions to davidad's agenda, and it convinced me that while yes, it was crazy, it was maybe not impossible; it led me to work on initiatives like the multi-author manifesto you mentioned.

Thank you for writing it!

I would be very excited to see experiments with agent-based models (ABMs) where the agents model fleets of research agents and tools. I expect in the near future we can build pipelines where the current fleet configuration - which should be defined in something like the Terraform configuration language - automatically generates an ABM that is used for evaluation, control, and coordination experiments.
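Below is a minimal sketch of that generation step in Python - a hypothetical illustration, not a real pipeline: the fleet config is inlined as JSON rather than parsed from actual Terraform/HCL, and the agent dynamics are toy placeholders.

```python
import json
import random

# Hypothetical fleet configuration. In a real pipeline this would be
# parsed from the Terraform/HCL that actually provisions the fleet.
FLEET_CONFIG = json.loads("""
{
  "agents": [
    {"role": "researcher", "count": 3, "error_rate": 0.1},
    {"role": "monitor",    "count": 1, "catch_rate": 0.8}
  ],
  "steps": 100
}
""")

class SimAgent:
    """One simulated member of the fleet."""
    def __init__(self, role, params):
        self.role = role
        self.params = params

def build_abm(config):
    """Expand the declarative fleet config into a population of sim agents."""
    return [
        SimAgent(spec["role"], spec)
        for spec in config["agents"]
        for _ in range(spec["count"])
    ]

def run(config, seed=0):
    """Toy dynamics: researchers occasionally err; monitors catch some errors."""
    rng = random.Random(seed)
    agents = build_abm(config)
    uncaught = 0
    for _ in range(config["steps"]):
        # Each researcher independently produces an error this step.
        errors = sum(
            1 for a in agents
            if a.role == "researcher" and rng.random() < a.params["error_rate"]
        )
        # Each monitor independently catches each outstanding error.
        for a in agents:
            if a.role == "monitor":
                errors = sum(
                    1 for _ in range(errors)
                    if rng.random() > a.params["catch_rate"]
                )
        uncaught += errors
    return uncaught

print("uncaught errors over run:", run(FLEET_CONFIG))
```

The point is the shape of the pipeline - declarative config in, runnable ABM out - so the same source of truth drives both deployment and evaluation.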

  • Cumulative Y2K readiness spending was approximately $100 billion, or about $365 per U.S. resident.
  • Y2K spending started as early as 1995, and appears to have peaked in 1998 and 1999 at about $30 billion per year.

https://www.commerce.gov/sites/default/files/migrated/reports/y2k_1.pdf

Ah gotcha, yes let's do my $1k against your $10k.

3Zac Hatfield-Dodds
Locked in! Whichever way this goes, I expect to feel pretty good about both the process and the outcome :-)

Given your rationale I'm on board with requiring that 3 or more consistent physical instances of the lock have been manufactured.

Let's 'lock' it in.

2Zac Hatfield-Dodds
Nice! I look forward to seeing how this resolves. Ah, by 'size' I meant the stakes, not the number of locks - did you want to bet the maximum $1k against my $10k, or some smaller proportional amount?

@Raemon works for me, and I agree with the other conditions.

2Zac Hatfield-Dodds
I think we're agreed then, if you want to confirm the size? Then we wait for 2027!

This seems mostly good to me, thank you for the proposals (and sorry for my delayed response, this slipped my mind).

OR less than three consistent physical instances have been manufactured. (e.g. a total of three including prototypes or other designs doesn't count) 

Why this condition? It doesn't seem relevant to the core contention, and if someone prototyped a single lock using a GS AI approach but didn't figure out how to manufacture it at scale, I'd still consider it to have been an important experiment.

Besides that, I'd agree to the above conditions!

6Zac Hatfield-Dodds
I don't think that a thing you can only manufacture once is a practically usable lock; having multiple is also practically useful to facilitate picking attempts and in case of damage - imagine that a few hours into an open pick-this-lock challenge, someone bent a part such that the key no longer opens the lock. I'd suggest resolving neutral in this case as we only saw a partial attempt. Other conditions:
  • I think it's important that the design could have at least a thousand distinct keys which are non-pickable. It's fine if the theoretical keyspace is larger so long as the verified-secure keyspace is large enough to be useful, and distinct keys/locks need not be manufactured so long as they're clearly possible.
  • I expect the design to be available in advance to people attempting to pick the lock, just as the design principles and detailed schematics of current mechanical locks are widely known - security through obscurity would not demonstrate that the design is better, only that as-yet-secret designs are harder to exploit.
I nominate @raemon as our arbiter, if both he and you are willing, and the majority vote or nominee of the Lightcone team if Raemon is unavailable for some reason (and @habryka approves that).
  • (8) won't be attempted, or will fail at some combination of design, manufacture, or just-being-pickable.

This is a great proposal and a beautifully compact crux for the overall approach.

I agree with you that this feels like a 'compact crux' for many parts of the agenda. I'd like to take your bet; let me reflect on whether there are any additional operationalizations or conditions.

However, I believe that the path there is to extend and complement current techniques, including empirical and experimental approaches alongside formal verification - whatever

... (read more)
6Zac Hatfield-Dodds
quick proposals:
  • I win at the end of 2026, if there has not been a formally-verified design for a mechanical lock, OR the design does not verify it cannot be mechanically picked, OR less than three consistent physical instances have been manufactured. (e.g. a total of three including prototypes or other designs doesn't count)
  • You win if at the end of 2027, there have been credible and failed expert attempts to pick such a lock (e.g. an open challenge at Defcon). I win if there is a successful attempt.
  • Bet resolves neutral, and we each donate half our stakes to a mutually-agreed charity, if it's unclear whether production actually happened, or there were no credible attempts to pick a verified lock.
  • Any disputes resolved by the best judgement of an agreed-in-advance arbiter; I'd be happy with the LessWrong team if you and they also agree.

I agree with this, I'd like to see AI Safety scale with new projects. A few ideas I've been mulling:

- A 'festival week' bringing entrepreneur types and AI safety types together to cowork from the same place, along with a few talks and a lot of mixers.
- Running an incubator/accelerator program at the tail end of a funding round, with fiscal sponsorship and some amount of operational support.
- More targeted recruitment for specific projects to advance important parts of a research agenda.

 

It's often unclear to me whether new projects should actually... (read more)

First off, thank you for writing this - great explanation.

  • Do you anticipate acceleration risks from developing the formal models through an open, multilateral process? Presumably others could use the models to train and advance the capabilities of their own RL agents. Or is the expectation that regulation would accompany this such that only the consortium could use the world model?
  • Would the simulations be exclusively for 'hard science' domains - ex. chemistry, biology - or would simulations of human behavior, economics, and politics also be needed? My
... (read more)
6Gabin
  • The formal models don't need to be open and public, and probably shouldn't be. Of course this adds a layer of difficulty, since it is harder to coordinate on an international scale and invite a lot of researchers to help on your project when you also want some protection against your model being stolen or published on the internet. It is perhaps okay if it is open source in the case where it is very expensive to train a model in this simulation and no other group can afford it.
  • Good question. I don't know, and I don't think that I have a good model of what the simulation would look like. Here is what my (very simplified, probably wrong) model of Davidad would say:
    • We only want to be really sure that the agent is locally nice. In particular, we want the agent to not hurt people (or perhaps only if we can be sure that there are good reasons, for example if they were going to hurt someone). The agent should not hurt them with weapons, or by removing the oxygen, or by increasing radiation. For that, we need to find a mathematical model of human boundaries, and then we need to formally verify that these boundaries will be respected. Since the agent is trained using time-bounded RL, after a short period of time it will not have any effect anymore on the world (if time-bounded RL indeed works), and the shareholders will be able to determine if the policy had a good impact on the world or not, and if not, train another agent and/or change the desiderata and/or improve the formal model. That's why it is more important to have a fine model of chemistry and physics, and we can do with a coarser model of economics and politics. In particular, we should not simulate millions of people.
    • Is it reasonable? I don't know, and until I see this mathematical model of human boundaries, or a very convincing prototype, I'll be a bit skeptical.

This seems like an important crux to me, because I don't think greatly slowing AI in the US would require new federal laws. I think many of the actions I listed could be taken by government agencies that over-interpret their existing mandates given the right political and social climate. For instance, the eviction moratorium during COVID obviously should have required congressional action, but was done by fiat through an over-interpretation of authority by an executive branch agency.

What they do or do not do seems mostly dictated by that socio-political climate, and by the courts, which means fewer veto points for industry.

I agree that competition with China is a plausible reason regulation won't happen; that will certainly be one of the arguments advanced by industry and NatSec as to why it should not be throttled. However, I'm not sure it will be stronger than the protectionist impulses, and currently I don't think it will be. Possibly it will exacerbate the "centralization" of AI dynamic that I listed in the 'licensing' bullet point, where large existing players receive money and de-facto license to operate in certain areas and then avoid others (as memeticimagery points out). So fo... (read more)

2James_Miller
Greatly slowing AI in the US would require new federal laws, meaning you need the support of the Senate, House, presidency, courts (to not rule them unconstitutional), and bureaucracy (to actually enforce them). If big tech can get at least one of these five power centers on its side, it can block meaningful change.

Hah, yes - seeing that great post from johnswentworth inspired me to review my own thinking on RadVac. Ultimately I placed a lower estimate on RadVac being effective - or at least effective enough to get me to change my quarantine behavior - such that the price wasn't worth it, but I think I get a rationality demerit for not investing more in the collaborative model building (and collaborative purchasing) part of the process.

I'm sorry I didn't see this response until now - thank you for the detailed answer!

I'm guessing your concern feels similar to ones you've articulated in the past around... "heart"/"grounded" rationality, or a concern about "disabling pieces of the epistemic immune system". 

I'm curious whether, 8 months later, you feel you can better speak to what you see as the crucial misunderstanding?

Out of curiosity what's one of your more substantive disagreements with Thiel?

Forecast - 25 mins

  • I thought it was more likely that in the short run there could be a preference cascade among top AGI researchers, and, as others have mentioned, given the operationalization of 'top AGI researchers' this might be true already.
  • If this doesn't become a majority concern by 2050, I expect it will be because of another AI winter, and I tried to have my distribution reflect that (a little ham-fistedly).

Thanks for posting this. I recently reread the Fountainhead, which I similarly enjoyed and got more out of than did my teenage self - it was like a narrative, emotional portrayal of the ideals in Marc Andreessen's It's Time to Build essay.

I interpreted your section on The Conflict as the choice between voice and exit.

The larger scientific question was related to Factored Cognition, and getting a sense of the difficulty of solving problems through this type of "collaborative crowdsourcing". The hope was running this experiment would lead to insights that could then inform the direction of future experiments, in the way that you might fingertip feel your way around an unknown space to get a handle on where to go next. For example if it turned out to be easy for groups to execute this type of problem solving, we might push ahead with competitions between teams t... (read more)

2DirectedEvolution
Thanks for that thorough answer!

All projects are forms of learning. I find that much of my learning time is consumed by two related tasks:
  1. Familiarizing myself with the reference materials. Examples: reading the textbook, taking notes on a lecture, asking questions during a lecture.
  2. Creating a personalized meta-reference to distill and organize the reference materials so that it'll be faster and easier to re-teach myself in the future. Examples: highlighting textbook material that I expect I won't remember and crossing out explanations I no longer need, re-formatting concepts learned in a math class into a unified presentation format, deciding which concepts need to be made into flash cards.

Those steps seem related to the challenges and strategies you encountered in this project. We know that students forget much of what they learn, despite their best efforts. I think it's wiser not to try hard to remember everything, but instead to "plan to forget" and create personalized references so that it's easy to re-teach yourself later when the need arises. I wish that skill were more emphasized in the school system. I think we put too much emphasis on trying to make students work harder and memorize better and "de-stress," and too little on helping students create a carefully thought-out system of notes and references and practice material that will be useful to them later on. The process of creating really good notes will also serve as a useful form of practice and a motivating tool. I find myself much more inclined to study if I've done this work, and I do in fact retain concepts much better if I've put in this work.

Your project sounds like an interesting approach to tackle a related challenge. I'd be especially interested to hear about any efforts you make to tease out the differences between work that's divided between different people, and work that's divided between different "versions of you" at different times.

Thanks, rewrote and tried to clarify. In essence the researchers were testing transmission of "strategies" for using a tool, where an individual was limited in what they could transmit to the next user, akin to this relay experiment.

In fact they found that trying to convey causal theories could undermine the next person's performance; they speculate that it reduced experimentation prematurely.

3ESRogs
Better now, thanks!

Thanks for posting this. Why did you invest in those three startups in particular? Was it the market, the founders, personal connections? And was it a systematic search for startups to invest in, or more of an "opportunity-arose" situation?

3ESRogs
These were all personal connections / opportunity-arose situations. The closest I've done to a systematic search was once asking someone who'd done a bunch of angel investments if there were any he'd invested in who were looking for more money and whom he was considering investing more in. That was actually my first angel investment (Pantelligent) and it ended up not working out. (But of course that's the median expected outcome.) (The other two that I invested in that are not still going concerns were AgoraFund and AlphaSheets. Both of those were through personal connections as well.)

I know Ozzie has been thinking about this, because we were chatting about how to use an Alfred workflow to post to it. Which I think would be great!

I've spent a fair bit of time in the forecasting space playing w/ different tools, and I never found one that I could reliably use for personal prediction tracking.

Ultimately for me it comes down to:

1.) Friction: the predictions I'm most interested in tracking are "5-second-level" predictions - "do I think this person is right", "is the fact that I have a cough and am tired a sign that I'm getting sick", etc. - and I need to be able to jot them down quickly.

2.) "Routine": There are certain sites that a... (read more)

5ozziegooen
For those reading, the main thing I'm optimizing Foretold for right now is forecasting experiments and projects with 2-100 forecasters. The spirit of making "quick and dirty" questions for personal use conflicts a bit with that of making "well thought out and clear" questions for group use. The latter are messy to change, because it would confuse everyone involved. Note that Foretold does support full probability distributions with the Guesstimate-like syntax, which PredictionBook doesn't. But it's less focused on the quick individual use case in general. If there are recommendations for simple ways to make it better for individuals, maybe other workflows, I'd be up for adding some support or integrations.
3Raemon
Is there an option for foretold to become Very Low Friction somehow? I agree with the "5 second level predictions" thing being a key issue.

The commerce clause gives the federal government broad powers to regulate interstate commerce, and in particular the U.S. Secretary of Health and Human Services can exercise it to institute quarantine. https://cdc.gov/quarantine/aboutlawsregulationsquarantineisolation.html

Depression as a concept doesn't make sense to me. Why on earth would it be fitness-enhancing to have a state of withdrawal, retreat, and collapse where a lack of energy prevents you from trying new things? I've brainstormed a number of explanations:

    • depression as chemical imbalance: a hardware-level failure has occurred, maybe randomly, maybe because of an "overload" of sensation
    • depression as signaling: withdrawal and retreat from the world indicates a credible signal that I need help
    • depression as retreat: the environment has become dangerous
... (read more)
4Taran
I think you're asking too much of evolutionary theory here. Human bodies do lots of things that aren't long-term adaptive -- for example, if you stab them hard enough, all the blood falls out and they die. One could interpret the subsequent shock, anemia, etc. as having some fitness-enhancing purpose, but really the whole thing is a hard-to-fix bug in body design: if there were mutant humans whose blood more reliably stayed inside them, their mutation would quickly reach fixation in the early ancestral environment. We understand blood and wound healing well enough to know that no such mutation can exist: there aren't any small, incrementally-beneficial changes which can produce that result.

In the same way, it shouldn't be confusing that depression is maladaptive; you should only be confused if it's both maladaptive and easy to improve on. Intuitively it feels like it should be -- just pick different policies -- but that intuition isn't rooted in fine-grained understanding of the brain and you shouldn't let it affect your beliefs.
2Matt Goldenberg
On a group selection level it might make lots more sense to have certain people get into states where they're very unlikely to procreate.

I rarely share ideas online (I'm working on that); when I do, the ideas tend to be "small" observations or models, the type I can write out quickly and send, ~10 mins to 1 day after I have them.

I've heard that Talking Heads song dozens of times and have never watched the video. I was missing out!

Neat - hadn't seen that, thanks!

I expect understanding something more explicitly - such as your own and another person's boundaries - without some type of underlying acceptance of that boundary can increase exploitability. I recently wrote a shortform post on the topic of legibility that describes some patterns I've noticed here.

I don't think on average Circling makes one more exploitable, but I expect it increases variance, making some people significantly more exploitable than they were before because previously invisible boundaries are now visible, and can thus be attacke... (read more)

  • Yes And is an improv technique where you keep the energy in a scene alive by going w/ the other person's suggestion and adding more to it. "A: Wow is that your pet monkey? B: Yes and he's also my doctor!"
  • Yes And is generative (creates a lot of output), as opposed to Hmm No which is critical (distills output)
  • A lot of the Sequences is Hmm No
  • It's not that Hmm No is wrong, it's that it cuts off future paths down the Yes And thought-stream.
  • If there's a critical error at the beginning of a thought that will undermine everything else
... (read more)

IMO the term "amplification" fits if the scheme results in 1.) a clear efficiency gain and 2.) scalability. This looks like (delivering equivalent results but at a lower cost OR providing better results for an equivalent cost (cost == $$ & time)), AND (~O(n) scaling costs).

For example if there was a group of people who could emulate [Researcher's] fact checking of 100 claims but do it at 10x speed, then that's an efficiency gain as we're doing the same work in less time. If we pump the number to 1000 claims and the fac... (read more)
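A minimal sketch of this criterion as code (all names and numbers here are hypothetical, just to make the predicate concrete):

```python
def is_amplification(base_quality, base_cost, scheme_quality, scheme_cost,
                     unit_cost_small, unit_cost_large):
    """The two conditions above. 'cost' bundles $$ and time into one number.

    1. Efficiency gain: equivalent results at lower cost, OR better results
       at equivalent cost.
    2. Scalable: per-unit cost stays roughly flat as volume grows (~O(n) total).
    """
    efficiency_gain = (
        (scheme_quality >= base_quality and scheme_cost < base_cost)
        or (scheme_quality > base_quality and scheme_cost <= base_cost)
    )
    scalable = unit_cost_large <= 1.5 * unit_cost_small  # tolerance is arbitrary
    return efficiency_gain and scalable

# The fact-checking example: a group matches a researcher's accuracy on
# 100 claims at 10x speed (costs in hours, purely illustrative).
print(is_amplification(base_quality=1.0, base_cost=100.0,
                       scheme_quality=1.0, scheme_cost=10.0,
                       unit_cost_small=0.10,   # hours/claim at 100 claims
                       unit_cost_large=0.12))  # hours/claim at 1000 claims
```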

Is there not a distillation phase in forecasting? One model of the forecasting process is: person A builds up their model, distills a complicated question into a high-information/highly-compressed datum, which can then be used by others. In my mind it's:

Model -> Distill -> "amplify" (not sure if that's actually the right word)

I prefer the term scalable instead of proliferation for "can this group do it cost-effectively" as it's a similar concept to that in CS.

5ozziegooen
Distillation vs. Instillation

My main point here is that distillation is doing 2 things: transitioning knowledge (from training data to a learned representation), and then compressing that knowledge.[1] The fact that it's compressed in some ways arguably isn't always particularly important; the fact that it's transferred is the main element. If a team of forecasters basically learned a signal, but did so in a very uncompressed way (like, they wrote a bunch of books about said signal), but still were somewhat cost-effective, I think that would be fine.

Around "Proliferation" vs. "Scaling"; I'd be curious if there are better words out there. I definitely considered scaling, but it sounds less concrete and less specific. To "proliferate" means "to generate more of", but to "scale" could mean "to make look bigger, even if nothing is really being done."

I think my cynical guess is that "instillation/proliferation" won't catch on because they are too uncommon, but also that "distillation" won't catch on because it feels like a stretch from the ML use case. Could use more feedback here.

[1] Interestingly, there seem to be two distinct stages in Deep Learning that map to these two different things, according to Naftali Tishby's claims.

Thanks for including that link - seems right, and reminded me of Scott's old post Epistemic Learned Helplessness

The only difference between their presentation and mine is that I’m saying that for 99% of people, 99% of the time, taking ideas seriously is the wrong strategy

I kinda think this is true, and it's not clear to me from the outset whether you should "go down the path" of getting access to level 3 magic given the negatives.

Probably good heuristics are proceeding with caution when encountering new/out-there ideas, remember... (read more)

  • Why do I not always have conscious access to my inner parts? Why, when speaking with authority figures, might I have a sudden sense of blankness?
  • Recently I've been thinking about this reaction in the frame of 'legibility', ala Seeing Like a State. States would impose organizational structures on societies that were easy to see and control - they made the society more legible to the actors who ran the state - but these organizational structures were bad for the people in the society.
    • For example, census data, standardized weights and m
... (read more)
4Viliam
Related: Reason as memetic immune disorder

I like the idea that having some parts of you protected from yourself makes them indirectly protected from people or memes who have power over you (and want to optimize you for their benefit, not yours). Being irrational is better than being transparently rational when someone is holding a gun to your head. If you could do something, you would be forced to do it (against your interests), so it's better for you if you can't. But, what now? It seems like rationality and introspection is a bit like defusing a bomb -- great if you can do it perfectly, but it kills you when you do it halfway.

It reminds me of a fantasy book which had a system of magic where wizards could achieve 4 levels of power. Being known as a 3rd-level wizard was a very bad thing, because all 4th-level wizards were trying to magically enslave you -- to get rid of a potential competitor, and to get a powerful slave (I suppose the magical cost of enslaving someone didn't grow proportionally to the victim's level). To use an analogy, being biologically incapable of reaching the 3rd level of magic might be an evolutionary advantage. But at the same time, it would prevent you from reaching the 4th level, ever.

I'd also encourage you to link your predictions to Foretold/Metaculus/other prediction aggregator questions, though only if you write your prediction in the thread as well to prevent link rot.

As a Schelling point, you can use this Foretold community which I made specifically for this thread.

I watched all of the Grandmaster-level games. When playing against grandmasters, the average win rate of AlphaStar across all three races was 55.25%:

  • Protoss Win Rate: 78.57%
  • Terran Win Rate: 33.33%
  • Zerg Win Rate: 53.85%
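As a quick check, the overall figure is exactly the unweighted mean of the three per-race rates:

$$\frac{78.57\% + 33.33\% + 53.85\%}{3} = \frac{165.75\%}{3} \approx 55.25\%$$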

Detailed match by match scoring

While I don't think that it is truly "superhuman", it is definitely competitive against top players.


https://twitter.com/esyudkowsky/status/910941417928777728

I remember seeing other claims/analysis of this but don't remember where

2ChristianKl
When EY says "our community" he means more than just LW but the whole rationalist diaspora as well, towards which Robin Hanson can be counted.

Is the clearest "win" of a LW meme the rise of the term "virtue signaling"? On the one hand I'm impressed w/ how dominant it has become in the discourse, on the other... maybe our comparative advantage is creating really sharp symmetric weapons...

3Viliam
Do I understand it correctly that you believe the words "virtue signaling", or at least their frequent use, originate on LW? What is your evidence for this? (Do you have a link to what appears to be the first use?) In my opinion, Robin Hanson is a more likely suspect, because he talks about signaling all the time. But I would not be surprised to hear that someone else used that idiom first, maybe decades ago. In other words, is there anything more than "I heard about 'virtue signaling' first on LW"?

I have a cold, which reminded me that I want fashionable face masks to catch on so that I can wear them all the time in cold-and-flu season without accruing weirdness points.

Looks like the Monkey's Paw curled a finger here ...

I'm interested, and I'd suggest using https://foretold.io for this

I'd like to see someone in this community write an extension / refinement of it to further {need-good-color-name}pill people into the LW meme that the "higher mind" is not fundamentally better than the "animal mind"

I'd agree w/ the point that giving subordinates plans and the freedom to execute them as best they can tends to work out better, but that seems to be strongly dependent on other context, in particular the field they're working in (ex. software engineering vs. civil engineering vs. military engineering), cultural norms (ex. is this a place where agile engineering norms have taken hold?), and reward distributions (ex. does experimenting by individuals hold the potential for big rewards, or are all rewards likely to be distributed in a normal fas... (read more)

3Daniel Kokotajlo
I agree. I don't think agents will outcompete tools in every domain; indeed in most domains perhaps specialized tools will eventually win (already, we see humans being replaced by expensive specialized machinery, or expensive human specialists, lots of places). But I still think that there will be strong competitive pressure to create agent AGI, because there are many important domains where agency is an advantage.

From a 2-min brainstorm of "info products" I'd expect to be action-guiding:

  • Metrics and dashboards reflecting the current state of the organization.
  • Vision statements ("what do we as an organization do and thus what things should we consider as part of our strategy")
  • Trusted advisors
  • Market forces (e.g. prices of goods)

One concrete example is from when I worked in a business intelligence role. What executives wanted was extremely trustworthy, reliable data sources to track business performance over time. In a software environment ... (read more)

It seems true that there are a lot of ways to utilize forecasts. In general forecasting tends to have an implicit and unstated connection to the decision-making process - I think that has to do w/ the nature of operationalization ("a forecast needs to be on a very specific thing") and with the fact that much of the popular literature on forecasting has come from business literature (e.g. How to Measure Anything).

That being said, I think action-guidingness is still the correct bar to meet for evaluating the effect it has on the EA community. I would bite ... (read more)

2Bird Concept
I wonder whether you have any examples, or concrete case studies, of things that were successfully action-guiding for people/organisations? (Beyond forecasts and blog posts, though those are fine too.)