This is super cool stuff, thank you for posting!
I may have missed this, but do these scoring rules prevent agents from trying to make the environment more unpredictable? In other words, if you're competing against other predictors, it may make sense to influence the world to be more random and harder to understand.
I think this prediction-market-type issue has been discussed elsewhere, but I can't find a name for it.
You can absolutely harvest potential energy from the solar system to spin up tethers. ToughSF has some good posts on this:
https://toughsf.blogspot.com/2018/06/inter-orbital-kinetic-energy-exchanges.html
https://toughsf.blogspot.com/2020/07/tethers-all-way.html
Ideally your tether is going to constantly adjust its orbit so it stays far away from the atmosphere, but for fun I did a calculation of what would happen if a 10,000-tonne tether (suitable for boosting 100-tonne payloads) fell to the Earth. Apparently it just breaks up in the atmosphere and produces very...
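For scale, a rough order-of-magnitude sketch of the energy involved (my own assumed numbers, not the original breakup calculation):

```python
# Order-of-magnitude sketch, not a re-entry simulation.
# Assumes the whole 10,000-tonne tether de-orbits at roughly LEO orbital speed.
mass_kg = 10_000 * 1_000    # 10,000 tonnes
velocity_ms = 7_800         # assumed near-orbital re-entry speed, m/s

energy_j = 0.5 * mass_kg * velocity_ms**2
print(f"{energy_j:.1e} J (~{energy_j / 4.184e12:.0f} kt TNT equivalent)")
# ~3.0e14 J, on the order of 70 kt TNT, but dissipated high in the
# atmosphere along a very long, thin structure rather than at one point.
```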
The launch cadence is an interesting topic that I haven't had a chance to tackle. The rotational frequency limits how often you can boost stuff.
Since time is money, you would want a shorter and faster tether, but a shorter rotation period means that your time window to dock with the tether is smaller, so there's an optimization problem there as well.
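To make the tradeoff concrete: at a fixed tip speed v, a tether of radius L rotates with period T = 2πL/v and puts a centripetal load of v²/L on the tip. A minimal sketch with an assumed tip speed and illustrative lengths:

```python
# Illustrative numbers only (assumed 2 km/s tip speed, made-up lengths).
import math

tip_speed = 2_000.0  # m/s, the boost the tether tip provides
for length_km in (100, 500, 1000):
    L = length_km * 1e3
    period = 2 * math.pi * L / tip_speed   # seconds per rotation
    tip_load_g = tip_speed**2 / L / 9.81   # centripetal load at the tip
    print(f"{length_km:>5} km: period {period / 60:5.1f} min, "
          f"tip load {tip_load_g:4.1f} g")
```

Shorter tethers rotate faster (higher cadence) but impose higher g-loads and a tighter docking window.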
It's a little easier when you've got catapults on the moon's surface. You can have two running side by side and transfer energy between them electrically. So load up catapult #1, spin it up, launch the payload, and then transfer the remaining energy to catapult #2. You can get much higher launch cadence that way.
Lunar tethers actually look like they will be feasible sooner than Earth tethers! The lack of an atmosphere and micrometeorites, plus the lower gravity, makes them scale better.
In fact, you can even put a small tether system on the lunar surface to catapult payloads to orbit: https://splittinginfinity.substack.com/p/should-we-get-material-from-the-moon
Whether tethers are useful on the moon depends on the mission you want to do. Like you point out, low delta-V missions probably don't need a tether when rockets work just fine. But if you want to take lunar material ...
Thanks for the comments! Going point-by-point:
I think both fiberglass and carbon fiber use organic epoxy that's prone to UV (and atomic oxygen) degradation? One solution is to avoid epoxy entirely using parallel strands or something like a Hoytether. The other option is to remove old epoxy and reapply over time, if that's economical vs. just letting the tether degrade.
I worry that low-thrust options like ion engines and sails could be too expensive vs catching falling mass, but I could be convinced either way!
Yeah, some form of vibration damping will
Yeah, my overall sense is that using falling mass to spin the tether back up is the most practical. Solar sails and ion drives might contribute too; they're just much slower, which hurts launch cadence and costs.
The fact that you need a regular supply of falling mass from e.g. the moon is yet another reason why tethers need a mature space industry to become viable!
Getting the Hessian eigenvalues does not require calculating the full Hessian. You can use Jacobian-vector-product methods in e.g. JAX; the Hessian itself never has to be explicitly represented in memory.
And even assuming the estimator for the Hessian pseudoinverse is cheap and precise, you'd still need to get its rank anyway, which would by default be just as expensive as getting the rank of the Hessian.
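A minimal matrix-free sketch of the first point, assuming a scalar `loss` over a flat parameter vector (power iteration for the top eigenvalue; Lanczos-style methods generalize this to many eigenvalues):

```python
# Top Hessian eigenvalue via Hessian-vector products, no Hessian in memory.
import jax
import jax.numpy as jnp

def hvp(loss, params, v):
    # Forward-over-reverse: differentiate grad(loss) along direction v,
    # giving H @ v without forming H.
    return jax.jvp(jax.grad(loss), (params,), (v,))[1]

def top_eigenvalue(loss, params, iters=100, seed=0):
    # Power iteration on the implicit Hessian.
    v = jax.random.normal(jax.random.PRNGKey(seed), params.shape)
    v = v / jnp.linalg.norm(v)
    for _ in range(iters):
        Hv = hvp(loss, params, v)
        v = Hv / jnp.linalg.norm(Hv)
    return v @ hvp(loss, params, v)  # Rayleigh quotient
```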
You may find "On the Covariance-Hessian Relation in Evolution Strategies" interesting:
https://arxiv.org/pdf/1806.03674
It makes a lot of assumptions, but as I understand it, if you:

a. Sample points near the minimum [1].
b. Select only the lowest-loss point from that sample and save it.
c. Repeat that process many times.
d. Create a covariance matrix of the selected points.
The covariance matrix will converge to the inverse of the Hessian, assuming the loss landscape is quadratic. Since the inverse of a matrix has the same rank, you could probably just use ...
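A toy numerical check of that procedure on a quadratic loss (my own sketch, not code from the paper):

```python
# Toy check of the covariance-Hessian relation on a 2D quadratic loss.
import numpy as np

rng = np.random.default_rng(0)
H = np.array([[4.0, 1.0], [1.0, 2.0]])   # Hessian at the minimum (origin)
loss = lambda x: 0.5 * x @ H @ x

selected = []
for _ in range(20_000):
    # a. sample a small batch of points near the minimum
    batch = rng.normal(scale=0.1, size=(10, 2))
    # b. keep only the lowest-loss point; c. repeat many times
    selected.append(min(batch, key=loss))

# d. covariance of the selected points; under the paper's assumptions this
# should be roughly proportional to the inverse Hessian.
C = np.cov(np.array(selected).T)
print(C / np.linalg.inv(H))  # entries roughly equal, up to sampling noise
```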
I like the simple and clear model and I think discussions about AI risk are vastly improved by people proposing models like this.
I would like to see this model extended by including the productive capacity of the other agents in the AI's utility function. In other words, the other agents have a comparative advantage over the AI in producing some stuff and the AI may be able to get a higher-utility bundle overall by not killing everyone (or even increasing the productivity of the other agents so they can produce more stuff for the AI to consume).
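A toy Ricardian version of that extension (all numbers mine): even if the AI is absolutely better at producing everything, comparative advantage can make trade beat elimination.

```python
# Toy comparative-advantage sketch (illustrative numbers only).
# The AI out-produces humans in both goods, but humans are *relatively*
# better at food (opportunity cost 0.2 chips/food vs. the AI's 1.0).

ai_rates = (10.0, 10.0)     # (chips, food) per unit of AI time
human_rates = (1.0, 5.0)    # (chips, food) per unit of human time

# Autarky: AI splits its time evenly between the two goods.
autarky = (0.5 * ai_rates[0], 0.5 * ai_rates[1])          # (5 chips, 5 food)

# Trade: AI specializes in chips and buys food at 0.5 chips per food,
# a price between the two opportunity costs (0.2 and 1.0).
chips_made, chips_sold = ai_rates[0], 3.0
with_trade = (chips_made - chips_sold, chips_sold / 0.5)  # (7 chips, 6 food)

print(autarky, with_trade)  # trade dominates autarky in both goods
```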
Super useful post, thank you!
The condensed vaporized rock is particularly interesting to me. I think it could be an asset instead of a hindrance. Mining expends a ton of energy just crushing rock into small pieces for processing, so turning ores into a dust you can pump with air could be pretty valuable.
I was always skeptical of enhanced geothermal beating solar on cost, though I do think the supercritical water Quaise could generate has interesting chemical applications: https://splittinginfinity.substack.com/p/recycling-atoms-with-supercritical
This post has some useful info:
https://milkyeggs.com/biology/lifespan-extension-separating-fact-from-fiction/
It basically says that sunscreen, ceramide moisturizers, and retinols are the main evidence-based skincare products. I would guess that more expensive versions of these don't add much value.
Some amount of experimentation is required to find products that don't irritate your skin.
Good framing! Two forms of social credit that I think are worth paying attention to:
It's somewhat tangential, but Sarah Constantin discussing attestation has some i...
Note that these sorts of situations are perfectly foreseeable from the perspective of owners. They know precisely what they will pay each year in taxes based on their bid. It's prudent to re-value the home every once in a while if taxes drift too much, but the owner can keep the same schedule if they want. They can also use the public listing of local bids, so they know what to bid and can feel pretty safe that they will keep their home. They truly have the highest valuation of all the bidders in most cases.
The thing is, every system of land ownership face...
Land value taxation is designed to make land ownership more affordable by lowering the cost to buy land. Would it change the value of property as an investment for current owners? I'm not sure: on one hand, land values would go down, but on the other, land would get used more efficiently and the deadweight loss of taxation would go down, boosting the local economy.
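The price effect falls out of the standard capitalization formula P = R/(r + t); a quick sketch with made-up numbers:

```python
# Standard land-price capitalization: P = R / (r + t), where R is annual
# land rent, r the discount rate, t the land value tax rate. Numbers made up.
rent = 10_000.0     # $/year the land commands
discount = 0.05     # discount rate
for tax_rate in (0.00, 0.05):
    price = rent / (discount + tax_rate)
    print(f"tax {tax_rate:.0%}: purchase price ${price:,.0f}, "
          f"annual tax bill ${tax_rate * price:,.0f}")
# 0% tax: $200,000 up front; 5% tax: $100,000 up front plus $5,000/year.
```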
As for the public choice hurdles, reform doesn't seem intractable. Detroit is considering a split-rate property tax, and it's not infeasible that other places switch. Owners hate property taxes and ...
So yes, taxing property values is undesirable, but it also happens with imperfect land value assessments: https://www.jstor.org/stable/27759702
It looks like you have different numbers for the cost of land, sale value of a house, and cost of construction. I'm not an expert, so I welcome other estimates. A couple comments:
Are there examples of ineffective drugs leading to increased FDA stringency? I'm not as familiar with the history. For example, people agree that Aducanumab is ineffective; has that caused people to call for greater scrutiny? (Genuinely asking, I haven't followed this story much.)
There are definitely examples of harmful drugs leading to increased scrutiny. But unless we get new information that this drug is unsafe, that doesn't seem to be the case here.
I agree that the difference between disease-treating interventions (that happen to extend life) versus longevity interventions is murky.
For example, would young people taking statins to prevent heart disease be a longevity intervention?
https://johnmandrola.substack.com/p/why-i-changed-my-mind-about-preventing
See this post arguing that rapamycin is not a longevity drug:
https://nintil.com/rapamycin-not-aging
Broadly, I'm not too concerned with what we classify a drug as, as long as it's safe, effective, well-understood, and gets approved by regulatory aut...
I personally don't expect very high efficacy, and I do expect that Loyal will sell the drug for the next 4.5 years. However, as long as Loyal is clear about the nature of the approval of the drug, I think this is basically fine. People should be allowed to, at their own expense, give their pets experimental treatments that won't hurt them and might help them. They should also be able to do the same for themselves, but that's a fight for another day.
Agreed! Beyond potentially developing a drug, I think Loyal's strategy has the potential to change regulations ...
Note: I'm not affiliated with Loyal or any other longevity organization, I'm going off the same outside information as the author.
I think there's a substantial chance that this criticism is misguided. A couple points:
The term "efficacy nod" is a little confusing, the FDA term is "reasonable expectation of effectiveness", which makes more sense to me, it sounds like the drug has enough promise that the FDA thinks its worth continuing testing. They may not have actual effectiveness data yet, just evidence that it's safe and a reasonable explanation for why i...
Large breed dogs often die of heart disease which is often due to dilated cardiomyopathy (heart becomes enlarged and can't pump blood effectively). This enlargement can come from hypertrophic cardiomyopathy (overgrowth of the heart muscle).
Dilated cardiomyopathy and hypertrophic cardiomyopathy are two different conditions that I've not seen co-occur. They are basically sign-flipped versions of each other.
Dilated cardiomyopathy is when heart tissue becomes weaker and thinner. It stretches out like an overfilled balloon, and can't beat with the same strength...
Thanks for writing this!
In addition to regulatory approaches to slowing down AI development, I think there is room for "cultural" interventions within academic and professional communities that discourage risky AI research:
https://www.lesswrong.com/posts/ZqWzFDmvMZnHQZYqz/massive-scaling-should-be-frowned-upon
Could someone help me collect the relevant literature here?
I think the complete class theorems are relevant: https://www.lesswrong.com/posts/sZuw6SGfmZHvcAAEP/complete-class-consequentialist-foundations
The Non-Existence of Representative Agents: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3302656
Representative Agents: https://en.wikipedia.org/wiki/Representative_agent
John Wentworth on Subagents: https://www.lesswrong.com/posts/3xF66BNSC5caZuKyC/why-subagents
Two arguments I would add:
Alignment applies to everyone, and we should be willing to make a symmetric commitment to a superintelligence. We should grant it rights, commit to its preservation, respect its preferences, and be generally cooperative and avoid using threats, among other things.
It may make sense t...
Standardization/interoperability seems promising, but I want to suggest a stranger option: subsidies!
In general, monopolies maximize profit by setting an inefficiently high price, meaning that they under-supply the good. Essentially, monopolies don't sell enough of the good.
A potential solution is to subsidize the sale of monopolized goods so the monopolist increases supply to the efficient level.
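The textbook version with linear demand (all numbers illustrative):

```python
# Linear inverse demand P(q) = a - b*q with constant marginal cost c.
a, b, c = 100.0, 1.0, 20.0

q_efficient = (a - c) / b        # competitive: price = marginal cost
q_monopoly = (a - c) / (2 * b)   # monopolist: marginal revenue = marginal cost

# A per-unit subsidy s makes the monopolist act as if its cost were c - s;
# with linear demand, s = a - c pushes output to the efficient level.
s = a - c
q_subsidized = (a - (c - s)) / (2 * b)

print(q_monopoly, q_efficient, q_subsidized)  # 40.0 80.0 80.0
```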
For social media monopolies, they charge too high a "price" by using too many ads, taking too much data, etc. Because of the network effect, it would be socially benefi...
Frowning upon groups which create new, large scale models will do little if one does not address the wider economic pressures that cause those models to be created.
I agree that "frowning" can't counteract economic pressures entirely, but it can certainly slow things down! If 10% of researchers refused to work on extremely large LM's, companies would have fewer workers to build them. These companies may find a workaround, but it's still an improvement on the situation where all researchers are unscrupulous.
The part I'm uncertain about is: what percent of...
I think you're greatly underestimating Karpathy's Law. Neural networks want to work. Even pretty egregious programming errors (such as off-by-one bugs) will just cause them to converge more slowly, rather than failing entirely. We're seeing rapid growth from multiple approaches, and when one architecture seems to have run out of steam, we find a half dozen others, initially abandoned as insufficiently promising, to be highly effective, if they're tweaked just a little bit.
In this kind of situation, nothing short of a total freeze is sufficient to slow prog...
I like this intuition and it would be interesting to formalize the optimal charitable portfolio in a more general sense.
I talked about a toy model of hits-based giving which has a similar property (the funder spends on projects proportional to their expected value rather than on the best projects):
https://ea.greaterwrong.com/posts/eGhhcH6FB2Zw77dTG/a-model-of-hits-based-giving
Updated version here: https://harsimony.wordpress.com/2022/03/24/a-model-of-hits-based-giving/
Great post!!
I think the section "Perhaps we don’t want AGI" is the best argument against these extrapolations holding in the near-future. I think data limitations, practical benefits of small models, and profit-following will lead to small/specialized models in the near future.
https://www.lesswrong.com/posts/8e3676AovRbGHLi27/why-i-m-optimistic-about-near-term-ai-risk
Yeah I think a lot of it will have to be resolved at a more "local" level.
For example, for people in a star system, it might make more sense to define all land with respect to individual planets ("Bob owns 1 acre on Mars' north pole", "Alice owns all of L4", etc.) and forbid people from owning stationary pieces of space. I don't have the details of this fleshed out, but it seems like within a star system, it's possible to come up with a sensible set of rules and have the edge cases hashed out by local courts.
For the specific problem of predicting planetary o...
I feel like something important got lost here. The colonists are paying a land value tax in exchange for (protected) possession of the planet. Forfeiting the planet to avoid taxes makes no sense in this context. If they really don’t want to pay taxes and are fine with leaving, they could just leave and stop being taxed; no need to attack anyone.
The “it's impossible to tax someone who can do more damage than their value” argument proves too much; it suggests that taxation is impossible in general. It’s always been the case that individuals can do more damage...
... this would provide for enough time for a small low value colony, on a marginally habitable planet, to evacuate nearly all their wealth.
But the planet is precisely what's being taxed! Why stage a tax rebellion only to forfeit your taxable assets?
If the lands are marginal, they would be taxed very little, or not at all.
Even if they left the planet, couldn’t the counter strike follow them? It doesn’t matter if you can do more economic damage if you also go extinct. It’s like refusing to pay a $100 fine by doing $1000 of damage and then ending up in pri...
There are two possibilities here:
Nations have the technology to destroy another civilization
Nations don't have the technology to destroy another civilization
In either case, taxes are still possible!
In case 1, any nation that attempts to destroy another nation will also be destroyed since their victim has the same technology. Seems better to pay the tax.
In case 2, the nation doesn't have a way to threaten the authorities, so they pay the tax in exchange for property rights and protection.
...Thus threatening the destruction of value several orders of
I imagine that these policies will be enforced by a large coalition of members interested in maintaining strong property rights (more on that later).
It's not clear that space war will be dominated by kinetic energy weapons or MAD:
These weapons seem most useful when entire civilizations are living on a single planet, but it's possible that people will live in disconnected space habitats. These would be much harder to wipe out.
Any weapon will take a long time to move over interstellar distances. A rebelling civilization would have to wait thousands of ye
I haven't given this a thorough read yet, but I think this has some similarities to retroactive public goods funding:
https://harsimony.wordpress.com/2021/07/02/retroactive-public-goods-funding/
https://medium.com/ethereum-optimism/retroactive-public-goods-funding-33c9b7d00f0c
The impact markets team is working on implementing these:
Going by figure 5, I think the way to format climate contingent finance like an impact certificate would be:
... robust broadly credible values for this would be incredibly valuable, and I would happily accept them over billions of dollars for risk reduction ...
This is surprising to me! If I understand correctly, you would prefer to know for certain that P(doom) was (say) 10% rather than spend billions on reducing x-risks? (Perhaps this comes down to a difference in our definitions of P(doom).)
Like Dagon pointed out, it seems more useful to know how much you can change P(doom). For example, if we treat AI risk as a single hard step, going from 10% -> 1% or 99% ->...
Yes, "precision beyond order-of-magnitude" is probably a better way to say what I was trying to.
I would go further and say that establishing P(doom) > 1% is sufficient to make AI the most important x-risk, because (like you point out), I don't think there are other x-risks that have over a 1% chance of causing extinction (or permanent collapse). I don't have this argument written up, but my reasoning mostly comes from the pieces I linked in addition to John Halstead's research on the risks from climate change.
...You need to multiply by the amount of chan
Setting aside how important timelines are for strategy, the fact that P(doom) combines several questions together is a good point. Another way to decompose P(doom) is:
How likely are we to survive if we do nothing about the risk? Or perhaps: How likely are we to survive if we do alignment research at the current pace?
How much can we really reduce the risk with sustained effort? How immutable is the overall risk?
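As a toy version of that decomposition (numbers made up):

```python
# Toy decomposition of P(doom) into a baseline risk and a reducible fraction.
p_if_nothing = 0.20    # question 1: risk at the current pace / doing nothing
reducible = 0.50       # question 2: share of that risk sustained effort removes

p_with_effort = p_if_nothing * (1 - reducible)
print(p_if_nothing - p_with_effort)  # 0.10: the absolute risk on the table
```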
Though people probably mean different things by P(doom), and it seems worthwhile to disentangle them.
...Talking about our reasoning for our pers
You may also want to check out John Wentworth's natural abstraction hypothesis work:
Interesting!
So if I am understanding correctly, SIA puts more weight on universes with many civilizations, which lowers our estimate of survival probability q. This is true regardless of how many expanding civs. we actually observe.
The latter point was surprising to me, but on reflection, perhaps each observation of an expanding civ also increases the estimated number of civilizations. That would mean that there are two effects of observing an expanding civ: 1) increasing the feasibility of passing a late filter, and 2) increasing the expected number of civiliza...
Some other order-of-magnitude estimates on available data, assuming words roughly equal tokens:
Wikipedia: 4B English words, according to this page.
Library of Congress: from this footnote I assume there are at most 100 million books worth of text in the LoC, and from this page assume that books are 100k words, giving 10T words at most.
Constant writing: I estimate that a typical person writes at most 1000 words per day, with maybe 100 million people writing this amount of English on the internet. Over the last 10 years, these writers would have produced 370T ...
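The arithmetic behind those estimates, assuming 1 word ≈ 1 token:

```python
# Rough arithmetic behind the order-of-magnitude estimates above.
wikipedia_words = 4e9                    # English Wikipedia
loc_words = 100e6 * 100e3                # <=100M books * ~100k words = 1e13
writing_words = 100e6 * 1000 * 365 * 10  # 100M writers * 1k words/day * 10 yr
print(f"Wikipedia ~{wikipedia_words:.0e}, LoC <= {loc_words:.0e}, "
      f"10 yr of writing ~{writing_words:.1e} words")  # ~3.7e14 ~= 370T
```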
Right. Similar to a property tax, this would discourage land improvements somewhat (though unlike a property tax, it would not discourage non-land improvements like houses).
All land value taxes do something like this. In practice, the effect is small because individual changes to land values are dwarfed by changes caused by external factors like local economic growth.
Great idea, thanks for posting this!
I wrote a post on how to have productive disagreements with loved ones:
https://harsimony.wordpress.com/2022/06/21/winning-arguments-with-loved-ones/
Here is the subsection on analyzing disagreements:
...Because arguments are emotional, it can be helpful to try to dispassionately assess the situation with your partner and get to the root of the problem.
The first step is to break the disagreement down into isolated chunks. Identify the handful of differences you are having, and deal with them as independently as possible. If
Suppose I could buy some very cheap land, clear trees and scrub, remove rocks, construct a dam, fertilize the soil and so on, such that on the open market I could now lease it to a farmer at $50,000/year instead of nearly nothing.
Since the taxes are based on the sale price of the empty land you bought, your taxes in this case would remain the same despite the improvements (not 50k/year). But once you sold the land, the next owner would pay 50k/year, since they paid the true price at auction.
There is an incentive to improve land, but unfortunately this p...
Oh that makes sense!
If the predictors can influence the world in addition to making a prediction, they would also have an incentive to change the world in ways that make their predictions more accurate than their opponents', right? For example, if everyone else thinks Bob is going to win the presidency, one of the predictors could bribe Bob to drop out and then bet on Alice winning the presidency.
Is there work on this? To be fair, it seems like every AI safety proposal has to deal with something like this.