All of siclabomines's Comments + Replies

I hate that I actually liked Answer to Job

Thus, I must currently hold Sam Altman guilty

 

*innocent

2pythagoras5015
My mistake. Fixed it. Thanks for pointing that out!

The largest models should be expected to compress less than smaller ones though, right?

The main problem with crawlers is that their usage patterns don't match those of regular users, and most optimization effort is focused on the usage patterns of real users, so bots sometimes wind up using the site in ways that consume orders of magnitude more compute per request than a regular user would.

And Twitter has recently destroyed his API, I think? Which perhaps has the effect of de-optimizing the usage patterns of bots.

Hinton says he partly regrets his life’s work.

 

This may be another Cade Metz moment. (38:20)

 

Right. From what I've seen, the people that support censoring misinformation are almost never doing it out of worries that themselves will get misinformed.

I'm assuming dsj's hypothetical scenario is not one where GPT-6 was prompted to simulate an actor playing a villain.

It's a nice analogy, but it all rests on whether infinite evidence is a thing or not, and there aren't arguments one way or the other here. (Sure, infinite evidence would mean "whatever log odds you come up with, this is even stronger", but that doesn't rule out it is a thing). 

Like, how much evidence for the hypothesis "I'll perceive the die to come up a 4" does the event "Ok, die was thrown and I am perceiving it to be a 3" provide? Or how much evidence do I have of being conscious right now when I am feeling like something? I think any answer different from infinity is just playing a word game.

Aiming for convergence on truth. I guess it's true this might lead to a failure mode where one seeks for convergence more than anything else. But taken literally, this should not discourage exploring new wild hypotheses. If you are both equally wrong, by growing your uncertainty you get nearer to converging on truth.

True. Still, using 1960's prices with current production assumes a 1960 flat demand curve, right? It's like using off-season avocado prices when no one buys them to compute real GDP during avocado season. 

Maybe the UK's case curve has flattened after the end of the spike due to the asymptomatic people that are getting tested for whatever reasons and turn positive for the reason you state? It doesn't feel likely (perhaps it's just the other omicron subvariant giving it a push? or just the "control system" of people relaxing?). The hospital admissions continued to go down as one would expect if this was the case, though the data at ourworldindata is a few days behind. 

I know it's unlikely, but if it was indeed omicron, its faster generation time also would make its numbers drop faster if they managed to move R under 1

I presume 12 feet is a quarter of the risk of 6 feet [...] there is no magic number

 

My intuitive oversimplified model of this has been analogous to the direct sound vs reverberant sound in acoustics (in slow motion). 

 

I'd expect the risk from direct viruses to follow the inverse square law (at least to the extent that the risk is linear to the expected number of viruses around you, which can't be true for high risks). And maybe be even be reduced by cloth masks which stop big droplets (?).

But the reverberant viruses are supposed to be the ma... (read more)

Can immune escape by itself explain the transmission advantage or do we also need it to be spreading better?

2Daniel V
Trevor Bedford gets into this, and the short answer is technically yes, but the important part about decomposing Rt is not the decomposing it per se but the info it yields on how much vaccine escape might be going on. For reasonable R0s, there has to be substantial vaccine escape. https://twitter.com/trvrb/status/1466076797670363140
4Zvi
In theory if enough people have enough immunity you can explain any advantage with immune escape. I can't rule it out. However, I'm skeptical, primarily because if everyone was previously infected then you wouldn't see vaccines giving much advantage vs. infection (either by Omicron or Delta) or protection against severe disease. The fact that vaccines or being infected still matters vs. baseline population puts a not-too-high cap on how many people have already been infected, which in turn makes the current doubling times very hard to explain with evasion alone. 

Makes sense to me...

On the other hand, if it takes longer to show symptoms but it's still equally transmissible since early but for longer, you get higher Rs without surprising new mechanisms of transmission. Also, it may also be escaping our current precautions instead of the immunity.

Why does it follow that a longer time to develop symptoms suggests immune escape?

 

Also, if the timeline is longer, then the estimates of how much more transmissible Omicron is, based on the time it's taken for it to displace Delta, should be even greater, right?

4Lukas_Gloor
The speed by which Omicron outcompeted Delta is explained by three broad contributors:  - Generation time - Immunity escape - Transmissibility in susceptible populations We learn (probabilistically) that the first factor isn't lower than in Delta, and may even be higher (longer generation time). This leaves even more work to be done for the other two factors. You're right that in theory, Omicron could just be crazy good at spreading in susceptible populations. However, I think that's unlikely on priors given that you get an easy 3x increase in overall transmissibility with full immune escape, and that seems easier, mutation-wise, than the virus suddenly turning into something that's a lot more transmissible than any coronavirus we've ever seen. 

On the other hand, those lockdowns may only last until the cases start going down again, but you can't get unvaccinated.

 If not mandating vaccination for indoor dining, then what?

Even that minimally coercive approach you describe is pretty coercive; I don´t expect the benefits to outweigh the ugly side of making many tens of millions of people be injected with something they don´t like or trust or want. Some people are still getting convinced to get vaccinated just with time alone, and many other things could be done better to convince more people without more restrictions. I don´t know what to expand on without making this too long. 

1tkpwaeub
Sure, it's coercive, but at least the scope of coercion is smaller (in terms of who is being coerced to do stuff)

the WHO who still refuse to admit Covid is airborne

 

Sort of.  For some months now, the WHO states that it can spread "in poorly ventilated and/or crowded indoor settings [...] because aerosols remain suspended in the air" 

 

EDIT: (used to ask why the link wasn't formatting properly)

2MondSemmel
(Meta comment: Formatting is in rich text by default; selecting text displays a hover with formatting options. You can switch to markdown formatting in your user settings ("Activate Markdown Editor").)

And if the choice is between ‘no indoor dining (or other X) for anyone’ and ‘no indoor dining (or other X) for the unvaccinated’ I know which one I’m choosing, and which one leaves me more free.

I agree that "no indoor dining for anyone" is worse than mandating vaccination for indoor dining. But I also don´t think the situation merits either. Protecting the immunocompromised and people that want but can´t get vaccinated doesn´t make up for the concerns.

2tkpwaeub
Can you elaborate? If not mandating vaccination for indoor dining, then what? Full disclosure, I'm actually not a big fan of vax passes either, but coming from the opposite end: I think vaccination should be mandatory across the board and backed up by very large fines, whether or not one chooses to enter public spaces. In my opinion this would actually be the minimally coercive approach, averaged over the entire population - since it doesn't require everyone else to be constantly policing vaccination status. Despite my reservations, if asked to pick a side I'll always land on Team Vax Passport.

 we’d not only not make them mandatory, they’d be forbidden.

The space between those two is very small, maybe even negative.

Regarding the intelligence tests after COVID: Fourth, I can imagine some people that had COVID and go test themselves might actually want/expect to see some effect and end up not doing their best, to be a victim or have an excuse or something to blame for whatever.

6Dokler
Another point on how the test-taking sample may be biased- it's possible that those who expect that they have experienced cognitive decline are less likely to take the test as a way to avoid confronting reality.

It's unlikely that these corporations would make the assumption that all future IP would also be "confiscated"

Do you have a good explanation to Moderna's market price drop?

Even if they made that assumption, what are they supposed to do? Stop investing in future developments, and slowly go out of business?

Borrow less, invest less, or, as you say in your last line, focus on other ways of making money that don't require innovation and IP?

3[anonymous]
  1 - it was tiny compared to recent changes 2 - the efficient market hypothesis is quite quite false.
8tkpwaeub
Neither the Moderna nor the Pfizer stocks dropped very dramatically - both had solid upward trends throughout the past month, and the prices are already bouncing back. Investors make mistakes too.

Right! My untrained intuition still resists a bit; I should play with the numbers.

Niice, it makes sense! Thanks!

So to recap, I was right in that riskier assets can have higher avg returns, but I was missing the usually bigger and opposing effect where as the assets gets riskier, the same avg returns rely more and more on lucky very big gains while doing worse more often (at least if they are sort of lognormal). 

My second point I still think was correct, right? -- i.e., that if Scott believed ETH had some chance of total collapse (a mixture distribution), then this skews it to the other side and pushes the median below the mean, and gives some reason to think ETH is more likely to outperform BTC. Does this make sense?

2SimonM
It is possible that Scott believed that ETH is negatively-skewed (ie small chance of collapsing, large chance of small increase) but this would be inconsistent with his probability that ETH is going to 5k.  I think the vast majority of people think crypto is positively-skewed.

If ETH is less risky than BTC then the median performance of ETH will outperform BTC and his probability could be consistent with EMH

Wait. Does this mean that EMH expects less risky investments to have higher performance on average? That sounds shocking enough that I must be confusing something here. Or is this some sort of median vs mean distinction that I'm not seeing

?

3SimonM
tl;dr "some sort of median vs mean distinction" No, there's two things going on which act against each other: * Riskier assets have higher returns on average * Riskier assets are more skewed (mean higher than median) I've made the (I think safe) assumption that the skewness of the assets are more important than the relative differences in their expected return.  You can have a play with some toy models for this, for example, fixed Sharpe, lognormal assets  you will have something which looks like: log(X) ~ N(sharpe * vol - vol^2/2, vol) P(return larger than r) = P( Z*vol + (sharpe*vol - vol^2/2) > r) = P(Z  > (-sharpe + vol/2 + r/vol)) If we're interested in r = 0 (outperformance of current price) then as vol increases, the probability goes down.  (There's actually quite a bit of intuitive stuff which drops out of this model (if we're required to hit a given r, then increasing volatility makes it easier (up until vol = sqrt(2*r)))

About 17 and the EMH. Can't Scott be just thinking that ETH is sufficiently more risky than BTC so it may have higher expected returns even with the EMH (the EMH allows this, right?). Or even that he might think ETH has some chance of total collapse (like an outlier at 0) so even with equal expected returns it's much more probable that ETH outperforms BTC than the other way around (?)

1SimonM
Actually this is the other way around. If ETH is less risky than BTC then the median performance of ETH will outperform BTC and his probability could be consistent with EMH This is neither consistent with historical realised volatility (ETH is more volatile than BTC), nor is it consistent with the options market (ETH implied vols are all higher than the equivalent moneyness BTC implied vols)

What's this supposed to be estimating or predicting with Bayes here? The thing you'll end up doing?  Something like this?: 

Each of the 3 processes has a general prior about how often they "win" (that add up to 100%, or maybe the basal ganglia normalizes them). And a bayes factor, given the specific "sensory" inputs related to their specific process, while remaining agnostic about the options of the other process. For example, the reinforcer would be thinking: "I get my way 30% of the time. Also, this level of desire to play the game is 2 times mo... (read more)

Whatever prevents the most infection, hospitalization and death is the right answer either way

I first read this sentence as suggesting that killing people is the best way to prevent infection.

3pjeby
Yes, apparently death is the right answer, either way. ;-)

Yeah, if R0 is held constant and also COVID-UK is going up in absolute numbers. 

Israel's deaths are dropping more slowly than I would have intuitively expected given the vaccinations; I now wonder if it's because of longer duration of the new strains which means we may have to wait a little longer until most of the previous infections resolve. Anyone that's been looking at detailed data (like strain prevalence, the ages of the people still dying, etc) has an opinion? (I just looked at the daily death and vaccination rate)

I haven't read the papers so, please correct me if I guess wrong (most likely), anybody.

I'm guessing the UK strain was estimated from relative growth between strains when the UK cases were skyrocketing, and that gave around ~40% higher R0 than COVID-classic. 


Now, say they were underestimating the duration of the UK strain. That would mean it is actually more transmissible than estimated -- but it was masked by the long timescales (transmissibleness means R, right?). And that would mean that it's that much harder to contain than we thought (yet it was ... (read more)

7TheMajor
I continue to be extremely surprised by the UK decline in numbers. The Netherlands is reporting a current estimated R of 1.1-1.2 for the English strain and 0.8-0.9 for the wild types. They furthermore estimate that just over half of all newly reported cases are English strain by now. But the UK daily cases have dropped by 80% in 40 days, which at a reproduction time of 6 days would mean R = 0.79 throughout. In the past I suggested a few potential, not mutually exclusive, explanations: 1. The UK has implemented significantly more effective measures, and if we just copy them we can totally beat the English strain. 2. The height of the UK peak in the second week of January was caused by Christmas and New Years holiday craze, which caused significant delayed reporting ('better take that test after I visit all my friends and family, otherwise I won't be allowed to join them') and massively overestimates the peak, and also the decay. 3. The Dutch models are crap. 4. The UK numbers are crap. 5. The English strain has spread throughout the London area so rapidly that it hit local group immunity, and the plummet afterwards is caused by a lack of geographical spread. Once this picks up again the UK will see a stark rise in cases. I previously put my money on hypothesis number 5, but as time goes on it steadily loses credibility. If anybody has a suggestion for what's going on in the UK right now I'm all ears, I am currently not taking their drop in cases at face value.
3Zvi
Depends if you think the previous R0 calculations were based on getting the timing right, and how you think about what's acting on what. If this makes us update towards a much higher R0, then yes we are in more trouble rather than less trouble and it could end up here faster on net, whereas if we hold R0 as known then this slows things down.

I think you get more points for earlier predictions.

So one should interpret the points as a measure of how useful you've been to the overall predictions in the platform, and not how good you should be expected to be on a specific question, right?

3Davidmanheim
Not really. Overall usefulness is really about something like covariance with the overall prediction - are you contributing different ideas and models. That would be very hard to measure, while making the points incentive compatible is not nearly as hard to do. And how well an individual predictor will do, based on historical evidence, is found in comparing their brier to the metaculus prediction on the same set of questions. This is information which users can see on their own page. But it's not a useful figure unless you're asking about relative performance, which as an outsider interpreting predictions, you shouldn't care about - because you want the aggregated prediction.

Yeah, I wasn't trying to be tautological. 

 

I am under the impression that you are thinking something like: "Bezos has ~100 billion to spend. If he spends 1 million in X, then he has 1 million less to spend on the rest. But he won't even get to spend it in his lifetime, so that extra million in X doesn't change how much he would spend in Y. Therefore, it's wrong to say that Y will become more available because Bezos spent in X.".

I don't think that's the right way to think about all this. (Warning: oversimplification coming):

 

Bezos earns some ... (read more)

I think I disagree a bit with both (but what do I know).

For someone like Jeff Bezos, an increase in spending on Item A probably just results in slightly less money spent by his great-grandchildren in 100 years.

This doens't seem to me to be the right way to think about it. Short term, the more he spends on Item A will result in lower spending on Item B, or lower investment in his companies, a lower transfer of money from him to someone else (like through lower savings). Or more money being spent overall if he just uses up cash he had hidden in his pillow; w... (read more)

2jmh
I agree that in a very strict sense money spent on X definitionally cannot be spent on something else (you already spent it so don't have it). But does that type of tautological view matter here? (And if that is not the basis of your point and I'm misunderstanding, sorry.) This kind of reminds me of the old Richard Pryor movie Bruster's Millions.  I suspect the numbers have change but at one point I recall Bezos had a new worth of 90 billion. Even with a paltry 1% return on total assets that's like 900 million a year. Could you spend all that? I'm not sure I could really keep tracking of an income stream like that -- I suspect I would often be doing the equivalent of dropping a million on the floor now and then and forgetting to pick it up for a week or so. So the rich buying more vaccines doesn't really equate to not buying enough of other things that prices for those other things drop enough to make more available to the poor (who are budget constrained much more realistically than are the rich).

I doubt that kind of hidden information can affect PredictIt betting odds as it limits the amount each person can bet. 

 

There is bias or Zvi is reaching wrong conclusions with the same info.

It's the placebo effect, obviously; you can't get sick if you zinc it works.

Do they really get higher expected returns from that?

I know they do when the market isn't efficient (relative to the specific investor), but that doesn't help me.

1jmh
I think the answer is yes. I would say it is a very similar strategy to that of corporate financial management and using financial leverage to improve returns and earnings.

Why is it that riskier investments should give higher expected returns?

I ask not because I don't get that the avg person would rather invest on something safe than something unsafe, all else being equal. I get that. I ask because I imagine that investors could bring their total risk down through diversification without harming the expected returns, so big money would prefer the higher expected returns even if they are risky, and in doing that, they'd bring down the extra returns from the riskier investments.

Is it because investments options are so correlated that diversification isn't enough to bring the risk of a portfolio down to acceptable levels? Or some other reason?

1ChristianKl
Investors can always use leverage to get higher risk and higher returns if they want. 

I have the opposite impressions. Science should embrace causality more and do it better. And as a layman term it should be refined so that we stop talking about the causes of any event as a cake where each slice has a name and only one name.


I find it hard to summarize why, at least right now, but my view is sorta similar to Pearl's (though I don't totally like how he puts it). Hopefully later I'll re-read this more attentively and comment something more productive (if no one has done a strictly better job already).

1George3d6
I believe the thing we differ on might just be a semantic, at least as far as redefinition goes. My final conclusion is around the fact that the term is bad because it's ill-defined, but with a stronger definitions (or ideally multiple definitions for different cases) it would be useful, it would also, however, be very foreign to a lot of people.

I meant hiding just the CWish posts. There're enough non-CWish posts to attract people that value the way of thinking in general.


Also, it doesn't sound that bad to attract users through 1 to 1 recommendations only. Or allow unlogged people to read all, but only little by little release the power for new users to interact with the content. Maybe release it all at once if a high karma user vouches for you (they lose it that person gets banned or something). Maybe instead of karma, there could be another value that better reflects how much you are ... (read more)

I was 80% kidding. I do believe that the type of people that could attack this community are hugely people that can't tolerate trying to read and understand the kind of content in here; let alone Scott's 999999 word analytical yet clear essays. They didn't sign up for real thinking and nuance when they went into activism.


And unlike others, I don't think mobs are organized. They look like it, but its some sort of emergent behaviour that can be managed by making it boring for the average mob member to attack.

8wizzwizz4
I agree with this assessment. It almost feels like a hive mind; I've dipped into the peripherals of online mobs before, and have felt "hey, this action is a good idea" thoughts enter my head unbidden. I'd probably participate in such things often, if I didn't have a set of heuristics that (coincidentally) cancels out most of this effect, and a desire not to associate with the sorts of people who form mobs. If the barrier-to-entry is increased to "requires two minutes of unrewarded drudgery, where it's not intuitively obvious what needs to be done" in such a way that a short, well-worded "mob instruction" message can't bypass the effect, it's unlikely a mob will form around such actions. Incidentally, I wonder whether programming for the mob is a field of social psychology.

I still don't get it. agc asked how is the retaliation NOT at attempt to stifle criticism. TurnTrout answered that it is not: it's retaliation for a doxing attack, not for criticism. Then wolflow said something that's "literally" wrong, and metaphorically I didn't get it; probably TurnTrout didn't get it too so he answered the literal interpretation. Etc etc.


But the upvotes-downvotes show I'm not seeing something here.

Of course, that was a given. I just assumed that most of us don't need days of exclusive focus to write an email.

3ChristianKl
It's quite possible to invest more time into "contacting the NYTimes" then writing a single email. You could for example encourage other people to write as well. Especially people who they NYTimes might more care about than random people. 

Simply requiring log-in to read some posts, and limiting the rate of new users (maybe even make it invite only most of the times, like a private torrent tracker), should go a long way to prevent mob attacks.

6Spiracular
A quiz and a day's wait before adding a new user is another option. Make it something that a regular lurker who read the rules would be able to pass easily, but a rando couldn't. SCP wiki did something like this, it seemed to help with quality control. Rotate through 3 different quizes, or scramble the quiz order sometimes, if you want to make automated sign-ups annoying for mobs and spammers. Have the web people track the number of sign-up-quiz fails (it's a nice metric for "is there a mob at the doorstep"). (Edit: Ah, someone already proposed a more-elaborate variant using GPT-X. Simple quizes with a few mild gotcha-questions should be enough of a screen for most cases, though.) A proposal I think I haven't seen posed is giving new members a "trial period." If an average (or randomly-selected) post doesn't have a karma score of at least X by the end of the period (or if it dips below Y at any point), they're out and their stuff is deleted. Ban them from handing out karma until after the trial, or this quickly breaks. This probably still has weird incentive consequences that I'm not seeing, though... ...it does mean having a bit of an evaporative-filter for quality-ratings, and it means links to crappy posts turn into deadlinks in just a matter of time.
6Viliam
If log-in is required to read posts, I am afraid there would be no new users. How would anyone find out that the interesting debate exists in the first place? But if you have no users, there is no one to talk to.

Make a captcha with GPT-X rationalist content against real rationalist content. If you can't tell the difference, you are out :P

Also, train GPT-X on content that triggers mobs, and then use it to keep them busy elsewhere :P

3Rudi C
I have seen some captchas like "What's the capital of [Some Country]" in some forums. We can add some basic captchas that need some level of highschool math and basic familiarity with the sequences for verifying new user registrations, and then wall-off certain posts from unverified users. I'm not sure this will be that  effective though. All it takes to defeat it is some screenshots.
4niplav
This might become difficult with values of X≥4, though…

Given the news cycle speed, it makes sense to get ready for the likely scenarios.

It likely makes more sense to follow Scotts advice to contact the NYTimes to advice not to doxx him then focus on preparing for retaliation.

Load More