All of Randomized, Controlled's Comments + Replies

If you're interested in rerunning this, consider doing it with two tokens that don't have an obvious ordinal relationship. Ie, in the training set, GPT sees "A" then "B" a billion trillion times, and perhaps has a sense that in many contexts "A" is preferred to "B".

1Teun van der Weij
Might be a good way to further test this indeed. So maybe something like green and elephant?

Under this view, perhaps a certain set of interpretability techniques might emerge under a paradigm that makes certain assumptions (eg, that ML kernals are "mostly" linear, that systems are "mostly" stateless, that exotic hacks of the underlying hardware aren't in play, etc). If a series of anomalies were to accumulate that couldn't be explained within this matrix, you might expect to see a new paradigm needed.

Kuhn’s view is that during normal science scientists neither test nor seek to confirm the guiding theories of their disciplinary matrix. Nor do they regard anomalous results as falsifying those theories. (It is only speculative puzzle-solutions that can be falsified in a Popperian fashion during normal science (1970b, 19).) Rather, anomalies are ignored or explained away if at all possible. It is only the accumulation of particularly troublesome anomalies that poses a serious problem for the existing disciplinary matrix. A particularly troublesome anomaly

... (read more)
3Randomized, Controlled
Under this view, perhaps a certain set of interpretability techniques might emerge under a paradigm that makes certain assumptions (eg, that ML kernals are "mostly" linear, that systems are "mostly" stateless, that exotic hacks of the underlying hardware aren't in play, etc). If a series of anomalies were to accumulate that couldn't be explained within this matrix, you might expect to see a new paradigm needed.

The Standford Phil Encylopedia gives:

According to Kuhn the development of a science is not uniform but has alternating ‘normal’ and ‘revolutionary’ (or ‘extraordinary’) phases. The revolutionary phases are not merely periods of accelerated progress, but differ qualitatively from normal science. Normal science does resemble the standard cumulative picture of scientific progress, on the surface at least. Kuhn describes normal science as ‘puzzle-solving’ (1962/1970a, 35–42). While this term suggests that normal science is not dramatic, its main purpose is t

... (read more)
3Randomized, Controlled

I wrote "estimate of up to 15% chance", which is compatible with what you're saying here. But I don't mind updating it to be more precise.

This roughly corresponds to the risk stance I've been taking since finishing my primary course of vaxx.

I just tested + for the first time today. At the very least, this is comforting to read. Thanks!

This comment was made before I updated the question to clarify what's in scope and added the moderation guidelines.

I'm sorry to hear about your health issues with LC. They sounds truly terrible. This question isn't addressed at the topic you're asking about, however.

6Fractalideation
Hiya, Thank you for your kind words and clarifying the scope of your question and sorry for having slightly deviated away from it. When I have the time I will try to find or create a relevant thread and move my post in there if that is possible. In any case very glad to see long Covid and ME/CFS being discussed/addressed on LW, many thanks for that. If ever at some point I feel I can contribute in answering your question within its scope and I have the energy & time to do it, I will gladly do it. In the meantime I will read with interest any answers/comments from the participants to this (from a personal point of view at least) interesting and useful thread. Kind regards

But what is the base rate? How do demographic factors affect the base rate? Vaccination status?

4 month follow up: a lot of heat was very helpful, but underpowered to fully deal with this problem, and also inconvenient to constantly have a hotwater bottle in the back of my sweater like a hunchback of Notre Dame. Gently ramped up, progressive upper back strengthening has been very helpful for getting to a much more sustainably comfortable point. There's still some lingering issues in the spot between the shoulder blades and occasionally I do still treat with heat, but it's much much better now.

2Shoshannah Tekofsky
Interesting! I dug through the comments too and someone referred to this article by Holden Karnofsky, but I don't actually agree with that for adults (kids, sure).
Answer by Randomized, Controlled5-1

A number of answers have already alluded to deprioritizing long-term savings, but you could go farther and borrow from the post-singularity future. Get a mortgage or other loan. This may work out well even in some worlds with friendly superintelligence, because maybe the AGI gives us a luxury automated communism and your debt obligation is somehow disolved.

Umm... this is not financial advice.

Byyyye!

Yeah, as a 1.5 week follow up, the heat treatement has continued to has continued to be effective, with the sense of tightness continuing to resolve. But it's also required a fairly high dose: probably several hours a day sitting with the waterbottle shoved into my t-shirt. I have a more ergonomic chair arriving tomorrow, G-d and B-zzos willing.

6Randomized, Controlled
4 month follow up: a lot of heat was very helpful, but underpowered to fully deal with this problem, and also inconvenient to constantly have a hotwater bottle in the back of my sweater like a hunchback of Notre Dame. Gently ramped up, progressive upper back strengthening has been very helpful for getting to a much more sustainably comfortable point. There's still some lingering issues in the spot between the shoulder blades and occasionally I do still treat with heat, but it's much much better now.

Two days later: this is working quite well so far.

I've worked as a professional programmer for nine years now. I think in at some point a few years ago, it actually began to erode my sense of agency working with computers. At a certain point I became less interested in hobby programming. This was 100% a healthy thing, I started doing things like dancing a lot of contact improv and rock climbing. But when I largely stopped hobby programming, almost all my programming experience was coming from working on production systems. Writing production code is slow. I've routinely had the experience of one or two li... (read more)

1Randomized, Controlled
Two days later: this is working quite well so far.

Interesting. Maybe you're right.

I think my model here is something more like, "ML agents that can do good text generation get rolled out to the masses by google and apple, and then some amount of glue infrastructure is developed or even just you can say, "hey google, help me with my dating profile" and it'll do a thing that's 70th or 80th percentile for writing quality, diluting out those of us who were doing 85th to 99th percentile writing.

2ChristianKl
Grammary already helps me to improve my writing quality and gives me suggestions on how to write better. We will likely see services like Grammary improve and that might increase average writing quality. On the other hand, I don't expect the average person to let an ML agent write their whole profile as that would feel to weird to the average person in the same way that testing photos on photofeeler feels weird to them.
3ChristianKl
Photofeeler also happens to be a commercialized product. 

But shortly after than it'll be available to everyone and we'll have lost another useful signal

4ChristianKl
Most people don't even engage in basic steps like testing their photos on photofeeler right now. I highly doubt that a significant amount of online daters will use AI generated responses even if the tech is available.

See, your first mistake here was not to consult with John Wentworth and Paul Christiano about thermodynamics.

I bet you didn't even ring up JeffTK!

So much for the rationalist virtue of scholarship..

Overall, this post is a bit confusing--it's like someone from a completely different society was suddenly transported to modern USA. What are you asking / telling us?

I had a similar reaction, I had the impression that Zvi is more worldly and jaded than the median LWer.

High quality, interesting, funny writing has been a difficult to manufacture signal up till now. It's possible GPT-n will change that. But folks on LW are probably filtering for people who will filter for making real pretty with the letter forms.

2ChristianKl
Folks on LW are also the kind of people who might take advantage of letting GPT-n do the funny writing for themselves if they lack the skills themselves.

A small bit of anecdata in favor of the vasoconstriction mechanism: I'm about two weeks into a flare up of upper back soreness/tightness/yeck. It's been a bit chronic/mild for... maybe six months, but I did some stuff that flared it up to... 1.3/10ish two weeks ago. Uncomfortable, and concerning, but not crazy. What was worrying was that it didn't seem to be resolving. Sitting at the computer definitely made it worse. I tried some alternations to my seating (currently not very ergonomic for logistic reasons that are taking time to fix). Adding lower back s... (read more)

2Steven Byrnes
Healing Back Pain makes a claim along those lines (p62):

I have been the only weirdo I know of who wears a P100. I say this to emphasize that I've been taking covid seriously.

I don't see any reason to believe covid will be over in two months, or N months, for any value of N less than "however long it takes for humans to come into a new equilibrium with a novel virus." I don't know how long that will be, but 2 seems wrong.

As someone who's worn a p100 a lot, I can also say it's hardly cost free. It has all sorts of social, convenience, physical and psychological costs. Maybe those costs are <<< than your ... (read more)

Answer by Randomized, Controlled40

For an answer that follows a very different intuition, take a look at Does Cosmological Evolution Select for Technology? by Jeffery Shainline. This is up there with aestivation and infinite ethics on the fun idea scale. He gives a nice summary at 2:13:23 on Lex Fridman's podcast to around 2:38. Highly recommend listening to the relevant clip on Fridman, it's pretty great. The entire episode is really interesting, and also contains some other supporting context for Shainline's arguement. Caveat: I haven't read the paper yet. The abstract is:

If the paramet

... (read more)

Thanks for this, Elizabeth. 

Do you have any thoughts on trying to use a PReP protocol with herpes antivirals? I spent about twenty minutes the other day doing some initial searches (just duckduckgo didn't have time to get into pubMed), and didn't turn up anything. Valaciclovir inhibits viral DNA synthesis and is, I believe, fairly safe/mostly well tolerated. 

2Elizabeth
Nothing emprical or quantified. From first principles it sure seems like it should help. There was one paper that claimed an antiviral reduced symptomatic days but not DNA shedding on symptom free days, which I found pretty surprising but didn't have time to follow up on. 
0Vaniver
I normally see it spelled "valacyclovir"; I'm a little confused about how much sense it makes to take prophylactically because most people already have HSV antibodies from a previous exposure, but I think if you're worried about HSV it's probably worth it.

Alarming that it freely "lies" (?) or hallucinates or whatever is going on, rather than replying "I don't know".

3IC Rainbow
That's entirely expected. Hallucilying is a typical habit of language models. They do that unless some prompt engineering have been applied.

sorry to be a rube, but.. why is dredging important? I would guess to make shipping lanes and harbors accessible to larger ships, but is shipping lane and harbor capacity a serious bottleneck? Is dredging capacity one? Is it something else?

8CatCube
This isn't worth its own post, so I'll tack it on to the front of this one: in normal US parlance, a vessel that conducts dredging operations is called a "dredge."  Every comment referring to a "dredger" is like fingernails on a chalkboard inside my brain.  (Not you, Randomized, Controlled.) Answering your question: Efficient shipping requires a deeper channel than normally exists naturally, and a dredge is used to create a channel of the desired dimensions.  And, of course, since it's not a natural river bed nature keeps trying to make it one so you have to keep doing it periodically as the channel fills in with sediments transported from upstream.  Most dredging operations are maintenance dredging to keep this channel open.  It's basically a routine maintenance task, like mowing the grass (mowing the bottom of the river?).  To give you an idea of the scale of how much goes on you can see the federal government's contracting efforts for dredging here (live page, so it'll be different every day): https://sam.gov/search/?index=opp&page=1&pageSize=25&sort=-modifiedDate&sfm%5BsimpleSearch%5D%5BkeywordRadio%5D=ALL&sfm%5BsimpleSearch%5D%5BkeywordTags%5D%5B0%5D%5Bkey%5D=Y1KF&sfm%5BsimpleSearch%5D%5BkeywordTags%5D%5B0%5D%5Bvalue%5D=Y1KF&sfm%5Bstatus%5D%5Bis_active%5D=true In the US, there's a mix of government vessels and contract dredges owned by private firms that do this work.  The contracts on the linked page would represent the effort required over and above the US Army's own dredges, and local port authorities will contract for work required (over and above their own vessels if they own one, of course.) You also have larger construction dredging operations, when creating a new port or a larger shipping channel.  It's often more complex than maintenance dredging, because if a channel has already been created you can typically assume that only moving sands and silts are required to keep that channel open.  With a new channel, or deepening/widening an existing one,
3DirectedEvolution
You can read the Wiki article I just wrote and follow the cited sources to get the lowdown!

Oooooh, these are much better than the ones I was got from nightcafe (I just checked, I was actually using "CLIP guided diffusion".)

DALL-E 2's marshes and sunset marshes are slightly better than what I was getting.

I used nightcafe.studio, a VQGAN+CLIP webservice a bunch in March for the worldbuilding.ai entry I was working on. I found it.. okay for generating images that I could then edit in photoshop, but it took many many tries to get something decent. I'd be particularly interested in seeing what DALLE-E 2 does with these prompts:

"Beautiful giant sunset over the saltwater marsh with tiny abandoned buildings in the distance" "Glass greenhouse with a beautiful forest inside, with people and drones flying" "People dropping into a beautiful marsh from flying drones on a sunny day" "Happy children hanging from flying drones on a sunny day beautiful storybook illustration"

2Swimmer963 (Miranda Dixon-Luinenburg)
"Happy children hanging from flying quadcopter drones on a sunny day, beautiful storybook illustration".  Adding "quadcopter" made the drones much easier to recognize! 
4Swimmer963 (Miranda Dixon-Luinenburg)
"People falling from robotic flying drones into a beautiful marsh, on a sunny day, matte painting" I think some of the "people" are also robotic? DALL-E is trying though! 
4Swimmer963 (Miranda Dixon-Luinenburg)
"People and drones flying around inside a giant glass greenhouse with a beautiful forest inside, 3D rendering".  I swapped the order because when entered verbatim, the prompt you gave had DALL-E forgetting to include any people or drones. I find it's more likely to actually include smaller or foreground features of a scene if I put them at the front and describe the larger backdrop after.  "3D rendering" is the best I got out of several style prompts (I tried "digital art" and "screenshots from a scifi blockbuster movie" as well.) 
3Swimmer963 (Miranda Dixon-Luinenburg)
"Beautiful giant sunset over the saltwater marsh with tiny abandoned buildings in the distance, matte painting" (came out better IMO than the original prompt with no style guidance, which sort of forgot about the buildings.) 

I'm not sure that "first-amendment style free speech" is a good frame for discussing speech issues on the internet. Much of what the internet, and social media does, is provide amplification rather than just speech. Maybe there should be more theorization of what rights and expectations citizens should have w/r/t being able to scale or amplify their speech.

As a kid I developed something that basically is a body scan. I worried about monsters while I was lying in bed. At some point I started imagining a field of green energy starting at the very bottom of my feet and working its way very slowly up each leg, then my torso, down my arms, and finally to the very top of my head. Once I was cloaked in this protective green shield I was safe, as long as I didn't move. It broke if I moved. But if I stayed still I could then inflate it outwards, enveloping my house, neighborhood, city, and eventually the entire world... (read more)

I'm sorry you've had to go through this. I wish it was otherwise.

2Gunnar_Zarncke
It is supposed to let you think if you remember the answer or can come up with it yourself. I explained it in this earlier shortform. 

Wait, rather than some cuckoo idea that this is related to altitude or marginally less cuckoo lithium theory, what if it's about regional cuisine and eating habits?

4Adele Lopez
Under the lithium theory, the amount of lithium in water/food accumulates as water sources go downhill, thus explaining the altitude connection, so just an altitude connection is strictly less "cuckoo". From the obesity map and my experiences living across the country, I would be surprised if regional cuisine/habits ends up mattering more than altitude, but might be worth testing more rigorously.
6Matthew Barnett
I'm currently persuaded that it's probably some combination of hypoxia (as suggested here), demographics, and traditional cuisine being more common at higher altitudes. Maybe it's a little bit of groundwater contamination? I'd really want to see a post that actually thoroughly investigated each of these hypotheses, rather than just sort of pointing to a factor on a graph and saying "yep it's probably that".

Also, before and during WW2, Japan had the most shockingly horrifying death-cult-y style leadership and culture. Dan Carlin does a good job sketching this in his Supernova in the East series.

I am not sure if this dichotomy is a helpful one but we can see Templarrr as stating that there is a theoretic 'failing' which need not be mutually exclusive with the pragmatic 'usefulness' of a theory.

That was what I was also trying to say, in a very pithy way : )

All these criticism can be true, and AGI can still be an existential threat.

2Hickey
I am not sure if this dichotomy is a helpful one but we can see Templarrr as stating that there is a theoretic 'failing' which need not be mutually exclusive with the pragmatic 'usefulness' of a theory. Both of you can be right and that would still mean that it is worthwhile to think up how to ameliorate/solve the theoretical problems posed and not devalue (or discontinue) the work being done in the pragmatic domain.

I saw Katja's post too, and had a reasonably big update from it, although I don't think she addresses the impact of vaccination status on long-cov probability or badness, so my update is smaller than it might have been.

Thank you for this. I'd say prior to reading this I was around 70% that for someone recently vaccinated or boosted Omicron isn't really worth worrying about, and getting Omicron at some point might even be +EV, due to cross immunity effect. I'd say now I'm around.. 45% on this?

My sense is that EAs in general have not been in the "strict lockdown" mode, trying to do some more careful tradeoffs to allow for things like EAGs. This struck me as.. reasonable-ish at the time, (even up through delta). But if Omicron has a similar long-covid story, this suggests that the upcoming EAGs may not be a good idea in person.

2KatjaGrace
I did ask about it, data here (note that n is small): https://www.lesswrong.com/posts/iTH6gizyXFxxthkDa/positly-covid-survey-long-covid

I feel unreasonably validated by this

6Dirichlet-to-Neumann
I feel unreasonably validated by this too to be honest.

I posted mine a day after this comment, do you still feel that way?

Answer by Randomized, Controlled50

Some automated phone queuing systems systems (the things where you call in and get put on hold for three days listening to music) offer a service where you can press a button, hang up and they'll call you back when you would have gotten to the end of the queue.

This should be a mandatory for all these systems.

3Dirichlet-to-Neumann
I think this one is totally uncontroversial and I'm voting for whoever has this in their platform.
Answer by Randomized, Controlled10

I would have guessed this was related to the Scott Alexander/NYT thing, but didn't that resolve months ago?

1Nicole Dieker
I'm sure I'm not the person to say whether anything resolved (and also that is not what I am referring to in this case).

A rough typology of music micro-skills from my first year and a half learning saxophone.

  • Reading music

    • Getting notes + annotations correct

      • Binding notes + annotations + physical movements together crisply. Annotations were learned significantly after notes.
      • Reading the rests as intervals
    • Reading ahead on the sheet

    • Executing the next physical move correctly

    • Keeping your place in the music synchronized with what you're fingering...

    • ...even while you're beginning to chunk the music into phrases

    • Knowing where I am in the piece of music, being a

... (read more)

Sorry, I was being a bit flip/insider-y. Probably inappropriately so.

I'm curious how much you've engaged with the AI Safety literature/arguments?

"Yudkowsky's DM" --> Eliezer Yudkowsky's [Twitter] Direct Messages.

D𝜋140

I think I have expressed my views on the matter of responsibility quiet clearly in the conclusion.

I just checked Yudkowsky on Google. He founded this website, so good.

Here is not the place to argue my views on super-intelligence, but I clearly side with Russell and Norvig. Life is just too complex; luckily.

As for safety, the title of Jessica Taylor's article is:

"Quantilizers: A Safer Alternative to Maximizers for Limited Optimization".

I will just be glad to have proved that alternative to be effective.

In the software industry we have the concept of responsible disclosure when one finds a security exploit in a published package.

Do we have responsible disclosure procedures for something that may represent a fundamental AI capability advancement? Who would you even disclose to? Slide into Yudkowsky's DMs?

dkirmani*160

MIRI should have a form you can fill out (or an email address) specifically for people who think they've advanced AI and are worried about ending the world with it. MIRI should also have Cold-War-style hotlines to OpenAI, DeepMind, and the other major actors.

4D𝜋
I am not sure I understand your question (sorry, I do not know what is Yudkowsky'DMs) I basically disclosed, to all, that the way we all think we think, does work. What kind of responsibility could that bear ?

There were periods of time when I would look at the site two or three times a week, but often just to check the 'adjusted prevalence' numbers, which are hidden by default under the "Details" dropdown. Oh, yeah, prevalence hasn't changed much, no need to update cached models, cool. Probably worth ~$20 aggregated.

The biggest update for me was realizing that going to the dentist was much less risky than it intuitively felt. This was great, probably worth at least $100 (or, I dunno, maybe worth negative money if the dentist is actually net harmful to me, which... (read more)

4Sameerishere
My willingness to pay for a quantification of risks for long covid: $500
Load More