All of sanyer's Comments + Replies

sanyer20

Is your goal to identify double-cruxes in a podcast? If so, our tool might not be the best for that, since it's supposed to be used in live conversations as a kind of a mediator. Currently, the Double-Crux Bot can only be used either as a bot that you add to your Slack / Discord, or by joining our Discord channel.

Probably more useful for you is this: In a recent hackathon, Tilman Bayer produced a prompt that was used to extract double-cruxes in from a debate. As a model they used Gemini 2.0 Flash Thinking, you can see the prompt here: https://docs.google.c... (read more)

sanyer10

Yes I've heard about it (I'm based in Switzerland myself!)
I don't think it changes the situation that much though, since OpenAI, Anthropic, and Google are still mostly American-owned companies

sanyer44

If we don't build fast enough, then the authoritarian countries could win.

If you build AI for the US, you're advancing the capabilities of an authoritarian country at this point.
I think people who are worried about authoritarian regimes getting access to AGI should seriously reconsider whether advancing US leadership in AI is the right thing to do. After the new Executive Order, Trump seems to have sole interpretation of law, and there are indications that the current admin won't follow court rulings. I think it's quite likely that the US won't be a democr... (read more)

2Nathan Helm-Burger
Have you noticed that AI companies have been opening offices in Switzerland recently? I'm excited about it.
sanyer10

Really interesting work! I have two questions:

1. In the model organisms of misalignment -section it is stated that AI companies might be nervous about researching model organisms because it could increase the likelihood of new regulation, since it would provide more evidence on concerning properties in AI system. Doesn't this depend on what kind of model organisms the company expects to be able to develop? If it's difficult to find model organisms, we would have evidence that alignment is easier and thus there would be less need for regulation.  

2. Wh... (read more)

sanyer20

I've also found "spreadsheet literacy" a recurring skill

What exactly do you use spreadsheets for? Any examples?

sanyer10

Unfortunately the bot works only in Discord and Slack.

sanyer120

Here's another about biking:

sanyer40

Sure! Here's a simple conversation about tea:

sanyer43

Filtering for "people who can afford to pay for a workshop" works pretty well.

This is surprising to me. It seems to assume income is just based on general competence, which doesn't seem true to me. There are a lot of people who seem to have these traits who would find it really difficult to pay for this, and vice versa

3Frederic Bahnson
The filtering described here seems moderately specific but not sensitive, whether or not you agree with the "income implies competence" relationship being strong. It seems true that those who are interested in and can pay for a $4k course of this type are more likely to have 17 of the attributes in question than a person picked at random from the population. However, the filter tells you nothing about, and completely excludes, a large number of people who would fit the "have 17 of these attributes" criteria but not the "have $4k to spend on a course or the time to take it" criteria.  The filter allows in a population of people with above-average chances of meeting the attribute criteria, but blocks a large and unknown number of other people who would also meet that criteria.  It is potentially good for creating a desired environment in the course (having mostly people with a lot of the desired attributes), but is not a good filter for identifying the much larger population of people who might be interested in and benefitted by the course (as described in article as having 17 of the attributes and therefore capable of picking up the other two).
1CronoDAS
Well, you're filtering on both "can afford to pay for a workshop" and "wants to attend a workshop thay charges that much"...
sanyer10

I can see why you think it would be contradictory. The idea in the example was that both of you want better working environment in your workplace, but you have different opinions on how to get there. Whereas the disclaimers were about situations where this is not the case. For example, a situation where the other person doesn't care about a safe working environment. Does that make it clearer?

We are probably going to change the example if it's unclear though

2Gretta Duleba
No explicit deadline, I currently expect that we'll keep the position open until it is filled. That said, I would really like to make a hire and will be fairly aggressively pursuing good applications. I don't think there is a material difference between applying today or later this week, but I suspect/hope there could be a difference between applying this week and next week.
sanyer10

Some of the links in this post don't work for me. They seem to be links to localhost.

sanyer10

Is there a tag for posts applying CFAR-style rationality techniques? I'm a bit surprised that I haven't found one yet, and also a bit surprised by how few posts of people applying CFAR-style techniques (like internal double crux) there are.

sanyer10

It doesn't seem sufficient anymore to have a VPN in order to get an access to Claude. You also need a UK/US -based phone number. If anyone knows how to get around this, please let me know!