All of Jsevillamol's Comments + Replies

I'm talking from a personal perspective here as Epoch director.

  • I personally take AI risks seriously, and I think they are worth investigating and preparing for.
  • I co-started Epoch AI to get evidence and clarity on AI and its risks and this is still a large motivation for me.
  • I have drifted towards a more skeptical position on risk in the last two years. This is due to a combination of seeing the societal reaction to AI, me participating in several risk evaluation processes, and AI unfolding more gradually than I expected 10 years ago.
  • Currently I am more worr
... (read more)

The ability to pay liabilities is important to factor in, and this illustrates it well. For the largest prosaic catastrophes, this might well be the dominant consideration.

For smaller risks, I suspect that in practice mitigation, transaction, and prosecution costs are what dominate the calculus of who should bear the liability, both in AI and more generally.

What's the FATE community? Fair AI and Tech Ethics?

Fairness, Accountability, Transparency, Ethics. I think this research community/area is often also called "AI ethics"

We have conveniently just updated our database if anyone wants to investigate this further!
https://epochai.org/data/notable-ai-models
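For example, here is a minimal sketch of how one might start poking at the data, assuming you export the database to a CSV. The file name and column names below are assumptions and may not match the actual export.

```python
import pandas as pd

# Hypothetical CSV export of https://epochai.org/data/notable-ai-models;
# the file name and column names are assumptions.
df = pd.read_csv("notable_ai_models.csv")

# Keep rows with a known training compute estimate and plot the trend over time.
df = df.dropna(subset=["Publication date", "Training compute (FLOP)"])
df["Publication date"] = pd.to_datetime(df["Publication date"])

ax = df.plot.scatter(x="Publication date", y="Training compute (FLOP)", logy=True)
ax.set_title("Training compute of notable ML models over time")
```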

Here is a "predictable surprise" I don't discussed often: given the advantages of scale and centralisation for training, it does not seem crazy to me that some major AI developers will be pooling resources in the future, and training jointly large AI systems.

3ryan_greenblatt
Relatedly, over time as capital demands increase, we might see huge projects which are collaborations between multiple countries. I also think that investors could plausibly end up with more and more control over time if capital demands grow beyond what the largest tech companies can manage. (At least if these investors are savvy.) (The things I write in this comment are commonly discussed amongst people I talk to, so not exactly surprises.)

I've been tempted to do this sometime, but I fear the prior is performing one very important role you are not making explicit: defining the universe of possible hypotheses you consider.

In turn, that universe of hypotheses defines what Bayesian updates look like. Here is a problem that arises when you ignore this: https://www.lesswrong.com/posts/R28ppqby8zftndDAM/a-bayesian-aggregation-paradox
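To make the issue concrete, here is a toy numerical sketch (my own example, not taken from the linked post): an expert reports likelihoods for an observation z under hypotheses A, B and C, and you coarse-grain B and C into a single hypothesis. The "likelihood of B∪C" is then a prior-weighted average of the individual likelihoods, so which prior you use changes the update itself, not just the posterior.

```python
# Toy illustration: merging hypotheses makes the reported "likelihood" prior-dependent.
# Expert-reported likelihoods P(z | hypothesis); hypothetical numbers.
likelihood = {"A": 0.9, "B": 0.5, "C": 0.1}

def merged_likelihood(prior_b: float, prior_c: float) -> float:
    """P(z | B or C) given a prior over B and C (law of total probability)."""
    return (likelihood["B"] * prior_b + likelihood["C"] * prior_c) / (prior_b + prior_c)

print(merged_likelihood(0.5, 0.5))  # 0.30 with a uniform prior over B and C
print(merged_likelihood(0.9, 0.1))  # 0.46 if B is strongly favoured a priori
```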

shrug 

I think this is true to an extent, but a more systematic analysis needs to back this up.

For instance, I recall quantization techniques working much better after a certain scale (though I can't seem to find the reference...). It also seems important to validate that techniques to increase performance apply at large scales. Finally, note that the frontier of scale is growing very fast, so even if these discoveries were made with relatively modest compute compared to the frontier, that is still a tremendous amount of compute!

even a pause which completely stops all new training runs beyond current size indefinitely would only ~double timelines at best, and probably less

 

I'd emphasize that we currently don't have a very clear sense of how algorithmic improvement happens, and it is likely mediated to some extent by large experiments, so I think a pause is likely to slow timelines more than this implies.

2johnswentworth
I mean, we can go look at the things which people do when coming up with new more-efficient transformer algorithms, or figuring out the Chinchilla scaling laws, or whatever. And that mostly looks like running small experiments, and extrapolating scaling curves on those small experiments where relevant. By the time people test it out on a big run, they generally know very well how it's going to perform. The place where I'd see the strongest case for dependence on large compute is prompt engineering. But even there, it seems like the techniques which work on GPT-4 also generally work on GPT-3 or 3.5?

I agree! I'd be quite interested in looking at TAS data, for the reason you mentioned.

I think Tetlock and company might have already done some related work?

Question decomposition is part of the superforecasting commandments, though I can't recall off the top of my head if they were RCT'd individually or just as a whole.

ETA: This is the relevant paper (h/t Misha Yagudin). It was not about the 10 commandments. Apparently those haven't been RCT'd at all?

2niplav
I don't remember anything specific from reading their stuff, but that would of course be useful. Sadly, I haven't been able to find any more recent investigations into decomposition, e.g. Connected Papers for MacGregor 1999 gives nothing worthwhile after 2006 on a first skim, but I'll perhaps look more at it.

I cowrote a detailed response here

https://www.cser.ac.uk/news/response-superintelligence-contained/

Essentially, this type of reasoning proves too much, since it implies we cannot show any properties whatsoever of any program, which is clearly false.

Here is some data via Matthew Barnett and Jess Riedl.

The number of cumulative miles driven by Cruise's autonomous cars is growing exponentially, at roughly 1 OOM per year.

https://twitter.com/MatthewJBar/status/1690102362394992640

8Daniel Kokotajlo
Oh shit! So, seems like my million rides per day metric will be reached sometime in 2025? That is indeed somewhat faster than I expected. Updating, updating... Thanks!

That is, to a very rough approximation, correct.

Davidson's takeoff model illustrates this point, where a "software singularity" happens for some parameter settings due to software not being constrained to the same degree by capital inputs.

I would point out, however, that our current understanding of how software progress happens is somewhat poor. Experimentation is definitely a big component of software progress, and it is often understated on LW.

More research on this soon!

algorithmic progress is currently outpacing compute growth by quite a bit

This is not right, at least in computer vision. They seem to be the same order of magnitude.

Physical compute has grown at 0.6 OOM/year, while the compute required to reach a given level of performance has decreased at 0.1 to 1.0 OOM/year; see a summary here or an in-depth investigation here.
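As a quick back-of-the-envelope with those figures (a sketch; I'm simply adding the growth rates in log space):

```python
# Growth rates in OOM/year (orders of magnitude per year), as quoted above.
physical_compute_growth = 0.6        # growth of physical training compute
algorithmic_efficiency = (0.1, 1.0)  # reduction in compute needed for fixed performance

# Algorithmic progress is the same order of magnitude as physical compute growth,
# and the two stack into roughly 0.7 to 1.6 OOM/year of "effective" compute growth.
effective = [physical_compute_growth + g for g in algorithmic_efficiency]
print(effective)  # [0.7, 1.6]
```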

Another relevant quote

Algorithmic progress explains roughly 45% of performance improvements in image classification, and most of this occurs through improving compute-efficiency.

1meijer1973
Algorithmic improvement has more FOOM potential. Hardware always has a lag. 
3habryka
Cool, makes sense. Sounds like I remembered the upper bound for the algorithmic efficiency estimate. Thanks for correcting!

The superscript T is not a transpose! It is the timestep T. We are raising the expression to the T-th power.

Thanks!

Our current best guess is that this includes costs other than the amortized compute of the final training run.

If no extra information surfaces we will add a note clarifying this and/or adjust our estimate.

1Edouard Harris
Gotcha, that makes sense!

Thanks Neel!

The difference between tf16 and FP32 comes to a ~15x factor IIRC. Though ML developers also seem to prioritise characteristics other than cost-effectiveness when choosing GPUs, like raw performance and interconnect, so you can't just multiply the top price-performance we showcase by this factor and expect it to match the cost-performance of the largest ML runs today.

More soon-ish.

Because there is more data available for FP32, so it's easier to study trends there.

We should release a piece soon about how the picture changes when you account for different number formats, plus considering that most runs happen with hardware that is not the most cost-efficient.

Note that Richard is not treating Knightian uncertainty as special and unquantifiable; instead, he is giving examples of how to treat it like any other uncertainty, which he explicitly quantifies and incorporates into his predictions.

I'd prefer calling Richard's notion "model error" to separate the two, but I'm also okay appropriating the term as Richard did to point to something coherent.

7Richard_Ngo
Yeah, so I have mixed feelings about this. One problem with the Knightian uncertainty label is that it implies some level of irreducibility; as Nate points out in the sequence (and James points out below), there are in fact a bunch of ways of reducing it, or dividing it into different subcategories. On the other hand: this post is mainly not about epistemology, it's mainly about communication. And from a communication perspective, Knightian uncertainty points at a big cluster of things that form a large proportion of the blocker between rationalists and non-rationalists communicating effectively about AI. E.g. as Nate points out: So if you have the opinion that Nate and many other rationalists don't know how to do these things enough, then you could either debate them about epistemology, or you could say "we have different views about how much you should do this cluster of things that Knightian uncertainty points to, let's set those aside for now and actually just talk about AI". I wish I'd had that mental move available to me in my conversations with Eliezer so that we didn't get derailed into philosophy of science; and so that I spent more time curious and less time annoyed at his overconfidence. (And all of that applies orders of magnitude more to mainstream scientists/ML researchers hearing these arguments.)

To my knowledge, we currently don’t have a way of translating statements about “loss” into statements about “real-world capabilities”.

 

Now we do!

My intuition is that it's not a great approximation in those cases, similar to how in the regular Laplace rule the empirical approximation is not great when you have e.g. N<5.

I'd need to run some calculations to confirm that intuition though.

This site claims that the string SolidGoldMagikarp was the username of a moderator involved somehow with Twitch Plays Pokémon

https://infosec.exchange/@0xabad1dea/109813506433583177

4mwatkins
Partially true. SGM was a redditor, but seems to have got tokenised for other reasons, full story here: https://twitter.com/SoC_trilogy/status/1623118034960322560 "TPPStreamerBot" is definitely a Twitch Plays Pokemon connection. Its creator has shown up in the comments here to explain what it was.
4Jsevillamol
Here is a 2012 meme about SolidGoldMagikarp https://9gag.com/gag/3389221

I still don't understand - did you mean "when T/t is close to zero"?

1dust_to_must
Oops yes, sorry!
1dust_to_must
Oops, I meant lambda! edited :) 

That's exactly right, and I think the approximation holds as long as T/t>>1.

This is quite intuitive - as the amount of data goes to infinity, the rate of events should equal the number of events so far divided by the time passed.
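(To make this concrete, here is a minimal numerical sketch. I'm assuming a constant-rate Poisson process and a uniform prior on the rate, so that after n events in time T the posterior over the rate is Gamma(n+1, T); that particular choice of prior is my assumption for illustration.)

```python
import numpy as np

# Assume a constant-rate Poisson process: n events observed in time T,
# uniform prior on the rate, so the posterior rate is Gamma(n + 1, T).
def p_no_event_exact(n: int, T: float, t: float) -> float:
    """Posterior predictive P(no events in the next t), marginalising over the rate."""
    return (T / (T + t)) ** (n + 1)

def p_no_event_approx(n: int, T: float, t: float) -> float:
    """Plug-in approximation e^{-lambda*t} with lambda = n / T."""
    return np.exp(-(n / T) * t)

n, T = 10, 100.0
# The approximation is good when T/t >> 1 and degrades as t approaches T.
for t in [1.0, 10.0, 100.0]:
    print(t, p_no_event_exact(n, T, t), p_no_event_approx(n, T, t))
```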

1dust_to_must
Thanks for the confirmation! In addition to what you say, I would also guess that e^(−λt) is a reasonable guess for P(no events in time t) when t > T, if it's reasonable to assume that events are Poisson-distributed. (but again, open to pushback here :)

If you want to join the Spanish-speaking EA community, you can do so through this link!

I agree with the sentiment that indiscriminate regulation is unlikely to have good effects.

I think the step that is missing is analysing the specific policies No-AI Art Activists are likely to advocate for, and whether it is a good idea to support them.

My current sense is that data helpful for alignment is unlikely to be public right now, and so harder copyright would not impede alignment efforts. The kind of data that I could see being useful are things like scores and direct feedback. Maybe at most things like Amazon reviews could end up being useful for to... (read more)

Great work!

Stuart Armstrong gave one more example of a heuristic argument based on the presumption of independence here.

https://www.lesswrong.com/posts/iNFZG4d9W848zsgch/the-goldbach-conjecture-is-probably-correct-so-was-fermat-s

There are a huge number of examples like that floating around in the literature; we link to some of them in the writeup. I think Terence Tao's blog is the easiest place to get an overview of these arguments; see this post in particular, but he discusses this kind of reasoning often.

I think it's easy to give probabilistic heuristic arguments for about 80 of the ~100 conjectures in the Wikipedia category "Unsolved problems in number theory".

About 30 of those (including the Goldbach conjecture) follow from the Cramér random model of the primes. Another 9 a... (read more)
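To give a flavour of this style of argument, here is my own crude toy version (not from the thread): treat a number near n as "prime" with independent probability 1/ln n and ignore local correction factors. Under that model the expected number of ways to write an even N as a sum of two primes grows like N/(2 ln²N), which goes to infinity, so heuristically one expects no large counterexamples to Goldbach.

```python
import math
from sympy import isprime

def goldbach_count(n: int) -> int:
    """Actual number of ways to write even n as p + q with p <= q, both prime."""
    return sum(1 for p in range(2, n // 2 + 1) if isprime(p) and isprime(n - p))

def crude_heuristic(n: int) -> float:
    """Cramer-style estimate ~ n / (2 ln^2 n), ignoring local (mod small prime) factors."""
    return n / (2 * math.log(n) ** 2)

# The crude estimate undercounts by a roughly constant factor (the omitted local factors),
# but both grow without bound, which is the point of the heuristic argument.
for n in (1_000, 10_000, 100_000):
    print(n, goldbach_count(n), round(crude_heuristic(n)))
```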

Here are my quick takes from skimming the post.

In short, the arguments I think are best are A1, B4, C3, C4, C5, C8, C9 and D. I don't find any of them devastating.

A1. Different calls to ‘goal-directedness’ don’t necessarily mean the same concept

I am not sure I parse this one. I am reading it as "AI systems might be more like imitators than optimizers" from the example, which I find moderately persuasive.

A2. Ambiguously strong forces for goal-directedness need to meet an ambiguously high bar to cause a risk

I am not sure I understand this one either. I am readi... (read more)

Eight examples, no cherry-picking:

 

Nit: Having a wall of images makes this post unnecessarily hard to read.
I'd recommend making a 4x2 collage with the photos so they don't take up as much space.

9habryka
I edited it to be a table (my guess is this was primarily the result of images being displayed differently by default on the AI Impacts website and LessWrong).

As is often the case, I just found out that Jaynes was already discussing an issue similar to the paradox here in his seminal book.

This Wikipedia article summarizes the gist of it.

Ah sorry for the lack of clarity - let's stick to my original submission for PVE

That would be:
 

[0,1,0,1,0,0,9,0,0,1,0,0]
 

Yes, I am looking at decks that appear in the dataset, and more particularly at decks that have faced a deck similar to the rival's.

Good to know that one gets similar results using the different scoring functions.

I guess that maybe the approach does not work that well ¯\_(ツ)_/¯ 

3aphyer
Seeking clarification here: which of these decks are you currently submitting?  If you need more time to decide, let me know.

Thank you for bringing this up!

I think you might be right, since the deck is quite undiverse, and according to the rest of the answers diversity is important. That being said, I could not find the mistake in the code at a glance :/

Do you have any opinions on [1, 1, 0, 1, 0, 1, 2, 1, 1, 3, 0, 1]? According to my code, this would be the worst deck amongst the decks that played against a deck similar to the rival's.

1Measure

Marius Hobbhahn has estimated the number of parameters here. His final estimate is 3.5e6 parameters.

Anson Ho has estimated the training compute (his reasoning at the end of this answer). His final estimate is 7.8e22 FLOPs.

Below I made a visualization of the parameters vs training compute of n=108 important ML systems, so you can see how DeepMind's system (labelled GOAT in the graph) compares to other systems.

[Final calculation]
(8 TPUs)(4.20e14 FLOP/s)(0.1 utilisation rate)(32 agents)(7.3e6 s/agent) = 7.8e22 FLOPs

==========================
NOTES BELOW

[Ha

... (read more)
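(For reference, the final calculation quoted above is a straightforward product of the listed factors; a quick check:)

```python
# Reproducing the final calculation quoted above.
tpus = 8
peak_flop_per_s = 4.20e14   # per TPU
utilisation = 0.1
agents = 32
seconds_per_agent = 7.3e6

total_flop = tpus * peak_flop_per_s * utilisation * agents * seconds_per_agent
print(f"{total_flop:.1e}")  # 7.8e22 FLOP, matching the quoted figure
```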
4Daniel Kokotajlo
Thanks so much! So, for comparison, fruit flies have more synapses than these XLAND/GOAT agents have parameters! https://en.wikipedia.org/wiki/List_of_animals_by_number_of_neurons

Here is my very bad approach after spending ~one hour playing around with the data

  1. Filter decks that fought against a deck similar to the rival's, using a simple measure of distance (sum of absolute differences between the deck components)
  2. Compute a 'score' of the decks. The score is defined as the sum of 1/deck_distance(deck) * (1 or -1 depending on whether the deck won or lost against the challenger) 
  3. Report the deck with the maximum score

So my submission would be: [0,1,0,1,0,0,9,0,0,1,0,0]

 

Code
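(The original code appears to have been spoilered out of this view; below is a minimal sketch of the three steps described above, not the original code. The file name, column names, the rival deck, the distance cutoff of 3, and the +1 in the weight are all assumptions, and I'm reading "deck_distance" as the distance from the opposing deck in each game to the rival's deck.)

```python
import numpy as np
import pandas as pd

# Minimal sketch of the three steps above, NOT the original (spoilered) code.
# Assumed layout: one row per game, twelve card-count columns for the player's deck
# ("my_*"), twelve for the opponent's ("opp_*"), and a boolean "player_won".
games = pd.read_csv("dnd_sci_deck_data.csv")              # hypothetical file name
rival = np.array([2, 0, 1, 1, 0, 3, 0, 1, 0, 0, 2, 2])    # placeholder for the rival's deck

my_cols = sorted(c for c in games.columns if c.startswith("my_"))
opp_cols = sorted(c for c in games.columns if c.startswith("opp_"))

# 1. Keep games whose opponent deck is close to the rival's
#    (distance = sum of absolute differences between card counts).
dist_to_rival = np.abs(games[opp_cols].to_numpy() - rival).sum(axis=1)
close = games[dist_to_rival <= 3].copy()
close["weight"] = 1.0 / (1.0 + dist_to_rival[dist_to_rival <= 3])  # +1 avoids division by zero

# 2. Score each player deck: sum over its games of weight * (+1 if it won, -1 if it lost).
close["signed_weight"] = close["weight"] * np.where(close["player_won"], 1.0, -1.0)
scores = close.groupby(my_cols)["signed_weight"].sum()

# 3. Report the deck with the maximum score.
print(scores.idxmax(), scores.max())
```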

3Measure
2aphyer
Could you try reformatting this, please? It looks like your answer hasn't been successfully spoilered out. Thank you!

Seems like you want to include A, L, P, V, E in your decks, and avoid B, S, K. Here is the correlation between the quantity of each card and whether the deck won. The ordering is ~similar when computing the inclusion winrate for each card.
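(A sketch of the calculation behind this, reusing the assumed DataFrame layout from the sketch further up; the column names remain assumptions.)

```python
# Correlation between each card's count in the player's deck and whether the deck won
# ("games" and "my_cols" as defined in the earlier sketch).
card_win_corr = games[my_cols].corrwith(games["player_won"].astype(float))
print(card_win_corr.sort_values(ascending=False))
```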

Thanks for the comment!

I am personally sympathetic to the view that AlphaGo Master and AlphaGo Zero are off-trend.

In the regression with all models, their inclusion does not change the median slope but drastically increases the noise, as you can see for yourself in the visualization by selecting the option 'big_alphago_action = remove' (see the table below for a comparison of regressing the large-model trend without vs with the big AlphaGo models).

In appendix B we study the effects of removing AlphaGo Zero and AlphaGo Master when studying record-setting models. The upp... (read more)

Following up on this: we have updated appendix F of our paper with an analysis of different choices of the threshold that separates large-scale and regular-scale systems. Results are similar independently of the threshold choice.

Thanks for engaging!

 

To use this theorem, you need both an x (your data / evidence), and a θ (your parameter).

Parameters are abstractions we use to simplify modelling. What we actually care about is the probability of unknown events given past observations.

 

You start out discussing what appears to be a combination of two forecasts

To clarify: this is not what I wanted to discuss. The expert is reporting how you should update your priors given the evidence, and remaining agnostic on what the priors should be.

 

A likelihood is

... (read more)
1JonasMoss
Okay, thanks for the clarification! Let's see if I understand your setup correctly. Suppose we have the probability measures pE and p1, where pE is the probability measure of the expert. Moreover, we have an outcome x∈{A,B,C}. In your post, you use p1(x∣z) ∝ pE(z∣x)p1(x), where z is an unknown outcome known only to the expert. To use Bayes' rule, we must make the assumption that p1(z∣x)=pE(z∣x). This assumption doesn't sound right to me, but I suppose some strange assumption is necessary for this simple framework. In this model, I agree with your calculations. I'm not sure. When we're looking directly at the probability of an event x (instead of the probability of the probability of an event), things get much simpler than I thought. Let's see what happens to the likelihood when you aggregate from the expert's point of view. Letting x∈{A,B,C}, we need to calculate the expert's likelihoods pE(z∣A) and pE(z∣B∪C). In this case, pE(z∣B∪C) = pE(B∪C∣z)pE(z) / pE(B∪C) = [pE(B∣z)+pE(C∣z)]pE(z) / pE(B∪C) = [pE(z∣B)pE(B)+pE(z∣C)pE(C)] / [pE(B)+pE(C)], which is essentially your calculation, but from the expert's point of view. The likelihood pE(z∣B∪C) depends on pE(B∪C), the prior of the expert, which is unknown to you. That shouldn't come as a surprise, as he needs to use his prior in order to combine the probabilities of the events B and C. But the calculations are exactly the same from your point of view, leading to p1(z∣B∪C) = [pE(z∣B)p1(B)+pE(z∣C)p1(C)] / [p1(B)+p1(C)]. Now, suppose we want to generally ensure that pE(z∣B∪C)=p1(z∣B∪C), which is what I believe you want to do, and which seems pretty natural, at least since we're allowed to assume that pE(z∣x)=p1(z∣x) for all simple events x. To ensure this, we will probably have to require that your priors are the same as the expert's; in other words, your joint distributions are equal, or p1(z,x)=pE(z,x). Do you agree with this summary?

Great sequence - it is a nice compendium of the theories and important thought experiments.

I will probably use this as a reference in the future, and refer other people here for an introduction.

Looking forward to future entries!

1Heighn
Awesome, thanks for your comment!

I am glad Yair! Thanks for giving it a go :)

Those I know who train large models seem to be very confident we will get 100 Trillion parameter models before the end of the decade, but do not seem to think it will happen, say, in the next 2 years. 

 

FWIW if the current trend continues we will first see 1e14 parameter models in 2 to 4 years from now.

I am pretty pumped about this. Google Docs + LaTeX support is a huge game changer for me.

There's also a lot of research that didn't make your analysis, including work explicitly geared towards smaller models. What exclusion criteria did you use? I feel like if I was to perform the same analysis with a slightly different sample of papers I could come to wildly divergent conclusions.


It is not feasible to do an exhaustive analysis of all milestone models. We are necessarily missing some important ones, either because we are not aware of them, because they did not provide enough information to deduce the training compute, or because we haven't gott... (read more)

Great questions! I think it is reasonable to be suspicious of the large-scale distinction.

I do stand by it - I think the companies discontinuously increased their training budgets around 2016 for some flagship models.[1] If you mix these models with the regular trend, you might believe that the trend was doubling very fast up until 2017 and then slowed down. It is not an entirely unreasonable interpretation, but it explains the discontinuous jumps around 2016 less well. Appendix E discusses this in depth.

The way we selected the large-scale models is half ... (read more)

Following up on this: we have updated appendix F of our paper with an analysis of different choices of the threshold that separates large-scale and regular-scale systems. Results are similar independently of the threshold choice.
