Intended tone was humorous, as in the 'you guys have [X]s?' meme, not to deny that Russia has such executives, although I haven't seen anything notable from Sberbank. I've certainly kept an eye on Mistral and SSI if no one else.
However, right now I think I'd list at least 5 American labs and 4 Chinese labs as substantially ahead of anyone anywhere else until proven otherwise, excluding SSI, which is impossible to get a read on.
Making that argument seems... unwise of them.
It's not obvious I would even put AMD on the list, given that they're up on rather big single-stock news, but yes, good note, there is that.
Would a reasonable way to summarize this be that if you train on pretend reward hacking, you get emergent misalignment that takes the form of pretending (playacting) to misbehave and be evil, whereas if you train, as here, on realistic reward hacking examples, it starts realistically (and in some ways strategically) misbehaving and doing other forms of what is essentially reward hacking instead?
Yes.
Knowing that, hopefully you wouldn't?
Oh, of course, how silly of me!
I was not aware of this at the time.
My guess is that on the margin more time should be spent improving the core messaging versus saturating the dialogue tree, on many AI questions, if you combine effort across everyone.
This is potentially important context from Janus/Repligate, including the claim that it is an incomplete/inexact version of something real: https://x.com/repligate/status/1994973338448662858