All of sflicht's Comments + Replies

On February 25, 2024, President Zelenskyi revealed that the Ukrainian losses had amounted to 31,000, while there have been 180,000 Russian soldiers killed.

Do you have any reason to believe these figures are accurate?

-4Shannar
Perhaps we will know the true figures after the end of the war, but approximate non-official calculations show that in Ukraine the losses are much higher, about 180-200 thousand. And the losses are more like 1 to 2.

and that that will come with the need for content moderation

It will certainly come with calls for content moderation, but for all the reasons you allude to, the assertion that there will be a need for such moderation seems quite tendentious.

1Joerg Weiss
Agreed, good point! Let's say there will be non-stupid arguments in favor of content moderation. For example (top of my head): * Children need to learn using creating tools; if a video editor was able to generate hard core porn or excessively violent content, it's tricky to leave them alone with it. * AI doesn't just generate content, it brings in knowledge. Some knowledge is restricted from circulation for good reasons (This is basically the bio-terrorism argument). I tend to think it's better to limit the model's capability so it fits the use case (e. g. no ability to create porn in software used by children) than having a Llama Guard style moderation tool in the loop supervising both user and model behavior; but I'm still very vague on my own position in the debate. I also don't know the different approaches people are trying out. I'd really like to read up on it though. (What I do know: I don't want the Metas and Googles of the world being in charge of defining the control mechanisms for something that will be involved in basically all of our creative processes.)

I agree with @faul_sname that the bash is more readable.

But maybe a better (more readable/maintainable) Python alternative is to explicitly use Amazon's Python API for S3 downloads? I've never used it myself, but googling suggests:

import json
import boto3
from io import BytesIO
import gzip

try:
     s3 = boto3.resource('s3')
     key='YOUR_FILE_NAME.gz'
     obj = s3.Object('YOUR_BUCKET_NAME',key)
     n = obj.get()['Body'].read()
     gzipfile = BytesIO(n)
     gzipfile = gzip.GzipFile(fileobj=gzipfile)
     content = gzipfile.read()
     print(content)
... (read more)
2jefftk
I don't see how that solves any of the problems I have here?
sflicht4-3

I'd argue that using argh is just as easy and strictly better:


$ cat test.py
#!/usr/local/bin/python
import argh

def start(width, depth, height):
   print(float(width) * float(depth) * float(height))
   
if __name__ == '__main__':
   p = argh.ArghParser()
   p.set_default_command(start)
   p.dispatch()

$ ./test.py -h
usage: test.py [-h] width depth height

positional arguments:
 width       -
 depth       -
 height      -

options:
 -h, --help  show this help message and exit

$ ./test.py 1 2 3
6.0

$ ./test.py 1 2 3 4
usage: test.py [-h] width depth hei
... (read more)
4philh
Another reason this is not strictly better, is that for someone reading the source code it's likely less clear what's going on. (But I could imagine I find the module useful at some point.)
4jefftk
argh isn't in the standard library. For a quick script this matters much less than if you're building something that would stick around but it's still a downside. It's also a bit of boilerplate I'd need to look up every time to get started on a script. In this case the benefit is minimal: slightly better error messages. The better path for improving the script is mildly helpful, but switching once you need something other than mandatory positional arguments is not hard.

AFAICT those fines have not been for missing cookie banners. And if I were Mark Zuckerberg, I might think to myself, "the EU is going to shake us down for 'privacy violations' no matter what we do, so why should I bother making our user experience worse with annoying cookie banners?" 

(Also, to some extent, FAANG-scale companies may get fined but serve as a shield for all smaller companies. If you were a Brussels bureaucrat with a focus on fining websites for privacy issues, and you could get hundreds of millions for targeting a FAANG [not that you get... (read more)

3Jiro
Small players could be fined to set precedents that can be used against larger players.
5ChristianKl
The size of the fine depends on the number of privacy violations.  Brussels bureaucrats are not the only people who can bring lawsuits for GDPR violations. A startup that makes money through an app likely has European customers and thus there are assets that could be targeted.  In practice, both the Google Play Store and the Apple store have rules that enforce privacy. Especially, Apple is quite willing to tell Joe Startup that his app needs to follow the rules if it wants to be in their store. 

OK, maybe I'm wrong about the politics as regards large multinationals. (Although I'm not sure I'm wrong.)

But that argument says nothing about why a website like JSTOR (non-profit, US-based) complies. I'm skeptical that anyone would try to enforce against them, and also that any such enforcement would have actual legal consequences. EU tries to fine JSTOR, JSTOR says "we are in the US" and doesn't pay, then...? Does anyone actually think the EU is going to force all European ISPs to block JSTOR? I suppose if JSTOR uses EU-based datacenters to serve some content to European users, those could be shut down. I do not think that would be a popular move with European academics.

Why do non-EU-based companies/websites bother to comply with this directive? For that matter, why do even big firms with an EU presence comply? I can see why a firm with an EU office or employees might worry about some legal risk, but (a) is it really true that the EU would devote significant enforcement resources to prosecuting/fining "victimless" violations of this directive? (b) for sufficiently popular websites (Amazon, FB, ...) surely the companies have more leverage than the EU, since I'd think that one of these firms even threatening to stop serving... (read more)

4Kaj_Sotala
If a firm has a policy of "we will follow the law except when we can get away with breaking it", they may miscalculate what they can get away with and be hit with fines. Having an ethical injunction to try to follow the law in all cases, even when you think you might get away with not following it, is the safest policy. Of course, in practice big companies do skirt the law and try to get away with it, but for that to be rational there needs to be a sufficient payoff for it. In practical terms, it doesn't cost FB/Amazon anything to comply with cookie banner laws, nor would they get any real benefit from breaking them. So even if the risk was small, why take it?
4ChristianKl
Decisions on whether or not to enforce laws are not made in the EU by directly elected politicians driven by the desire to be popular with the population.  Practically we do see Facebook and Amazon getting fined: https://www.wsj.com/articles/facebooks-whatsapp-fined-around-270-million-for-eu-privacy-violations-11630576800
7Ben
If amazon or facebook were breaking the rules, the EU would probably love to go after them and hit them with giant fines. They are multinational corporations, they are big tech, they are even American! Fining them for continuing to operate in the EU would be very popular across a lot of the political spectrum.
5[anonymous]
So sorry! I assumed hyperreals and complex numbers are bigger than reals like a complete amateur!

2^16 != 1632

1lsusr
Fixed. Thanks.
sflicht00

Note that in the inexact case (i.e. observation error) this model (the Lasso) fits comfortably in a Bayesian framework. (Double exponential prior on u.) Leon already made this point below and jsteinhardt replied

[This comment is no longer endorsed by its author]Reply