LESSWRONG
LW

27

[ Question ]

What prevents SB-1047 from triggering on deep fake porn/voice cloning fraud?

26th Sep 2024

1 min read

27

Recently, there was a post on SB-1047 and how it's quite mild regulation. I'm not expert on it and don't know how it works.

In the comment section I was asking:

Why wouldn't deep fake porn or voice cloning technology to engage in fraud be powerful enough to materially contribute to critical harm?
There are cases of fraud that could do $500,000,000 in damages.
Given how juries decide about damages, a model that's used to create child porn for thousands of children could be argued to cause $500,000,000 in damages as well. Especially when coupled with something like trying to extort the children.

I'm surprised that my comment didn't get any engagement where people explained how they think the law will handle those cases while at the same time my post got no karma votes.

I'd love to believe that the law is well thought out, and simply a good step for AI safety. At the same time, I also like having accurate beliefs about the effects of the law, so let me repeat my question here.

How does the law handle damage caused by deep fake porn or fraud with voice cloning?

AI GovernanceAI

27

New Answer

New Comment

2 Answers sorted by
top scoring

Sep 27, 2024

145

Notwithstanding the tendentious assumption in the other comment thread that courts are maximally adversarial processes bent on on misreading legislation to achieve their perverted ends, I would bet that the relevant courts would not in fact rule that a bunch of deepfaked child porn counted as "Other grave harms to public safety and security that are of comparable severity to the harms described in subparagraphs (A) to (C), inclusive", where those other things are "CBRN > mass casualties", "cyberattack on critical infra", and "autonomous action > mass casualties". Happy to take such a bet at 2:1 odds.

But there are some simpler reason that particular hypothetical fails:

Image models are just not nearly as expensive to train, so it's unlikely that they'd fall under the definition of a covered model to begin with.
Even if someone used a covered multimodal model, existing models can already do this.

See:

(2) “Critical harm” does not include any of the following:
(A) Harms caused or materially enabled by information that a covered model or covered model derivative outputs if the information is otherwise reasonably publicly accessible by an ordinary person from sources other than a covered model or covered model derivative.

[-]ChristianKl1mo42

Child porn is frequently used to justify all sorts of highly invasive privacy interventions. ChatGPT * seems to think it would be a public safety thread under Californian law.

Existing models can do pictures but not video. A complex multimodal model might be able to do video porn.

Better models might produce deep fake audio with less data and at nearer to how the person actually speaks.

There's also the question of whether deep fake porn or faked audio is "accessible information" in the sense of that paragraph (2) (A). That paragraph clearly absol... (read more)

6RobertM1mo

Seems like it'd pretty obviously cover information generated by non-covered models that are routinely used by many ordinary people (as open source image models currently are). As a sidenote, I think the law is unfortunately one of those pretty cursed domains where it's hard to be very confident of anything as a layman without doing a lot of your own research, and you can't even look at experts speaking publicly on the subject since they're often performing advocacy, rather than making unbiased predictions about outcomes. You could try to hire a lawyer for such advice, but it seems to be pretty hard to find lawyers who are comfortable giving their clients quantitative (probabilistic) and conditional estimates. Maybe this is better once you're hiring for e.g. general counsel of a large org, or maybe large tech company CEOs have to deal with the same headaches that we do. Often your best option is to just get a basic understanding of how relevant parts of the legal system work, and then do a lot of research into e.g. relevant case law, and then sanity-check your reasoning and conclusions with an actual lawyer specialized in that domain.

2ChristianKl1mo

Deep fake porn of a particular person is not information that's generated by non-covered models that are routinely used by many ordinary people even if the models could generate the porn if instructed to do so.

2RobertM1mo

Almost no specific (interesting) output is information that's already been generated by any model, in the strictest sense.

2ChristianKl1mo

If I tell a model to write me a book summary, that book summary can be specific interesting output without containing any new information. If I want to know how to build a bomb, there are already plenty of sources out there on how to build a bomb. The information is already accessible from those sources. When an LLM synthesizes the existing information in its training data to help someone build a bomb it's not inventing new information. Deep fakes aren't about simply repeating information that's already in the training data. So the argument would be that the lawmaker chose to say "accessible" because they want to allow LLMs to synthesize the existing information in their training data and repeat it back to the user but that does not mean that the lawmaker had an intention to allow the LLMs to produce new information that gets used to create harm even if there are other ways to create that information.

[-]cfoster01mo42

I’m not sure if you intended the allusion to “the tendentious assumption in the other comment thread that courts are maximally adversarial processes bent on on misreading legislation to achieve their perverted ends”, but if it was aimed at the thread I commented on… what? IMO it is fair game to call out as false the claim that

It only counts if the $500m comes from "cyber attacks on critical infrastructure" or "with limited human oversight, intervention, or supervision....results in death, great bodily injury, property damage, or property loss."

even if deep... (read more)

1

2RobertM1mo

Not your particular comment on it, no.

Sep 26, 2024

2-2

It only counts if the $500m comes from "cyber attacks on critical infrastructure" or "with limited human oversight, intervention, or supervision....results in death, great bodily injury, property damage, or property loss."

So emotional damages, even if severe and pervasive, can't get you there.

[-]cfoster01mo134

If you read the definition of critical harms, you’ll see the $500m doesn’t have to come in one of those two forms. It can also be “Other grave harms to public safety and security that are of comparable severity”.

1RamblinDash1mo

I have a hard time imagining a Court ruling that "Other grave harms to public safety and security that are of comparable severity" could embrace something so different-in-kind than the listed items.

7Shankar Sivarajan1mo

From the Fish and Game code: This was read to include bees for the purposes of the Endangered Species Act which lists (Emphases mine).

2RamblinDash1mo

Well, "fish" is a statutorily defined term that clearly includes all invertebrates. What did you want the court to do, ignore the statutory text? Arguably, that outcome supports the notion that the courts are less likely to just ignore text limiting the kinds of harm that are cognizable, not more likely, as you seem to be arguing.

6ChristianKl1mo

(4) uses three terms "public safety", "public security" and comparable severity. I would expect that severity means $500,000,000 worth of damage. Public safety does not seem to be a clearly defined term but fairly broad.

[-]ChristianKl1mo53

If someone creates an automated system that makes deep fake porn and then emails with that porn to blackmail people and publishes the deep fake porn when people don't pay up, that could very well be a system with limited human oversight, intervention, or supervision.

Those people who pay the blackmail would also suffer from property loss.

If you have someone committing suicide because of deep fake porn images of themselves, it might also result in death.

If you have one suicide + $500,000,000 worth in emotional damage wouldn't it count?

7 comments, sorted by

Click to highlight new comments since: Today at 12:39 PM

[-]Yair Halberstadt1mo51

I'm not sure why those shouldn't be included? If someone uses my AI to perform 500 million dollars of fraud, then I should probably have been more careful releasing the product.

[-]ChristianKl1mo4-3

leogao spoke of SB-1047 being quite mild.

If you include those things, I expect that you effectively outlaw Open Source AI models for video/audio generation.

I think it's a reasonable decision to outlaw Open Source AI models, but it doesn't sound "mild".

[+][comment deleted]1mo20

Deleted by Yair Halberstadt, 09/26/2024

Reason: Incorrect

[+]Shankar Sivarajan1mo-8-10