This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Wikitags
LW
Login
Subscribe
Discussion
0
1
AI Misuse
Raemon
AI Misuse
Subscribe
Discussion
0
1
Written by
Raemon
last updated
1st May 2023
Summaries
Cancel
Submit
AI misuse.
Humans using AI in a way that harms humanity.
Posts tagged
AI Misuse
Most Relevant
2
63
Managing catastrophic misuse without robust AIs
Ω
ryan_greenblatt
,
Buck
1y
Ω
17
2
30
Adversarial Robustness Could Help Prevent Catastrophic Misuse
Ω
aog
1y
Ω
18
2
17
Distinguishing misuse is difficult and uncomfortable
lemonhope
2y
3
1
89
Covert Malicious Finetuning
Ω
Tony Wang
,
dannyhalawi
9mo
Ω
4
1
79
Human study on AI spear phishing campaigns
Simon Lermen
,
Fred Heiding
,
Andrew Kao
3mo
8
1
38
Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Ω
Soroush Pour
,
rusheb
,
Quentin FEUILLADE--MONTIXI
,
Arush
,
scasper
1y
Ω
2
1
23
On excluding dangerous information from training
ShayBenMoshe
1y
5
1
22
Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*
Christopher King
2y
6
1
18
Proposal: Align Systems Earlier In Training
OneManyNone
2y
0
1
6
How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens
jeremtti
4mo
0
1
2
Technical Risks of (Lethal) Autonomous Weapons Systems
Heramb
5mo
0