LESSWRONG
Machine Unlearning
Applied to:
The case for unlearning that removes information from LLM weights, by Ebenezer Dukakis, 2mo ago
Machine Unlearning in Large Language Models: A Comprehensive Survey with Empirical Insights from the Qwen 1.5 1.8B Model, by Saketh Baddam, 3mo ago
Gradient Routing: Masking Gradients to Localize Computation in Neural Networks, by TurnTrout, 5mo ago
Breaking Circuit Breakers, by NickyP, 9mo ago
Unlearning via RMU is mostly shallow, by NickyP, 9mo ago
Deep Forgetting & Unlearning for Safely-Scoped LLMs, by NickyP, 1y ago
LLM Modularity: The Separability of Capabilities in Large Language Models, by NickyP, 2y ago
Machine Unlearning Evaluations as Interpretability Benchmarks, by NickyP, 2y ago
Revision: NickyP, v1.0.0, Oct 23rd 2023 GMT (+1330)
Created by NickyP, 2y ago