An alignment tax (sometimes called a safety tax) is the additionalextra cost incurred when makingof ensuring that an AI aligned,system is aligned, relative to the cost of building an unaligned AI.
Approachesalternative. The term ‘tax’ can be misleading: in the safety literature, ‘alignment/safety tax’ or ‘alignment cost’ is meant to refer to increased developer time, extra compute, or decreased performance, and not only to the financial cost/tax required to build an aligned system.
In order to get a better idea of what the alignment tax
No Tax:Max Tax:Paul Christiano distinguishes two main approaches for dealing with the alignment tax.[1][2] One approach seeks
- The first is to
find wayshave the will to pay the tax, such as persuading individuali.e. the relevant actors (corporations, governments, etc.) would be willing to pay the extra costs to avoid deploying a system until it or facilitating coordination of the sort that would allow groups to pay it. is aligned. - The
other approach triessecond is to reduce the tax,tax by differentially advancing existing alignable algorithms or by making existing algorithms more alignable. This means, for any potentially unaligned algorithm, ensuring the additional cost for an aligned version of the algorithm is low enough that the developers would be willing to pay it.
Copied over from corresponding tag on the EA Forum.
An alignment tax (sometimes called a safety tax) is the
additionalextra costincurred when makingof ensuring that an AIaligned,system is aligned, relative to the cost of building an unalignedAI.Approachesalternative. The term ‘tax’ can be misleading: in the safety literature, ‘alignment/safety tax’ or ‘alignment cost’ is meant to refer to increased developer time, extra compute, or decreased performance, and not only to the financial cost/tax required to build an aligned system.In order to get a better idea of what the alignment tax
No Tax:Max Tax:Paul Christiano distinguishes two main approaches for dealing with the alignment tax.[1][2]
One approach seeksfind wayshave the will to pay the tax,such as persuading individuali.e. the relevant actors (corporations, governments, etc.) would be willing to pay the extra costs to avoid deploying a system until itor facilitating coordination of the sort that would allow groups to pay it.is aligned.other approach triessecond is to reduce thetax,tax by differentially advancing existing alignable algorithms or by making existing algorithms more alignable. This means, for any potentially unaligned algorithm, ensuring the additional cost for an aligned version of the algorithm is low enough that the developers would be willing to pay it.Copied over fromcorresponding tagon the EA Forum.