Impossibility Theorems in AI Value Alignment (against utility functions)

Utility functions or their equivalents (value functions, objective functions,
loss functions, reward functions, preference orderings) are a central tool in
most current machine learning systems. These mechanisms for defining goals and
guiding optimization… Read more

Similar