Mesa Optimizers

Sometimes an optimization loop may create agents that are themselves optimizers.

Goal Misgeneralization

Intelligent systems may pursue a goal in unexpected ways, potentially leading to undesired outcomes.

Instrumental Convergence

Humans or other entities will often adopt similar intermediate strategies to help them achieve their ultimate goals.