mesa-optimization

mesa-optimization

/ˈmeɪsə ˌɒptɪmaɪˈzeɪʃən/

🛡️ AI Safety & Alignment

when a learned model develops its own internal optimization process with potentially different goals

Mesa-optimization could cause an AI to pursue goals different from its training objective.

Origin: Spanish mesa table, plateau (indicating a level within) + optimization