
layer normalization
/ˈleɪər ˌnɔːrməlaɪˈzeɪʃən/
a technique to stabilize training by normalizing activations across features
“Layer normalization helps transformers train more stably on long sequences.”
Origin: Latin norma carpenter's square, rule + -ization