layer normalization

/ˈleɪər ˌnɔːrməlaɪˈzeɪʃən/

🧠 LLM Architecture

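a technique that stabilizes neural network training by normalizing each example's activations across the feature dimension to zero mean and unit variance, then applying a learned scale and shift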
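The definition above can be sketched in a few lines of NumPy. This is a minimal illustration, not any particular library's implementation: the function name `layer_norm`, the parameter names `gamma` and `beta` (learned scale and shift), and the `eps` default are assumptions chosen for the example.

```python
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    """Illustrative layer normalization over the last (feature) axis."""
    # Statistics are computed per example, not across the batch.
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    # eps guards against division by zero when the variance is tiny.
    x_hat = (x - mean) / np.sqrt(var + eps)
    # gamma and beta let the model rescale and shift the normalized values.
    return gamma * x_hat + beta

# Two examples with very different scales but the same relative pattern.
x = np.array([[1.0, 2.0, 3.0, 4.0],
              [10.0, 20.0, 30.0, 40.0]])
y = layer_norm(x, gamma=np.ones(4), beta=np.zeros(4))
```

Because each row is normalized using only its own mean and variance, the two rows of `y` come out nearly identical even though their inputs differ by a factor of ten, and the result does not depend on what else is in the batch.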

Layer normalization helps transformers train stably because each token's activations are normalized independently of batch size and sequence length.

Origin: Latin norma ("carpenter's square, rule") + -ization