feed-forward layer

/ˌfiːd ˈfɔːrwərd ˌleɪər/

🧠 LLM Architecture

a neural network sublayer, placed after attention in a transformer block, that processes each position independently

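A feed-forward layer applies the same small network to each token's attention output separately: a linear expansion to a wider hidden size, a nonlinearity, and a linear projection back to the model dimension. No information is mixed across positions at this step.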
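A minimal sketch of the position-wise behavior, assuming a two-layer network with a ReLU nonlinearity and NumPy arrays. The function name `feed_forward`, the weight names, and the sizes are illustrative assumptions, not taken from any particular model.

```python
import numpy as np

def feed_forward(x, w1, b1, w2, b2):
    """Position-wise feed-forward: expand, apply ReLU, project back.

    x has shape (seq_len, d_model); each row is handled independently.
    """
    hidden = np.maximum(0.0, x @ w1 + b1)  # (seq_len, d_ff)
    return hidden @ w2 + b2                # (seq_len, d_model)

# Illustrative sizes (assumed for this sketch).
seq_len, d_model, d_ff = 4, 8, 32

rng = np.random.default_rng(0)
w1 = rng.normal(size=(d_model, d_ff)) * 0.1
b1 = np.zeros(d_ff)
w2 = rng.normal(size=(d_ff, d_model)) * 0.1
b2 = np.zeros(d_model)

x = rng.normal(size=(seq_len, d_model))  # stand-in for attention outputs
y = feed_forward(x, w1, b1, w2, b2)

# Changing one position leaves the outputs at all other positions unchanged.
x_changed = x.copy()
x_changed[3] += 1.0
y_changed = feed_forward(x_changed, w1, b1, w2, b2)
```

Because the same weights are applied row by row, running the function on a single position gives the same result as that position's row in the full output.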

Origin: Old English fēdan "to nourish" + Old English foreweard "toward the front" + Middle English leyer (from lay + -er)