
quantization
/ˌkwɒntaɪˈzeɪʃən/
Reducing model precision to decrease size and increase speed
quantization in a sentence
“Quantization made the model run efficiently on consumer hardware.”
Origin of quantization
From Latin quantus (how much) + -ization suffix; creating discrete quantities from continuous values
Related Words
latent space
A representation of compressed data
modality
A particular mode in which something exists or is experienced or expressed
large language model
An AI trained on vast text data to understand and generate language
foundation model
A large model trained on broad data that can be adapted to many tasks
context window
The amount of text a model can consider at once
temperature
A parameter controlling randomness in AI outputs