Segue
Segue
Play
iOS
LLM Inference·Artificial Intelligence
nucleus sampling

nucleus sampling

/ˈnjuːkliəs ˌsæmplɪŋ/

⚡ LLM Inference

sampling from tokens comprising the top cumulative probability mass (top-p)

nucleus sampling in a sentence

“Nucleus sampling with p=0.95 adapts vocabulary size to context certainty.”

Origin of nucleus sampling

Latin nucleus kernel from nux nut

Related Words

logits

raw, unnormalized scores output by the model before conversion to probabilities

softmax

a function that converts logits into a probability distribution summing to one

KV cache

cached key-value pairs from previous tokens to speed up autoregressive generation

inference

the process of using a trained model to generate predictions or outputs

temperature

a parameter controlling randomness in generation—higher means more creative, lower means more deterministic

sampling

randomly selecting the next token from the probability distribution rather than always choosing the most likely

SegueMaster the art of eloquence
iOS AppWord of the DayContactPrivacyTerms