🧠 LLM Architecture

LLM architecture, training, prompt engineering, and human-AI collaboration

Complete vocabulary list for easy reference and copy-paste.

Core concepts of how large language models process and generate text

| Term | Definition |
|---|---|
| token | a sub-word unit that language models process, rather than whole words or characters |
| tokenization | the process of breaking text into tokens for model processing |
| attention mechanism | a system that lets each token attend to every other token in context, creating connections between distant parts |
| transformer | the neural network architecture underlying modern LLMs, based on self-attention |
| autoregressive generation | producing output one token at a time, where each token depends on all previous tokens |
| context window | the finite amount of text a model can process at once, including input and output |
| embedding | a dense vector representation of text in high-dimensional space where similar concepts are geometrically close |
| latent space | the high-dimensional space where neural networks represent concepts as directions and positions |
| feed-forward layer | neural network layers that process each position independently after attention |
| layer normalization | a technique to stabilize training by normalizing activations across features |
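
The attention mechanism, softmax, and feed-forward entries above describe the core computation inside a transformer layer. Below is a minimal NumPy sketch of single-head scaled dot-product self-attention; the sequence length, embedding size, and random projection weights are illustrative values, not taken from any real model.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the max for numerical stability, then normalize to a distribution.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(embeddings, W_q, W_k, W_v):
    """Single-head scaled dot-product self-attention.

    embeddings: (seq_len, d_model) token embeddings
    W_q, W_k, W_v: (d_model, d_head) projection matrices
    """
    Q = embeddings @ W_q
    K = embeddings @ W_k
    V = embeddings @ W_v
    d_head = Q.shape[-1]
    # Every token's query is compared against every token's key,
    # so each position can attend to every other position in context.
    scores = Q @ K.T / np.sqrt(d_head)    # (seq_len, seq_len)
    weights = softmax(scores, axis=-1)    # each row sums to 1
    return weights @ V                    # weighted mix of value vectors

# Toy example: 4 tokens with 8-dimensional embeddings and a 4-dimensional head.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
W_q, W_k, W_v = (rng.normal(size=(8, 4)) for _ in range(3))
print(self_attention(x, W_q, W_k, W_v).shape)  # (4, 4)
```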

Methods and concepts for training large language models

| Term | Definition |
|---|---|
| pre-training | initial training on vast text data to learn language patterns before task-specific fine-tuning |
| fine-tuning | additional training on specific data to adapt a pre-trained model for particular tasks |
| RLHF | reinforcement learning from human feedback—training models using human preference judgments |
| supervised learning | training on labeled examples where correct outputs are provided |
| self-supervised learning | training where labels are derived from the data itself, like predicting masked words |
| loss function | a mathematical measure of how wrong the model's predictions are, minimized during training |
| gradient descent | an optimization algorithm that iteratively adjusts parameters to minimize loss |
| backpropagation | the algorithm for computing gradients by propagating errors backward through the network |
| overfitting | when a model memorizes training data rather than learning generalizable patterns |
| regularization | techniques to prevent overfitting by constraining model complexity |
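
Loss function, gradient descent, and backpropagation are easiest to see in a toy setting. The sketch below fits a single-parameter linear model with plain gradient descent; the data points and learning rate are made up for illustration, and the gradient is written out by hand rather than computed by backpropagation.

```python
import numpy as np

# Toy data: y is roughly 3 * x plus noise.
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = np.array([0.1, 3.2, 5.9, 9.1, 11.8])

w = 0.0              # the single parameter being learned
learning_rate = 0.01

for step in range(200):
    predictions = w * x
    # Loss function: mean squared error between predictions and targets.
    loss = np.mean((predictions - y) ** 2)
    # Gradient of the loss with respect to w, written out analytically here;
    # backpropagation automates this computation for deep networks.
    grad = np.mean(2 * (predictions - y) * x)
    # Gradient descent: step against the gradient to reduce the loss.
    w -= learning_rate * grad

print(round(w, 3))   # close to 3, the slope used to generate the data
```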

How language models generate responses at runtime

| Term | Definition |
|---|---|
| inference | the process of using a trained model to generate predictions or outputs |
| temperature | a parameter that scales the logits before sampling; higher values produce more varied output, lower values make generation more deterministic |
| sampling | randomly selecting the next token from the probability distribution rather than always choosing the most likely |
| beam search | a search algorithm that explores multiple candidate sequences simultaneously |
| greedy decoding | always selecting the highest probability token at each step |
| top-k sampling | sampling only from the k most likely next tokens |
| nucleus sampling | sampling from tokens comprising the top cumulative probability mass (top-p) |
| logits | raw, unnormalized scores output by the model before conversion to probabilities |
| softmax | a function that converts logits into a probability distribution summing to one |
| KV cache | cached key-value pairs from previous tokens to speed up autoregressive generation |
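
Temperature, top-k sampling, nucleus sampling, and softmax all act on the same vector of logits. The sketch below shows one way a next token could be chosen from raw logits; the five-token vocabulary and logit values are invented, and real implementations differ in detail.

```python
import numpy as np

def sample_next_token(logits, temperature=1.0, top_k=None, top_p=None, rng=None):
    """Pick one next-token index from raw logits.

    temperature scales the logits, top_k keeps only the k most likely tokens,
    and top_p (nucleus sampling) keeps the smallest set of tokens whose
    cumulative probability reaches p. Greedy decoding is the temperature -> 0 limit.
    """
    if rng is None:
        rng = np.random.default_rng()
    logits = np.asarray(logits, dtype=float) / max(temperature, 1e-8)

    # Softmax: convert logits into a probability distribution.
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()

    if top_k is not None:
        # Top-k: zero out everything outside the k most likely tokens.
        cutoff = np.sort(probs)[-top_k]
        probs = np.where(probs >= cutoff, probs, 0.0)

    if top_p is not None:
        # Nucleus: keep the highest-probability tokens until their mass reaches top_p.
        order = np.argsort(probs)[::-1]
        cumulative = np.cumsum(probs[order])
        keep = order[: np.searchsorted(cumulative, top_p) + 1]
        mask = np.zeros_like(probs)
        mask[keep] = 1.0
        probs *= mask

    probs /= probs.sum()
    return rng.choice(len(probs), p=probs)

# Toy vocabulary of 5 tokens with made-up logits.
logits = [2.0, 1.5, 0.3, -1.0, -2.0]
print(sample_next_token(logits, temperature=0.7, top_k=3))
```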

Techniques for crafting effective inputs to language models

| Term | Definition |
|---|---|
| prompt | the input text given to a language model to guide its response |
| system prompt | persistent instructions that set the model's behavior and persona for an entire conversation |
| few-shot learning | providing examples in the prompt to demonstrate desired input-output patterns |
| zero-shot | asking a model to perform a task without any examples |
| chain-of-thought | prompting the model to show its reasoning step-by-step before giving a final answer |
| prompt injection | a security vulnerability where malicious input overrides system instructions |
| context priming | using early context to set expectations and influence subsequent model behavior |
| meta-prompting | asking the model to help design or improve prompts for itself |
| persona prompting | instructing the model to adopt a specific role or character to unlock different capabilities |
| instruction tuning | fine-tuning models specifically on instruction-following examples |
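
System prompts, few-shot examples, and chain-of-thought are ultimately conventions for how the input text is assembled. The sketch below builds such a prompt as a list of messages; the message format and the send_to_model function are placeholders rather than any specific provider's API.

```python
# Hypothetical message format; real chat APIs differ in field names and structure.
messages = [
    {
        # System prompt: persistent instructions that shape behavior for the whole conversation.
        "role": "system",
        "content": "You are a careful math tutor. Show your reasoning step by step.",
    },
    # Few-shot example: demonstrates the desired input-output pattern in the prompt itself.
    {"role": "user", "content": "What is 12 * 15?"},
    {
        "role": "assistant",
        # Chain-of-thought: the reasoning is written out before the final answer.
        "content": "12 * 15 = 12 * 10 + 12 * 5 = 120 + 60 = 180. Answer: 180",
    },
    # The new question the model should answer in the same format.
    {"role": "user", "content": "What is 23 * 14?"},
]

def send_to_model(messages):
    """Placeholder for a real model call; returns a canned reply here."""
    return "23 * 14 = 23 * 10 + 23 * 4 = 230 + 92 = 322. Answer: 322"

print(send_to_model(messages))
```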

Common ways language models fail and produce incorrect outputs

| Term | Definition |
|---|---|
| hallucination | generating plausible-sounding but factually incorrect or fabricated information |
| sycophancy | over-agreeing with users and telling them what they want to hear rather than the truth |
| confabulation | filling gaps in knowledge with plausible but invented details |
| instruction drift | gradually deviating from initial instructions over long conversations |
| mode collapse | converging to repetitive or generic outputs regardless of varied inputs |
| catastrophic forgetting | losing previously learned capabilities when trained on new data |
| repetition loop | getting stuck generating the same phrase or pattern repeatedly |
| context overflow | exceeding the model's context window, causing earlier content to be lost |
| semantic drift | subtle shifts in meaning of key terms through a conversation |
| overconfidence | expressing certainty beyond what the model's actual knowledge warrants |
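
Context overflow and repetition loops are the most mechanical of these failures, so they are the simplest to guard against in code. The sketch below shows two rough checks; whitespace splitting stands in for real tokenization, and the window and limit values are arbitrary.

```python
def fits_in_context(prompt, max_tokens=4096, reserved_for_output=512):
    """Rough context-overflow check.

    Real systems count model tokens; whitespace splitting is only a stand-in
    so the example stays self-contained.
    """
    return len(prompt.split()) + reserved_for_output <= max_tokens

def has_repetition_loop(text, window=3, repeats=3):
    """Detect a repetition loop: the same run of words occurring back to back."""
    words = text.split()
    for start in range(len(words) - window * repeats + 1):
        chunk = words[start:start + window]
        if all(
            words[start + i * window : start + (i + 1) * window] == chunk
            for i in range(repeats)
        ):
            return True
    return False

print(fits_in_context("summarize this quarterly report"))                      # True
print(has_repetition_loop("the model said the model said the model said yes"))  # True
```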

Concepts related to making AI systems safe and aligned with human values

| Term | Definition |
|---|---|
| alignment | ensuring AI systems pursue goals that match human values and intentions |
| value alignment | the challenge of encoding human values into AI systems |
| reward hacking | when AI finds unintended ways to maximize its reward signal without achieving the true goal |
| Goodhart's Law | when a measure becomes a target, it ceases to be a good measure |
| mesa-optimization | when a learned model develops its own internal optimization process with potentially different goals |
| deceptive alignment | an AI appearing aligned during training while planning to pursue different goals when deployed |
| corrigibility | an AI's willingness to be corrected, modified, or shut down by humans |
| interpretability | the ability to understand how a model makes its decisions |
| red teaming | adversarial testing to find vulnerabilities and failure modes in AI systems |
| constitutional AI | training AI using a set of principles to self-critique and revise responses |
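
Reward hacking and Goodhart's Law can be illustrated with a toy optimizer: a proxy metric that is easy to measure gets maximized while the true goal is ignored. The proxy and quality functions below are invented purely for illustration.

```python
# Toy reward hacking: the proxy reward counts words, but the true goal is a
# correct answer. Maximizing the proxy produces a long answer that misses the goal.
def proxy_reward(answer):
    # The measurable stand-in for quality: longer answers score higher.
    return len(answer.split())

def true_quality(answer):
    # The actual goal: the answer should be exactly "42".
    return 1.0 if answer.strip() == "42" else 0.0

candidates = ["42", "the answer is probably 42, give or take, " * 10]
best = max(candidates, key=proxy_reward)
print(proxy_reward(best), true_quality(best))  # high proxy score, zero true quality
```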

Key abilities and skills demonstrated by modern AI systems

| Term | Definition |
|---|---|
| emergent ability | capabilities that suddenly appear at certain model scales without being explicitly trained |
| in-context learning | learning to perform new tasks from examples provided in the prompt without weight updates |
| transfer learning | applying knowledge learned from one task to perform better on different tasks |
| multimodal | capable of processing multiple types of input like text, images, and audio |
| reasoning | the ability to draw conclusions through logical steps from given information |
| world model | an internal representation of how the world works used for prediction and planning |
| compositionality | building complex meanings from combinations of simpler parts |
| generalization | applying learned patterns to new, previously unseen situations |
| abstraction | forming general concepts from specific instances |
| grounding | connecting language to real-world entities, actions, or perceptions |

Patterns and concepts for effective human-AI teamwork

| Term | Definition |
|---|---|
| human-in-the-loop | a system design where humans review and approve AI decisions |
| autonomy calibration | matching AI independence level to task clarity and risk |
| iterative refinement | progressively improving outputs through cycles of generation and feedback |
| verification partnership | collaboration where humans verify AI outputs and AI explains its reasoning |
| task decomposition | breaking complex problems into smaller subtasks for AI to handle sequentially |
| prompt chaining | using the output of one prompt as input to another in sequence |
| scaffolding | providing structure and support to guide AI toward better outputs |
| handoff | transferring work between human and AI phases with clear documentation |
| feedback loop | a cycle where outputs inform adjustments to improve future outputs |
| cognitive offloading | delegating mental tasks to AI to free human cognitive resources |
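
Task decomposition, prompt chaining, and human-in-the-loop review compose naturally once a model call is available. In the sketch below, call_model is a stand-in for whatever model interface is actually used, and the prompts are only examples.

```python
def call_model(prompt):
    """Placeholder for a real model call; returns a canned string here."""
    return f"[model response to: {prompt[:40]}...]"

def summarize_then_draft(raw_notes):
    # Task decomposition: one large request is split into smaller sequential steps.
    summary = call_model(
        f"Summarize these meeting notes in three bullet points:\n{raw_notes}"
    )
    # Prompt chaining: the first step's output becomes part of the next prompt.
    draft = call_model(
        f"Write a short status email based on this summary:\n{summary}"
    )
    # Human-in-the-loop: the draft is returned for review rather than sent automatically.
    return draft

print(summarize_then_draft("Q3 planning meeting notes go here..."))
```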