Reference · 30 Terms

AI & LLM
Glossary

Plain-English definitions of the terms that actually matter — no jargon for jargon's sake. Each entry links to a full module if you want to go deeper.

See also: AI Model & LLM Pricing — compare what the major models cost.

AI AgentAn AI system that can take actions and use tools to accomplish a goal, not just chat.
AttentionThe mechanism that lets a model weigh which tokens matter for each prediction.
Chain-of-Thought (CoT)Prompting a model to reason step by step before answering.
Context WindowThe maximum amount of text (in tokens) a model can consider at once.
DistillationTraining a smaller 'student' model to mimic a larger 'teacher' model.
EmbeddingA numeric vector that captures the meaning of text so machines can compare it.
Few-Shot PromptingShowing a model 2–5 examples of the pattern you want before your real input.
Fine-TuningFurther-training a base model on specialized data to specialize its behavior.
HallucinationWhen a model produces fluent output that is factually wrong or made up.
InferenceRunning a trained model to generate output — what happens on every prompt.
Large Language Model (LLM)An AI model trained on vast amounts of text to predict and generate language.
Mixture of Experts (MoE)An architecture that activates only part of the model for each token, saving compute.
Model Context Protocol (MCP)An open standard for connecting AI models to tools and data sources.
Multimodal AIModels that handle more than text — images, audio, or video alongside language.
Open-Weights ModelA model whose trained parameters are publicly released to download and run.
ParametersThe internal numbers a model learns during training; roughly, its 'knobs'.
Prompt EngineeringThe craft of writing inputs that get the best output from a model.
Prompt InjectionA security attack where malicious text hijacks a model's instructions.
QuantizationCompressing a model by storing its numbers at lower precision to cut cost.
Reasoning ModelA model trained to deliberate at length before answering hard problems.
Retrieval-Augmented Generation (RAG)Giving a model relevant external documents at query time so it answers from facts.
System PromptHigh-level instructions that set a model's role, rules, and behavior for a session.
TemperatureA setting that controls how random vs. deterministic a model's output is.
TokenThe basic unit of text an LLM reads and generates — roughly a word-piece.
Top-k SamplingA sampling setting that keeps only the k most likely next tokens.
Top-p (Nucleus Sampling)A sampling setting that keeps the smallest set of tokens covering probability p.
TrainingThe compute-intensive process of teaching a model by adjusting its parameters.
TransformerThe neural-network architecture behind virtually all modern LLMs.
Vector DatabaseA database built to store embeddings and find the most similar ones fast.
Zero-Shot PromptingAsking a model to do a task directly, with no examples.

The Whole Picture

Understand it, not just the words.

LogitMax is a 30-module course on the AI frontier, written to be genuinely understandable.

Explore the course →

AI & LLMGlossary

Understand it, not just the words.

AI & LLM
Glossary