Agentic AI
AI systems that can plan, decide, and execute multi-step tasks using tools and memory instead of producing one-step text output.
Use this glossary to align teams on technical language before architecture reviews, vendor evaluations, and implementation planning.
Practical definitions focused on real project execution and production operations.
The amount of text or tokens an LLM can process in a single request. Larger windows support longer documents and richer prompts.
A numerical vector representation of text used for semantic search, retrieval, clustering, and similarity matching.
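To make "similarity matching" on embeddings concrete, the sketch below compares vectors with cosine similarity, one common choice of metric. The three-dimensional vectors are hand-written placeholders, not output from any real embedding model, and the names `invoice`, `billing`, and `weather` are hypothetical.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical 3-dimensional embeddings, hand-written for illustration only.
invoice = [0.9, 0.1, 0.0]
billing = [0.8, 0.2, 0.1]
weather = [0.0, 0.1, 0.9]

# Related texts should score higher than unrelated ones.
related = cosine_similarity(invoice, billing)
unrelated = cosine_similarity(invoice, weather)
```

Real embeddings typically have hundreds or thousands of dimensions, but the comparison works the same way.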
Additional model training on task or domain data to improve style consistency, behavior, and task accuracy.
Constraining model responses to verified source context so answers are factual and traceable to known documents.
The process of running a trained model to generate predictions, completions, classifications, or decisions.
An orchestration platform for deploying and scaling containerized applications, including AI APIs and model-serving stacks.
A framework for building LLM applications with prompt chains, tool calling, retrieval integration, and agent workflows.
Large Language Model. A neural model trained on broad text corpora that can understand and generate natural language.
Low-Rank Adaptation. A parameter-efficient fine-tuning technique that adapts large models with lower compute cost.
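The "lower compute cost" of Low-Rank Adaptation comes from training far fewer parameters. Instead of updating a full weight matrix W of shape d_out x d_in, LoRA freezes W and learns two small matrices B (d_out x r) and A (r x d_in), so the adapted weight is W + BA. The sketch below only counts trainable parameters; the 4096 x 4096 layer size and rank 8 are illustrative assumptions, and the scaling factor that LoRA implementations usually apply to BA is omitted.

```python
def full_update_params(d_out: int, d_in: int) -> int:
    """Trainable parameters when the whole weight matrix is fine-tuned."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Trainable parameters for the LoRA factors B (d_out x r) and A (r x d_in)."""
    return rank * (d_out + d_in)


# Illustrative sizes, assumed for this example only.
D_OUT, D_IN, RANK = 4096, 4096, 8

full = full_update_params(D_OUT, D_IN)   # 16_777_216
low_rank = lora_params(D_OUT, D_IN, RANK)  # 65_536
fraction = low_rank / full               # 0.00390625
```

Under these assumed sizes, the low-rank factors hold about 0.4% of the parameters of the full matrix.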
Practices that connect model development and operations: training pipelines, deployment, monitoring, rollback, and governance.
Telemetry and diagnostics for distributed systems, including logs, traces, and metrics used to debug model or service behavior.
Systematic design of prompts, instructions, and constraints to improve model reliability and output quality.
Retrieval-Augmented Generation. A pattern where relevant documents are retrieved and injected into prompts before answer generation.
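The retrieve-then-inject flow of Retrieval-Augmented Generation can be sketched in a few lines. This is a toy version under loud assumptions: retrieval is plain word overlap rather than embedding search, the documents in `DOCS` are invented, and the final call to a model is left out because it depends on the provider. The helper names `retrieve` and `build_prompt` are hypothetical.

```python
def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Return up to k documents sharing at least one word with the query."""
    query_words = set(query.lower().split())
    scored = [
        (len(query_words & set(doc.lower().split())), doc) for doc in documents
    ]
    # Highest overlap first; the sort is stable, so ties keep input order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in scored[:k] if score > 0]


def build_prompt(query: str, documents: list[str], k: int = 2) -> str:
    """Inject the retrieved documents into the prompt ahead of the question."""
    context = "\n".join(f"- {doc}" for doc in retrieve(query, documents, k))
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )


# Invented knowledge-base snippets, for illustration only.
DOCS = [
    "Refunds are processed within 5 business days.",
    "Support is available on weekdays from 9 to 5.",
    "Invoices are emailed on the first day of each month.",
]

prompt = build_prompt("How long do refunds take?", DOCS)
```

A production system would usually replace the word-overlap scoring with embedding similarity against a vector index, but the prompt assembly step stays the same.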
Search based on meaning rather than exact keyword matches, typically powered by embeddings and vector databases.
Infrastructure as Code tooling used to define, provision, and version cloud resources across providers.
Capability allowing LLM-driven systems to invoke APIs, databases, and external services as part of response generation.
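On the application side, tool calling often reduces to a dispatch step: the model emits a structured request, and the host code looks up and runs the matching function. The sketch below assumes the model returns a JSON object with `name` and `arguments` fields; that shape, the `get_order_status` function, and the `TOOLS` registry are all hypothetical and do not mirror any specific provider's API.

```python
import json


def get_order_status(order_id: str) -> str:
    # Stand-in for a real API or database lookup.
    return f"Order {order_id} has shipped."


# Registry of functions the model is allowed to invoke.
TOOLS = {"get_order_status": get_order_status}


def run_tool_call(raw_call: str) -> str:
    """Parse a JSON tool request and dispatch it to a registered function."""
    call = json.loads(raw_call)
    name = call["name"]
    if name not in TOOLS:
        # Refuse anything outside the registry instead of executing it.
        return f"Unknown tool: {name}"
    return TOOLS[name](**call["arguments"])


# Assumed shape of a model's tool request, written by hand for illustration.
model_output = '{"name": "get_order_status", "arguments": {"order_id": "A123"}}'
result = run_tool_call(model_output)
```

Keeping an explicit registry means the model can only trigger functions the application has chosen to expose.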
A storage engine optimized for nearest-neighbor search on embeddings, often used in RAG and recommendation systems.
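The core operation such a storage engine provides, nearest-neighbor search, can be illustrated with a brute-force in-memory index. This is a minimal sketch, not how production vector databases work internally: they generally rely on approximate indexes to avoid scanning every vector. The class `TinyVectorIndex`, the item IDs, and the vectors are hypothetical.

```python
import heapq
import math


class TinyVectorIndex:
    """Brute-force top-k search over unit-normalized vectors."""

    def __init__(self) -> None:
        self._items: list[tuple[str, list[float]]] = []

    @staticmethod
    def _normalize(vector: list[float]) -> list[float]:
        norm = math.sqrt(sum(x * x for x in vector))
        if norm == 0.0:
            raise ValueError("cannot index or query a zero vector")
        return [x / norm for x in vector]

    def add(self, item_id: str, vector: list[float]) -> None:
        # Store unit vectors so a dot product equals cosine similarity.
        self._items.append((item_id, self._normalize(vector)))

    def search(self, query: list[float], k: int = 1) -> list[str]:
        """Return the IDs of the k stored vectors most similar to the query."""
        q = self._normalize(query)
        scored = (
            (sum(a * b for a, b in zip(q, vec)), item_id)
            for item_id, vec in self._items
        )
        return [item_id for _, item_id in heapq.nlargest(k, scored)]


# Hand-written placeholder vectors, not real embeddings.
index = TinyVectorIndex()
index.add("refund-policy", [0.9, 0.1, 0.0])
index.add("office-hours", [0.1, 0.9, 0.0])
index.add("weather", [0.0, 0.1, 0.9])

top_two = index.search([1.0, 0.2, 0.0], k=2)
```

The scan here is linear in the number of stored vectors, which is the cost that dedicated vector indexes are designed to reduce.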