Core Objects
Model
A model defines inference behavior and configuration. Models are not executable until deployed.
Deployment
A deployment is a versioned, executable instance of a model. All inference requests are routed to a deployment, not directly to a model.
Conversation
Conversations represent persistent interaction state. All messages and executions occur within a conversation context.
Inference
Inference executes input against an active deployment and returns output, tool calls, and metadata.
Tool
Tools are callable functions exposed to a model during inference. Tools may require secrets, services, or dependencies.
Workspace
A workspace isolates models, deployments, tools, conversations, secrets, services, and usage under a single API key.