LanguageModel

The LanguageModel CRD configures LLM access through the cluster's shared LiteLLM proxy.

Overview

A LanguageModel defines:

  • Provider (Anthropic, OpenAI, Azure, etc.)
  • Model name and version
  • API credentials (via Secret reference)
  • Rate limits and retry policies

Quick Example

apiVersion: langop.io/v1alpha1
kind: LanguageModel
metadata:
  name: claude-sonnet
  namespace: my-cluster
spec:
  provider: anthropic
  modelName: claude-sonnet-4-5
  apiKeySecretRef:
    name: anthropic-credentials
    key: api-key

Complete API Reference

See the Complete API Reference for full field documentation including:

  • LanguageModel - Top-level resource
  • LanguageModelSpec - Specification fields
  • LanguageModelStatus - Status and endpoint information

Supported Providers

  • anthropic - Claude models
  • openai - GPT models
  • azure - Azure OpenAI Service
  • bedrock - AWS Bedrock
  • vertex - Google Vertex AI
  • openai-compatible - Any OpenAI-compatible API
  • custom - Custom LiteLLM configuration

Key Concepts

Shared Proxy Registration

When you create a LanguageModel:

  1. The LanguageModel controller validates the spec and sets status.phase: Ready
  2. The LanguageCluster controller (which watches LanguageModel resources) detects the new CR
  3. The shared gateway-config ConfigMap is regenerated with all models in the namespace
  4. The gateway Deployment performs a rolling update to pick up the new configuration

Once the rollout completes, all agents have access to the new model via MODEL_ENDPOINT.
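From an agent's point of view, the gateway is an OpenAI-compatible endpoint. A minimal client sketch, assuming the proxy serves the standard `/v1/chat/completions` route and that the fallback URL shown is purely illustrative (only the MODEL_ENDPOINT variable and the `claude-sonnet` model name come from this document):

```python
import json
import os
import urllib.request

# The operator exposes the shared proxy address via MODEL_ENDPOINT;
# the fallback URL below is a hypothetical in-cluster address.
endpoint = os.environ.get("MODEL_ENDPOINT", "http://gateway.my-cluster.svc:4000")

# OpenAI-compatible chat payload; "claude-sonnet" matches metadata.name
# of the LanguageModel from the Quick Example above.
payload = {
    "model": "claude-sonnet",
    "messages": [{"role": "user", "content": "Hello"}],
}

req = urllib.request.Request(
    f"{endpoint}/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
# with urllib.request.urlopen(req) as resp:   # requires a reachable gateway
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

Because the proxy speaks the OpenAI wire format, any OpenAI-compatible SDK pointed at MODEL_ENDPOINT works the same way.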

Credential Management

Store API keys in Secrets:

kubectl create secret generic anthropic-key \
  --from-literal=api-key=sk-ant-...
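Equivalently, the Secret can be managed declaratively; a minimal sketch using `stringData`, which lets you supply the key without base64-encoding it:

```yaml
apiVersion: v1
kind: Secret
metadata:
  name: anthropic-key
type: Opaque
stringData:
  api-key: sk-ant-...
```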

Reference in the model spec:

spec:
  apiKeySecretRef:
    name: anthropic-key
    key: api-key

The operator injects credentials into the shared proxy, never into agent pods.

Rate Limiting

Configure per-model rate limits:

spec:
  rateLimits:
    requestsPerMinute: 100
    tokensPerMinute: 50000

The shared proxy enforces these limits across all agents.
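Under the hood, the operator translates these fields into LiteLLM proxy configuration. A hypothetical sketch of the corresponding gateway-config entry (the exact rendering depends on the operator's template; `rpm` and `tpm` are LiteLLM's per-model rate-limit parameters, and the environment variable name is an assumption):

```yaml
model_list:
  - model_name: claude-sonnet
    litellm_params:
      model: anthropic/claude-sonnet-4-5
      api_key: os.environ/ANTHROPIC_API_KEY  # injected by the operator, not by agents
      rpm: 100      # requestsPerMinute
      tpm: 50000    # tokensPerMinute
```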

Provider-Specific Examples

Azure OpenAI

spec:
  provider: azure
  modelName: gpt-4
  endpoint: https://my-resource.openai.azure.com
  apiKeySecretRef:
    name: azure-credentials
    key: api-key

Azure-specific configuration (deployment name, API version) is passed via the LiteLLM proxy configuration in the LanguageCluster spec, not through CRD fields.
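For orientation, LiteLLM addresses Azure models as `azure/<deployment-name>` together with `api_base` and `api_version`. A sketch of what that LanguageCluster-side proxy configuration might look like (the deployment name and API version are placeholders, not values from this document):

```yaml
model_list:
  - model_name: gpt-4
    litellm_params:
      model: azure/my-gpt4-deployment      # hypothetical deployment name
      api_base: https://my-resource.openai.azure.com
      api_version: "2024-02-01"            # illustrative API version
      api_key: os.environ/AZURE_API_KEY
```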

Self-Hosted (Ollama)

spec:
  provider: openai-compatible
  modelName: llama3.2
  endpoint: http://ollama.default.svc.cluster.local:11434/v1