Built-in LLM Models¶

Trailblaze ships with the following built-in models. When you reference a model by id in your trailblaze.yaml, all specs below are inherited automatically.

These models can change between Trailblaze releases (models added/removed, pricing updated). For stable, predictable configuration, set explicit values in your workspace trailblaze.yaml.

Anthropic¶

Model ID	Context	Max Output	Input $/1M	Output $/1M	Cached Input $/1M	Capabilities
`claude-haiku-4-5`	200K	64K	$1.00	$5.00	$0.10	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`claude-opus-4-6`	1M	128K	$5.00	$25.00	$0.50	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`claude-sonnet-4-6`	1M	64K	$3.00	$15.00	$0.30	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools

Google¶

Model ID	Context	Max Output	Input $/1M	Output $/1M	Cached Input $/1M	Capabilities
`gemini-2.5-pro`	1M	65K	$1.25	$10.00	$0.13	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`gemini-3-flash-preview`	1M	65K	$0.50	$3.00	$0.05	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`gemini-3.1-flash-lite-preview`	1M	65K	$0.25	$1.50	$0.03	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`gemini-3.1-pro-preview`	1M	65K	$2.00	$12.00	$0.20	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`gemini-3.1-pro-preview-customtools`	1M	65K	$2.00	$12.00	$0.20	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools

Ollama¶

Model ID	Context	Max Output	Input $/1M	Output $/1M	Cached Input $/1M	Capabilities
`gpt-oss:120b`	131K	65K	free	free	free	basic-json-schema, completion, document, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`gpt-oss:20b`	131K	65K	free	free	free	basic-json-schema, completion, document, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3-vl:2b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3-vl:30b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3-vl:4b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3-vl:8b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3.5:0.8b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3.5:122b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3.5:27b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3.5:2b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3.5:35b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3.5:4b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3.5:9b`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen3.5:latest`	131K	8K	free	free	free	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools

OpenAI¶

Model ID	Context	Max Output	Input $/1M	Output $/1M	Cached Input $/1M	Capabilities
`gpt-4.1`	1M	32K	$2.00	$8.00	$0.50	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`gpt-4.1-mini`	1M	32K	$0.40	$1.60	$0.10	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`gpt-5`	400K	128K	$1.25	$10.00	$0.13	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`gpt-5-mini`	400K	128K	$0.25	$2.00	$0.03	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`gpt-5.2`	400K	128K	$1.75	$14.00	$0.18	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools

OpenRouter¶

Model ID	Context	Max Output	Input $/1M	Output $/1M	Cached Input $/1M	Capabilities
`openai/gpt-oss-120b:free`	131K	131K	free	free	free	basic-json-schema, completion, document, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools
`qwen/qwen3-vl-8b-instruct`	131K	32K	$0.08	$0.50	$0.08	basic-json-schema, completion, document, image, multipleChoices, openai-endpoint-chat-completions, openai-endpoint-responses, speculation, standard-json-schema, temperature, toolChoice, tools

Using Built-in Models in YAML Config¶

Reference any model above by its ID:

providers:
  "openai":
    models:
    - id: "gpt-4.1"
    - id: "gpt-4.1-mini"
      cost:
        input_per_million: 0.3

When using a custom endpoint, specify the model specs explicitly. See the tables above for reference values:

providers:
  "my_gateway":
    type: "openai_compatible"
    base_url: "https://gateway.example.com/v1"
    models:
    - id: "my-gpt4-deployment"
      vision: true
      context_length: 1048576
      max_output_tokens: 32768

NOTE: THIS IS GENERATED DOCUMENTATION