What Are LLMs?

Large language models (LLMs) are the underlying AI systems that power coding tools and agentic platforms. This section covers model capabilities, pricing, benchmarks, and selection guidance, not the tools built on them.

Available Model Guides

Claude Opus 4.5

Anthropic’s flagship reasoning model, with the highest SWE-bench score of the models listed here at 80.9%.

Kimi k2.5

Moonshot AI’s high-performing model with free access options. 76.8% SWE-bench.

Gemini 3 Flash

Google’s model with a 1M-token context window and free input tokens. 78.0% SWE-bench.

GLM 4.7

Zhipu AI’s model available through OpenCode and other platforms.

Quick Selection Guide

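| Model | SWE-bench | Price per 1M tokens (input/output) | Best For |
| --- | --- | --- | --- |
| Claude Opus 4.5 | 80.9% | $5/$25 | Maximum reasoning, safety-critical |
| Gemini 3 Flash | 78.0% | Free/$0.15 | High context, cost-sensitive |
| Kimi k2.5 | 76.8% | $3/$3 | Best value, free access available |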
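
To make the price column concrete, here is a minimal Python sketch that estimates what a workload would cost on each model in the table and picks the cheapest one above a SWE-bench threshold. It assumes the two prices in each row are USD per 1M input tokens and per 1M output tokens, and it treats "Free" as 0.0. The names `ModelPrice`, `estimate_cost`, and `cheapest_meeting`, along with the token counts in the usage lines, are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelPrice:
    name: str
    swe_bench: float      # SWE-bench score in percent, copied from the table
    input_per_1m: float   # assumed: USD per 1M input tokens ("Free" -> 0.0)
    output_per_1m: float  # assumed: USD per 1M output tokens


# Rows copied from the Quick Selection Guide table.
MODELS = [
    ModelPrice("Claude Opus 4.5", 80.9, 5.00, 25.00),
    ModelPrice("Gemini 3 Flash", 78.0, 0.00, 0.15),
    ModelPrice("Kimi k2.5", 76.8, 3.00, 3.00),
]


def estimate_cost(model: ModelPrice, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    total = input_tokens * model.input_per_1m + output_tokens * model.output_per_1m
    return total / 1_000_000


def cheapest_meeting(min_swe_bench: float, input_tokens: int,
                     output_tokens: int) -> Optional[ModelPrice]:
    """Return the cheapest model scoring at least min_swe_bench, or None."""
    candidates = [m for m in MODELS if m.swe_bench >= min_swe_bench]
    if not candidates:
        return None
    return min(candidates, key=lambda m: estimate_cost(m, input_tokens, output_tokens))


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for m in MODELS:
        print(f"{m.name}: ${estimate_cost(m, 2_000_000, 500_000):.3f}")
    pick = cheapest_meeting(78.0, 2_000_000, 500_000)
    print("Cheapest with SWE-bench >= 78.0%:", pick.name if pick else "none")
```

A sketch like this ignores free-tier limits, caching discounts, and rate limits, so treat its output as a rough comparison rather than a billing estimate.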

See Budget Tier, Mid-Range, and Premium comparisons for detailed analysis.