
claude-opus-4-20250514
input price:15.0 out price:75.0
Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in software engineering, achieving leading results on SWE-bench (72.5%) and Terminal-bench (43.2%). Opus 4 supports extended, agentic workflows, handling thousands of task steps continuously for hours without degradation.
Tiered Pricing

claude-opus-4-5-20251101
input price:5.0 out price:25.0
Claude Opus 4.5 is Anthropic's latest frontier reasoning model, purpose-built for complex software engineering, agentic workflows, and long-horizon computer use. It delivers strong multimodal capabilities, competitive performance on real-world coding and reasoning benchmarks, and improved robustness against prompt injection attacks. The model is designed to operate efficiently across varied effort levels, allowing developers to balance speed, depth, and token usage based on their specific task requirements—you can fine-tune token efficiency through the OpenRouter Verbosity parameter, which offers low, medium, and high settings. Beyond that, Opus 4.5 supports advanced tool use, extended context management, and coordinated multi-agent setups, making it ideal for autonomous research, debugging, multi-step planning, and spreadsheet or browser manipulation. Compared to previous Opus generations, it brings substantial improvements in structured reasoning, execution reliability, and alignment, while reducing token overhead and delivering more consistent performance on long-running tasks.
Tiered Pricing

gemini-3.1-flash-image-preview
input price:0.5 out price:3.0
Designed for speed and efficiency, the Gemini 3.1 Flash Image generation model is effective for quick, interactive responses and high throughput. Preview models may change before becoming stable and have more restrictive rate limits.
Tiered Pricing

gemini-2.5-flash-image
inputTokensPrice:0.30 outputTokensPrice:30.00
Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state of the art image generation model with contextual understanding. It is capable of image generation, edits, and multi-turn conversations.
Usage-Based Pricing

gpt-5.2-chat
inputTokensPrice:1.750 outputTokensPrice:14.000
GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on harder queries, improving accuracy on math, coding, and multi-step tasks without slowing down typical conversations. The model is warmer and more conversational by default, with better instruction following and more stable short-form reasoning. GPT-5.2 Chat is designed for high-throughput, interactive workloads where responsiveness and consistency matter more than deep deliberation.
Usage-Based Pricing