Model Selection
Qodo Gen lets you choose between multiple AI models, each with different strengths in speed, reasoning, and context handling.
Available Models
GPT-5: one family with modes you pick based on speed versus depth:
Fast / Standard – a medium-fast general-purpose mode; a great default for coding and reasoning when a bit of latency is acceptable.
Thinking – slower, deeper reasoning for complex debugging and architecture discussions.
Pro – slowest, highest rigor for mission-critical changes and long, multi-step work.
Claude Sonnet: Claude's fast, responsive variant, excellent for interactive chat and frequent short prompts where latency matters most.
Claude Opus: deeper reasoning than Sonnet with moderate latency; solid for multi-file refactors and detailed code edits.
Gemini 2.5 Pro: strong at large-context and multimodal work; a good choice when you need to reason over many files or combine text with images.
Grok (xAI): fast, witty, and concise; well suited for brainstorming, natural-language exploration, and lighter reasoning tasks. A good option when you want responsiveness with personality, though not as strong for very large contexts or enterprise-grade coding.
GPT-4.1: a balanced model family with improvements in coding, instruction following, and long-context handling; use it when you want solid reasoning with broadly compatible tooling and don't need GPT-5's heaviest modes.
OpenAI o-series (o3, o4-mini): reasoning-focused models trained to "think longer" before responding.
o3 – higher-end reasoning for hard problems; a good choice when you want strong tool use and stepwise problem solving.
o4-mini – smaller and faster while still strong at coding, math, and visual tasks; great when you want low latency with good reasoning.
Below is a comparison of Qodo Gen's available models. Each model offers different trade-offs in speed (latency), reasoning depth, cost, and best use cases. Use this guide to choose the model that matches your task priorities.
| Model | Speed | Best for | Avoid when |
| --- | --- | --- | --- |
| GPT-5 (Standard) | Medium; still slower than Claude Sonnet for short prompts | Complex reasoning, large context (big docs or codebases), tool use, high-accuracy tasks | You need instant feedback, live chat, or very short prompts where latency is critical |
| Claude Sonnet | Fast; very low latency, usually faster than GPT-5 | Interactive UX, short queries, prototyping, frequent lightweight tasks | You need very large context or advanced reasoning beyond its depth |
| Claude Opus | Medium; slower than Sonnet but often quicker than GPT-5 | A balanced mix of speed and reasoning, multi-file refactors, reliable code edits | Trivial tasks where Sonnet is sufficient, or cost-sensitive workflows |
| Gemini 2.5 Pro | Moderate; can be slower than Claude for short tasks, but optimized for huge contexts | Very large codebases or documents, multimodal reasoning, context-heavy workflows | Simple or short prompts (the extra context overhead slows it down), or interactive tasks needing instant replies |
Using Model Selection
Select a model from the dropdown menu at the bottom left, below the chat box. The selected model will be used for your next query.