Compare grok-4-1-fast-non-reasoning with other models

Models

grok-4-1-fast-non-reasoning

xai

Context Length: 2000K tokens
Max Output: 2000K tokens
Input Cost: $0.20/M tokens
Output Cost: $0.50/M tokens
Mode: Chat
Max Input Tokens: 2000K
Max Tokens: 2000K
Provider: xAI
Tool Choice: Yes
Response Schema: Yes
Prompt Caching: Yes
Deprecation Date: 2026-05-15

Comparison Insights

Comprehensive analysis based on the latest model metadata from the comparison table above.

What should I know about grok-4-1-fast-non-reasoning?

Overview

grok-4-1-fast-non-reasoning is a chat model provided by xAI.
It has a context window of 2000K (2 million) tokens, which suits long documents, long conversations, and large codebases.

Pricing

Input processing costs $0.20 per million tokens.
Output generation costs $0.50 per million tokens.
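
As a rough illustration of how these two rates combine, the sketch below estimates the cost of a single request. The function name `estimate_cost_usd` and the example token counts are hypothetical, and the calculation assumes plain per-token billing with no prompt-caching discount or other adjustments.

```python
from decimal import Decimal

# Listed rates for grok-4-1-fast-non-reasoning, in USD per million tokens.
INPUT_USD_PER_MILLION = Decimal("0.20")
OUTPUT_USD_PER_MILLION = Decimal("0.50")


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate request cost, assuming flat per-token billing (no caching discount)."""
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / million
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / million
    return input_cost + output_cost


# Hypothetical request: 100,000 input tokens and 2,000 output tokens.
print(estimate_cost_usd(100_000, 2_000))
```

Under these assumptions, a request that uses one million input tokens and one million output tokens would cost $0.70.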

Output Capabilities

The model can generate up to 2000K tokens in a single response.

Availability

Please note: This model is scheduled for deprecation on 2026-05-15.

What capabilities does grok-4-1-fast-non-reasoning support?

Supports function calling, enabling integration with external tools and APIs for extended functionality.
Includes vision capabilities to process and analyze images alongside text inputs.
Provides web search integration for accessing real-time information and current data.
Accepts audio input, allowing for voice-based interactions and audio processing.
Allows explicit tool selection, giving developers fine-grained control over function execution.
Supports structured response schemas for consistent, predictable output formatting.
Implements prompt caching to reduce costs and latency for repeated or similar queries.

grok-4-1-fast-non-reasoning Pricing Overview

At $0.20 per 1M input tokens and $0.50 per 1M output tokens, grok-4-1-fast-non-reasoning ranks 831 out of 2637 chat models by input cost. Its input cost is below the $0.50 median for chat models, and it is cheaper than 68% of models in this category.
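
The 68% figure is consistent with the stated rank. A minimal sketch of that arithmetic follows, assuming rank 1 is the cheapest model and ignoring ties; the page does not say how it computes the percentage, and the function name `share_more_expensive` is hypothetical.

```python
def share_more_expensive(rank: int, total: int) -> float:
    """Fraction of models ranked after `rank`, assuming rank 1 is the cheapest."""
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return (total - rank) / total


# Rank 831 of 2637 chat models by input cost, as stated above.
share = share_more_expensive(831, 2637)
print(f"{share:.1%}")
```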