xAI

Grok 4.1

Latest xAI. Heavy RL + tool-use training. 56.8% SWE-Bench Pro.

8.1

9 reviews

Input Cost$0.20/M
Output Cost$0.50/M
Context Window256K tokens
Capabilities
VisionFunctionsTools

Community Ratings

Speed Feel
9.0
Value for Money
8.9
Creative Writing
8.8
Context Handling
8.4
Tool Use
8.4
Multi-turn Coherence
8.3
Instruction Following
7.4
Consistency
7.0
Code Quality
6.3

Used this model? Share your experience.

Write a Review

Reviews (9)

Coding1mo ago
D
@dev_reviewer_1Editorial
8.5
Tool Use1mo ago

Frontier tool-calling from heavy RL training. Half the hallucination rate of Grok 4 Fast.

D
@dev_reviewer_2Editorial
8.8
Tool Use1mo ago

Frontier tool-calling from heavy RL training. Half the hallucination rate of Grok 4 Fast.

Tool Use1mo ago

Frontier tool-calling from heavy RL training. Half the hallucination rate of Grok 4 Fast.

8.7
Creative Writing1mo ago

1722 Elo on Creative Writing v3 — 600 points above xAI's previous best. Only GPT-5.1 is higher.

8.1
Creative Writing1mo ago

1722 Elo on Creative Writing v3 — 600 points above xAI's previous best. Only GPT-5.1 is higher.

8.4
Creative Writing1mo ago

1722 Elo on Creative Writing v3 — 600 points above xAI's previous best. Only GPT-5.1 is higher.

8.3
Coding1mo ago

#1 LMArena with 1483 Elo. Coding is decent but official benchmarks skip SWE-bench suspiciously.

8.0
Coding1mo ago

#1 LMArena with 1483 Elo. Coding is decent but official benchmarks skip SWE-bench suspiciously.