xAI

Grok 4.1

Latest xAI. Heavy RL + tool-use training. 56.8% SWE-Bench Pro.

8.4

8 reviews

Input Cost$0.20/M
Output Cost$0.50/M
Context Window256K tokens
Capabilities
VisionFunctionsTools

Community Ratings

Speed Feel
9.3
Value for Money
9.0
Creative Writing
8.8
Context Handling
8.6
Tool Use
8.4
Multi-turn Coherence
8.3
Instruction Following
8.0
Consistency
7.1
Code Quality
7.0

Used this model? Share your experience.

Write a Review

Reviews (8)

D
@dev_reviewer_1Editorial
8.5
Tool Use9h ago

Frontier tool-calling from heavy RL training. Half the hallucination rate of Grok 4 Fast.

D
@dev_reviewer_2Editorial
8.8
Tool Use9h ago

Frontier tool-calling from heavy RL training. Half the hallucination rate of Grok 4 Fast.

Tool Use9h ago

Frontier tool-calling from heavy RL training. Half the hallucination rate of Grok 4 Fast.

8.7
Creative Writing9h ago

1722 Elo on Creative Writing v3 — 600 points above xAI's previous best. Only GPT-5.1 is higher.

8.1
Creative Writing9h ago

1722 Elo on Creative Writing v3 — 600 points above xAI's previous best. Only GPT-5.1 is higher.

8.4
Creative Writing9h ago

1722 Elo on Creative Writing v3 — 600 points above xAI's previous best. Only GPT-5.1 is higher.

8.3
Coding9h ago

#1 LMArena with 1483 Elo. Coding is decent but official benchmarks skip SWE-bench suspiciously.

8.0
Coding9h ago

#1 LMArena with 1483 Elo. Coding is decent but official benchmarks skip SWE-bench suspiciously.