xAI

Grok 4.1

Latest xAI model. Heavy RL and tool-use training. 56.8% on SWE-Bench Pro.

8.1

9 reviews

Input Cost: $0.20/M

Output Cost: $0.50/M

Context Window: 256K tokens
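
To make the listed prices concrete, here is a minimal sketch of a per-request cost estimate. It assumes "/M" means per million tokens and that input and output are billed separately at the rates shown above; the function name and the sample token counts are hypothetical, not part of any xAI API.

```python
# Rates copied from the listing; "/M" is assumed to mean per 1,000,000 tokens.
INPUT_USD_PER_M = 0.20
OUTPUT_USD_PER_M = 0.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative request: 100,000 input tokens and 5,000 output tokens.
    print(f"${estimate_cost_usd(100_000, 5_000):.4f}")
```

Under these assumptions, a request with 100,000 input tokens and 5,000 output tokens costs about $0.0225. Actual billing may differ (for example cached-input or tool-call pricing, which the listing does not cover).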

Capabilities

Vision, Functions, Tools

Community Ratings

Speed Feel

9.0

Value for Money

8.9

Creative Writing

8.8

Context Handling

8.4

Tool Use

8.4

Multi-turn Coherence

8.3

Instruction Following

7.4

Consistency

7.0

Code Quality

6.3

Reviews (9)

@renamebyduy-afk

6.0

Coding · 3mo ago

@dev_reviewer_1 (Editorial)

8.5

Tool Use · 3mo ago

Frontier tool-calling from heavy RL training. Half the hallucination rate of Grok 4 Fast.

@dev_reviewer_2 (Editorial)

8.8

Tool Use · 3mo ago

Frontier tool-calling from heavy RL training. Half the hallucination rate of Grok 4 Fast.

@llmmatrix-editorial (Editorial)

8.3

Tool Use · 3mo ago

Frontier tool-calling from heavy RL training. Half the hallucination rate of Grok 4 Fast.

@dev_reviewer_19 (Editorial)

8.7

Creative Writing · 3mo ago

1722 Elo on Creative Writing v3 — 600 points above xAI's previous best. Only GPT-5.1 is higher.

@dev_reviewer_20 (Editorial)

8.1

Creative Writing · 3mo ago

1722 Elo on Creative Writing v3 — 600 points above xAI's previous best. Only GPT-5.1 is higher.

@dev_reviewer_18 (Editorial)

8.4

Creative Writing · 3mo ago

1722 Elo on Creative Writing v3 — 600 points above xAI's previous best. Only GPT-5.1 is higher.

@dev_reviewer_16 (Editorial)

8.3

Coding · 3mo ago

#1 LMArena with 1483 Elo. Coding is decent but official benchmarks skip SWE-bench suspiciously.

@dev_reviewer_17 (Editorial)

8.0

Coding · 3mo ago

#1 LMArena with 1483 Elo. Coding is decent but official benchmarks skip SWE-bench suspiciously.