artokعرتوك

See how much more Arabic costs across 18 LLM tokenizers. Benchmark scores, diacritics analysis, dialect comparison.

5x more tokens for Arabic on Claude vs English for the same sentence. Arabic speakers are paying a hidden tax on every API call.

$ pip install artok

Chapter I

The Problem

Same meaning. Different cost.

Arabic

الذكاء الاصطناعي يغير العالم

tokens (Claude)

English

AI is changing the world

tokens (Claude)

Chapter II

See It

One command. Full picture.

Chapter III

The Benchmark

Arabic friendliness score across 8 text categories: news, poetry, Quran, technical, conversational, Egyptian dialect, Gulf dialect, social media.

Tokenizer	Score /100
Mistral Large 3	92.1
Qwen 3.5	91.7
GPT-4.1	91.1
Gemini 2.5 Pro	90.2
Llama 4	83.6
DeepSeek V3.2	83.8
Jamba 1.5	75.3
Grok 2	50.3
GPT-4 (legacy)	44.5
Claude Sonnet 4.6	25.6

Chapter IV

Features

22 flags. Everything you need.

--benchmark
Arabic friendliness score 0-100 per tokenizer
--tashkeel
How diacritics inflate token counts
--heatmap
Color each word by token cost
--dialects
MSA vs Egyptian vs Gulf vs Levantine vs Moroccan
--leaderboard
Rank by efficiency + cost + value
--switch-from
Savings from switching providers
--compare-langs
Arabic vs English vs French vs any language
--cost / --words
Estimate monthly costs at scale
--recommend
Best tokenizer for your budget
--viz
Colored token split visualization
--chart
Visual bar chart comparison
--url
Analyze Arabic text from any URL
--batch
Process JSONL/CSV files
--export
Export results as SVG
--watch
Live mode — type and see counts
--update
Fetch latest pricing from GitHub
--json
JSON output for scripting
-e
Compare Arabic vs English

Chapter V

18 Tokenizers

10 providers. Pricing auto-updates via artok --update.

Tokenizer	Provider	Input $/1M	Output $/1M
GPT-4.1	OpenAI	$2.00	$8.00
GPT-4.1 mini	OpenAI	$0.40	$1.60
GPT-4.1 nano	OpenAI	$0.10	$0.40
GPT-4o	OpenAI	$2.50	$10.00
GPT-4o mini	OpenAI	$0.15	$0.60
Claude Opus 4.6	Anthropic	$5.00	$25.00
Claude Sonnet 4.6	Anthropic	$3.00	$15.00
Claude Haiku 4.5	Anthropic	$1.00	$5.00
Llama 4	Meta	$0.18	$0.18
Qwen 3.5	Alibaba	$0.10	$0.40
Mistral Large 3	Mistral	$0.50	$1.50
Mistral Small	Mistral	$0.10	$0.30
Gemini 2.5 Pro	Google	$1.25	$10.00
Gemini 3 Flash	Google	$0.50	$3.00
DeepSeek V3.2	DeepSeek	$0.27	$1.10
Grok 2	xAI	$2.00	$10.00
Command R+	Cohere	$2.50	$10.00
Jamba 1.5	AI21	$0.20	$0.40

Chapter VI

Get Started

# Install $ git clone https://github.com/Moshe-ship/artok.git $ cd artok && pip install -e ".[all]" # See the tax $ artok "الذكاء الاصطناعي يغير العالم" -e "AI is changing the world" # Benchmark $ artok --benchmark # Dialects $ artok --dialects # Tashkeel $ artok "بِسْمِ اللَّهِ الرَّحْمَنِ الرَّحِيمِ" --tashkeel # Stay current $ artok --update