artok عرتوك
See how much more Arabic costs across 18 LLM tokenizers. Benchmark scores, diacritics analysis, dialect comparison.
5x
as many tokens for Arabic as for English on Claude, for the same sentence. Arabic speakers pay a hidden tax on every API call.
Chapter I
The Problem
Same meaning. Different cost.
Arabic
الذكاء الاصطناعي يغير العالم
25
tokens (Claude)
vs
English
AI is changing the world
5
tokens (Claude)
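The arithmetic behind the 5x figure can be sketched in a few lines. The token counts are the ones quoted above for Claude; the helper name `token_ratio` is hypothetical and is not part of artok.

```python
def token_ratio(arabic_tokens: int, english_tokens: int) -> float:
    """Return how many Arabic tokens are spent per English token."""
    if english_tokens <= 0:
        raise ValueError("english_tokens must be positive")
    return arabic_tokens / english_tokens


# Token counts quoted above for the same sentence on Claude.
ratio = token_ratio(25, 5)
# Extra tokens relative to the English baseline, as a percentage.
extra_pct = (ratio - 1) * 100

print(f"{ratio:.1f}x the tokens, {extra_pct:.0f}% extra")
# prints: 5.0x the tokens, 400% extra
```

Since API pricing is per token, the same ratio carries over directly to the bill for that sentence.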
Chapter II
See It
One command. Full picture.
Chapter III
The Benchmark
Arabic friendliness score across 8 text categories: news, poetry, Quran, technical, conversational, Egyptian dialect, Gulf dialect, social media.
| Tokenizer | Score /100 |
| Mistral Large 3 | 92.1 |
| Qwen 3.5 | 91.7 |
| GPT-4.1 | 91.1 |
| Gemini 2.5 Pro | 90.2 |
| DeepSeek V3.2 | 83.8 |
| Llama 4 | 83.6 |
| Jamba 1.5 | 75.3 |
| Grok 2 | 50.3 |
| GPT-4 (legacy) | 44.5 |
| Claude Sonnet 4.6 | 25.6 |
Chapter IV
Features
22 flags. Everything you need.
- --benchmark
Arabic friendliness score 0-100 per tokenizer
- --tashkeel
How diacritics inflate token counts
- --heatmap
Color each word by token cost
- --dialects
MSA vs Egyptian vs Gulf vs Levantine vs Moroccan
- --leaderboard
Rank by efficiency + cost + value
- --switch-from
Savings from switching providers
- --compare-langs
Arabic vs English vs French vs any language
- --cost / --words
Estimate monthly costs at scale
- --recommend
Best tokenizer for your budget
- --viz
Colored token split visualization
- --chart
Visual bar chart comparison
- --url
Analyze Arabic text from any URL
- --batch
Process JSONL/CSV files
- --export
Export results as SVG
- --watch
Live mode — type and see counts
- --update
Fetch latest pricing from GitHub
- --json
JSON output for scripting
- -e
Compare Arabic vs English
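For scripting, the `--json` flag could be driven from Python roughly as follows. This is a minimal sketch under stated assumptions: that `artok` is on your PATH, that `-e` and `--json` can be combined in one call, and that the tool writes a single JSON document to stdout. The output schema is not documented here, so the sketch returns the parsed data without assuming any field names. `build_command` and `run_artok` are hypothetical helper names.

```python
import json
import subprocess


def build_command(arabic: str, english: str) -> list[str]:
    """Assemble the argument list using flags from the list above."""
    # Assumption: -e and --json can be combined in one invocation.
    return ["artok", arabic, "-e", english, "--json"]


def run_artok(arabic: str, english: str):
    """Run artok and return its parsed JSON output.

    Assumption: artok is installed and prints one JSON document to stdout.
    """
    result = subprocess.run(
        build_command(arabic, english),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return json.loads(result.stdout)
```

Calling `run_artok("الذكاء الاصطناعي يغير العالم", "AI is changing the world")` would then give you a Python object to inspect before relying on any particular keys.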
Chapter V
18 Tokenizers
10 providers. Refresh pricing with artok --update.
| Tokenizer | Provider | Input $/1M | Output $/1M |
| GPT-4.1 | OpenAI | $2.00 | $8.00 |
| GPT-4.1 mini | OpenAI | $0.40 | $1.60 |
| GPT-4.1 nano | OpenAI | $0.10 | $0.40 |
| GPT-4o | OpenAI | $2.50 | $10.00 |
| GPT-4o mini | OpenAI | $0.15 | $0.60 |
| Claude Opus 4.6 | Anthropic | $5.00 | $25.00 |
| Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 |
| Llama 4 | Meta | $0.18 | $0.18 |
| Qwen 3.5 | Alibaba | $0.10 | $0.40 |
| Mistral Large 3 | Mistral | $0.50 | $1.50 |
| Mistral Small | Mistral | $0.10 | $0.30 |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 |
| Gemini 3 Flash | Google | $0.50 | $3.00 |
| DeepSeek V3.2 | DeepSeek | $0.27 | $1.10 |
| Grok 2 | xAI | $2.00 | $10.00 |
| Command R+ | Cohere | $2.50 | $10.00 |
| Jamba 1.5 | AI21 | $0.20 | $0.40 |
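To see what the per-token prices mean at scale, here is a rough sketch of a monthly input-cost estimate. The prices are copied from the table above; the monthly volume is a made-up figure, and the 5x multiplier comes from the single-sentence Claude example earlier, so treat it as illustrative and not as a measured average. `monthly_input_cost` is a hypothetical helper, not artok's own calculation.

```python
# Input prices in USD per 1M tokens, copied from the table above.
INPUT_PRICE_PER_MILLION = {
    "GPT-4.1": 2.00,
    "Claude Sonnet 4.6": 3.00,
    "Qwen 3.5": 0.10,
}


def monthly_input_cost(tokens_per_month: int, price_per_million: float) -> float:
    """Cost in USD for a month of input tokens at a given per-1M price."""
    return tokens_per_month / 1_000_000 * price_per_million


english_tokens = 100_000_000  # hypothetical monthly volume if the text were English
arabic_multiplier = 5         # assumption: taken from the one-sentence Claude example

price = INPUT_PRICE_PER_MILLION["Claude Sonnet 4.6"]
english_cost = monthly_input_cost(english_tokens, price)
arabic_cost = monthly_input_cost(english_tokens * arabic_multiplier, price)

print(f"English: ${english_cost:,.2f}  Arabic: ${arabic_cost:,.2f}")
# prints: English: $300.00  Arabic: $1,500.00
```

Output tokens are priced separately and would need the same calculation with the output column.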
Chapter VI
Get Started
# Install
$ git clone https://github.com/Moshe-ship/artok.git
$ cd artok && pip install -e ".[all]"
# See the tax
$ artok "الذكاء الاصطناعي يغير العالم" -e "AI is changing the world"
# Benchmark
$ artok --benchmark
# Dialects
$ artok --dialects
# Tashkeel
$ artok "بِسْمِ اللَّهِ الرَّحْمَنِ الرَّحِيمِ" --tashkeel
# Stay current
$ artok --update