Compare S, M and L
Which size for which organisation?
All boxes share the same stack, the same behaviour, and the same sovereignty guarantees. What changes is capacity, supported models, and price.
gbox S
For teams of 20-40 people
Compact, silent, plugs into a standard wall outlet.
Concurrent users
up to 40
Avg throughput
≈ 28 tok/s
Time-to-first-token
≈ 800 ms
Power · noise
65 W · 22 dB
★ Most popular
gbox M
For mid-market organisations of 50-150 people
Reference build: enough horsepower to run Gemma 4 31B comfortably, no exotic hardware required.
Concurrent users
up to 150
Avg throughput
≈ 52 tok/s
Time-to-first-token
≈ 450 ms
Power · noise
180 W · 32 dB
gbox L
For 150+ users or multi-site
Maximalist build: 192 GB unified RAM for the heaviest models or hundreds of concurrent users.
Concurrent users
up to 400
Avg throughput
≈ 85 tok/s
Time-to-first-token
≈ 280 ms
Power · noise
350 W · 40 dB
Full specifications
| Attribute | gbox S | gbox M ★ | gbox L |
|---|---|---|---|
| Hardware | | | |
| Form factor | Compact desktop · 5 × 19.7 × 19.7 cm | Compact Mac Studio · 9.5 × 19.7 × 19.7 cm | Mac Studio M3 Ultra MAX or rack-mountable Linux 1U |
| Processor | Apple M4 Pro · 14 CPU cores · 20 GPU cores | Apple M3 Ultra · 32 CPU cores · 80 GPU cores | Apple M3 Ultra max or AMD EPYC 7763 (x86 variant) |
| Unified RAM | 64 GB | 96 GB | 192 GB |
| Storage | 2 TB | 4 TB | 8 TB |
| AI accelerator | 16-core Apple Neural Engine (38 TOPS) | 32-core Apple Neural Engine (60 TOPS) · 80-core GPU | 32-core Apple Neural Engine (Mac) or 2 × NVIDIA L40S (x86 variant) |
| Network | 1 × 10 GbE · Wi-Fi 6E · Thunderbolt 4 | 2 × 10 GbE · Wi-Fi 6E · 6 × Thunderbolt 4 | 2 × 10 GbE · optional 1 × 25 GbE · IPMI (x86 variant) |
| Physical | | | |
| Dimensions (h × w × d) | 5 × 19.7 × 19.7 cm | 9.5 × 19.7 × 19.7 cm | Compact Mac Studio, or 19" 1U rack |
| Weight | 1.4 kg | 3.6 kg | 3.6 kg |
| Power draw | 65 W | 180 W | 350 W |
| Noise level | 22 dB | 32 dB | 40 dB |
| Performance | | | |
| Concurrent users | 40 users | 150 users | 400 users |
| Avg throughput | ≈ 28 tok/s | ≈ 52 tok/s | ≈ 85 tok/s |
| Time-to-first-token | ≈ 800 ms | ≈ 450 ms | ≈ 280 ms |
| Models | | | |
| Supported models | Gemma 4 9B · Mistral Small 3 · Llama 3 8B · Qwen 2.5 7B | Gemma 4 31B · Mistral Large 2 · Llama 3 70B (q4) · Qwen 2.5 32B · Codestral 22B | Gemma 4 31B fp16 · Llama 3 70B · Qwen 2.5 72B · Mixtral 8x22B · DeepSeek V3 (q4) |
| Pricing | | | |
| CAPEX (one-off) | 12 000 € | 25 000 € | 38 000 € |
| Annual support | 4 800 € / year | 9 600 € / year | 14 400 € / year |
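
For budgeting, the pricing rows above can be combined into a multi-year total. The sketch below is only an illustration: it assumes the cost model is simply CAPEX plus one support fee per year of ownership, billed from the first year. The function name `total_cost` and the three-year horizon are hypothetical, and power, hosting, and taxes are left out.

```python
# Prices in euros, copied from the pricing rows of the table above.
PRICING = {
    "S": {"capex": 12_000, "annual_support": 4_800},
    "M": {"capex": 25_000, "annual_support": 9_600},
    "L": {"capex": 38_000, "annual_support": 14_400},
}


def total_cost(size: str, years: int) -> int:
    """Return CAPEX plus annual support over `years` (assumed cost model)."""
    if years < 0:
        raise ValueError("years must be non-negative")
    price = PRICING[size]
    return price["capex"] + years * price["annual_support"]


if __name__ == "__main__":
    for size in PRICING:
        print(f"gbox {size}: {total_cost(size, 3):,} EUR over 3 years")
```

Under these assumptions, three years of ownership comes to 26 400 € for the S, 53 800 € for the M, and 81 200 € for the L.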
Decision helper
How to choose?
gbox S
Starting or piloting
6-month PoC, pilot team, 30-person subsidiary, or consulting firm with occasional needs. The S ticks all the boxes at a reasonable price.
gbox M
Deploying at scale
Our default recommendation for mid-market organisations of 50–150 people: 30B models at full resolution, low latency, no exotic hardware needed.
gbox L
You have hard constraints
Multi-site, 70B+ models, > 80 tok/s, 1U datacenter rack (Linux x86), or heavy concurrent loads.
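
The decision helper above can be written down as a small rule set. This is a minimal sketch, not an official sizing tool: the capacity and throughput figures come from the comparison on this page, while the function `recommend`, its flag names, and the "smallest box that fits" rule are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    name: str
    max_users: int   # "Concurrent users" figure from this page
    tok_per_s: int   # approximate "Avg throughput" figure from this page


# Ordered from smallest to largest so the first match is the smallest fit.
BOXES = [Box("S", 40, 28), Box("M", 150, 52), Box("L", 400, 85)]


def recommend(users: int, min_tok_per_s: int = 0, *, multi_site: bool = False,
              needs_70b_plus: bool = False, needs_rack: bool = False) -> str:
    """Return the smallest gbox size that meets the stated needs."""
    # Hard constraints from the decision helper always point to the L.
    if multi_site or needs_70b_plus or needs_rack:
        return "L"
    for box in BOXES:
        if users <= box.max_users and min_tok_per_s <= box.tok_per_s:
            return box.name
    raise ValueError("requirements exceed the published gbox L figures")


if __name__ == "__main__":
    print(recommend(30))                     # small pilot team
    print(recommend(100))                    # mid-market deployment
    print(recommend(20, needs_rack=True))    # hard constraint
```

A table-driven check keeps the thresholds in one place, so they are easy to update if the published figures change.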