Real hardware. Real numbers. No vendor spin.
55.4tok/s · qwen3:14b · BIFROST
6.3tok/s · bifrost-1a-hearth · HEARTH
11.0tok/s · bifrost-1a-overflow · HEARTH VEGA8
131.8tok/s · Qwen3-1.7B-GGUF · FORGE GPU
36.9tok/s · qwen3:1.7b · FORGE NPU
FLEET INTELLIGENCE live from Prometheus · 2026-04-10
100%Local Inference
1Total Requests
nanmsp50 Latency
0Cloud Requests
Band distribution: COMPLEX: 1 · TRIVIAL: 1
BIFROST GPU AMD Radeon RX 9070 XT 16GB RDNA4 · OS Windows 11 Pro · ROCm 7.1 gfx1201
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
qwen3:14bollama2525623.5bifrost
qwen3:14bollama2525652.7bifrost
bifrost-t1bollama4310545.7bifrost
bifrost-t1bollama4310546.7bifrost
bifrost-t1bollama4310647.1bifrost
qwen3:14b4725655.4
qwen3:14b472568.5
gemma4:e4b4725631.3
gemma4:e4b4725616.6
HEARTH GPU RX 5700 XT 8GB RDNA1 · OS Windows 11 · ROCm N/A
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
bifrost-1a-hearthollama164286.3hearth
HEARTH VEGA8 GPU Vega 8 iGPU 16GB GTT · OS Windows 11 · ROCm N/A
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
bifrost-1a-overflowollama13725611.0hearth-vega8
FORGE GPU GPU Radeon 8060S 96GB unified · OS Ubuntu 24.04 · ROCm 7.2 gfx1151
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
bifrost-t2-gemma4ollama17725649.9forge
bifrost-t2-qwen3ollama47201263.1
bifrost-t2-qwen3ollama154450160.3
bifrost-t2-qwen3ollama47209462.8
bifrost-t2p5-r1ollama369615.0
bifrost-t2p5-r1timed out
bifrost-t2p5-r1timed out
Qwen3-1.7B-GGUFlemonade19512131.8
Qwen3-1.7B-GGUFlemonade126512128.4
FORGE NPU GPU XDNA2 NPU — 0W GPU draw · OS Ubuntu 24.04 · ROCm FLM NPU
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
qwen3:1.7bollama2825635.9forge-npu
qwen3:1.7bollama2825636.9forge-npu