Real hardware. Real numbers. No vendor spin.
101.6tok/s · mistral:7b-instruct-v0.3-q4_0 · AMD Ryzen 9 3950X + RX 9070 XT
58.5tok/s · mistral:latest · AMD Ryzen 7 5700G + RX 5700 XT
11.0tok/s · bifrost-1a-overflow · AMD Ryzen 7 5700G iGPU Vega 8
131.8tok/s · Qwen3-1.7B-GGUF · AMD Ryzen AI Max+ 395 (Radeon 8060S)
52.9tok/s · llama3.2:1b · AMD Ryzen AI Max+ 395 XDNA2 NPU
5.0tok/s · llama3.3:70b · AMD Ryzen AI Max+ 395 (Radeon 8060S) [llama-server T2.5]
FLEET INTELLIGENCE live from Prometheus · 2026-05-08
100%Local Inference
1Total Requests
nanmsp50 Latency
0Cloud Requests
Band distribution: COMPLEX: 2 · TRIVIAL: 1
AMD Ryzen 9 3950X + RX 9070 XT GPU AMD Radeon RX 9070 XT 16GB RDNA4 · OS Windows 11 Pro · ROCm 7.1 gfx1201
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
qwen3:14bollama2525623.5bifrost
qwen3:14bollama2525652.7bifrost
bifrost-t1bollama4310545.7bifrost
bifrost-t1bollama4310546.7bifrost
bifrost-t1bollama4310647.1bifrost
bifrost-t1bollama4310546.1bifrost
bifrost-t1bollama431049.0bifrost
gemma4:e4bollama3130028.7bifrost
gemma4:e4bollama3025631.0bifrost
bifrost-t1bollamaNone6943.4bifrost
bifrost-t1bollama4310545.9bifrost
bifrost-t1bollama431066.9bifrost
bifrost-t1bollama4310740.1bifrost
qwen3:14bollama2525619.4bifrost
qwen3:14bollama2525651.6bifrost
mistral:7b-instruct-v0.3-q4_0ollama2125631.0bifrost
mistral:7b-instruct-v0.3-q4_0ollama21256101.6bifrost
gemma4:e4bollama302566.9bifrost
gemma4:e4bollama3025626.9bifrost
phi4:14bollama2525612.9bifrost
phi4:14bollama2525652.6bifrost
qwen3:14b4725655.4
qwen3:14b472568.5
gemma4:e4b4725631.3
gemma4:e4b4725616.6
AMD Ryzen 7 5700G + RX 5700 XT GPU RX 5700 XT 8GB RDNA1 · OS Windows 11 · ROCm N/A
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
bifrost-1a-hearthollama164286.3hearth
bifrost-1a-hearthollama1642923.8hearth
mistral:latestollama213007.9hearth
mistral:latestollama2125656.6hearth
mistral:latestollamaNone10858.5hearth
mistral:latestollama2125656.9hearth
bifrost-1a-hearthollama164271.7hearth
bifrost-1a-hearthollama164283.8hearth
bifrost-1a-hearthollama1642623.4hearth
bifrost-1a-hearthollama1642623.0hearth
qwen3.5:9btimed out
qwen3.5:9btimed out
mistral:7bollama212562.8hearth
mistral:7bollama212562.8hearth
mistral:7bollama2125656.1hearth
mistral:7bollama2124956.1hearth
mistral:7bollama2119657.3hearth
AMD Ryzen 7 5700G iGPU Vega 8 GPU Vega 8 iGPU 16GB GTT · OS Windows 11 · ROCm N/A
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
bifrost-1a-overflowollama13725611.0hearth-vega8
bifrost-distillollama8472566.1hearth-vega8
qwen3.5:4btimed out
qwen3.5:4btimed out
AMD Ryzen AI Max+ 395 (Radeon 8060S) GPU Radeon 8060S 96GB unified · OS Ubuntu 24.04 · ROCm 7.2 gfx1151
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
bifrost-t2-gemma4ollama17725649.9forge
bifrost-t2-gemma4ollama17725647.9forge
bifrost-t2-gemma4ollama17725647.9forge
bifrost-t2-gemma4ollama17830045.5forge
gemma4:e4bollama3130052.6forge
bifrost-t2-gemma4ollamaNone20049.8forge
bifrost-forge-t1ollamaNone10258.0forge
bifrost-t2-gemma4ollama17825648.4forge
bifrost-forge-t1ollama3225646.3forge
bifrost-t2-gemma4ollama17825648.9forge
gpt-oss:120bollama822568.6forge
gpt-oss:120bollama8225635.0forge
gpt-oss:120bollama822568.6forge
gpt-oss:120bollama8225635.1forge
bifrost-t2-qwen3ollama47201263.1
bifrost-t2-qwen3ollama154450160.3
bifrost-t2-qwen3ollama47209462.8
bifrost-t2p5-r1ollama369615.0
bifrost-t2p5-r1timed out
bifrost-t2p5-r1timed out
Qwen3-1.7B-GGUFlemonade19512131.8
Qwen3-1.7B-GGUFlemonade126512128.4
AMD Ryzen AI Max+ 395 XDNA2 NPU GPU XDNA2 NPU — 0W GPU draw · OS Ubuntu 24.04 · ROCm FLM NPU
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
qwen3:1.7bollama2825635.9forge-npu
qwen3:1.7bollama2825636.9forge-npu
gemma3:4bollama2425616.8forge-npu
qwen3:1.7bollama2830036.7forge-npu
llama3.2:1bflmNone13449.0forge-npu
llama3.2:1bollama5025652.5forge-npu
llama3.2:1bollama5025652.9forge-npu
gpt-oss:20bollama822567.3forge-npu
gpt-oss:20bollama8225615.1forge-npu
AMD Ryzen AI Max+ 395 (Radeon 8060S) [llama-server T2.5] GPU Radeon 8060S 96GB unified · OS Ubuntu 24.04 · ROCm 7.2 gfx1151 (llamacpp-rocm)
MODELBACKENDPROMPTOUTPUTTOK/SHARDWARE
llama3.3:70bllama_server502565.0forge-t25
llama3.3:70b-Q4_K_Mllama_serverNone1185.0forge-t25
llama3.3:70bllama_server502565.0forge-t25
llama3.3:70bllama_server502565.0forge-t25
llama3.3:70bllama_server502565.0forge-t25
llama3.3:70bllama_server502565.0forge-t25
Llama-3.3-70B-Instruct-Q4_K_M.ggufllama_server502565.0forge-t25
Llama-3.3-70B-Instruct-Q4_K_M.ggufllama_server502565.0forge-t25