Skip to content

Instantly share code, notes, and snippets.

@armand1m
armand1m / qwen3-vllm.sh
Last active February 22, 2026 22:03
qwen3-coder-next - vllm 0.15.1 - transformers 5 - optimized for dgx spark
#!/bin/bash
docker run -d \
--name vllm \
--restart unless-stopped \
--gpus all \
--ipc host \
--shm-size 64gb \
--memory 110g \
--memory-swap 120g \
--pids-limit 4096 \