I wanted to see how vLLM's speed in RPC mode degrades with input length. It was done with GLM 4.5 Air and 2x2 3090s. While doing this I realized the host should be on the computer with the better CPU...