The current model streamer implementation is highly effective for single-process model loading and is already integrated with vLLM's sharded model loader. However, vLLM's default loader behavior for ...