You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
16 lines
525 B
16 lines
525 B
# Initialize the VLLM wrapper (model loads on first use)
|
|
The first call to `run()` will download and load the model.
|
|
|
|
# Initialize the VLLM wrapper (model loads on first use)
|
|
# The model is loaded when `run()` is first invoked
|
|
|
|
# Model initialization is deferred until `run()` is called
|
|
|
|
# Initialize the vLLM wrapper (weights load on first run)
|
|
The model is loaded lazily when `run()` is invoked.
|
|
|
|
# The weights are loaded lazily when you first call `run()`
|
|
|
|
# Model weights are loaded during the first call to `run()`
|
|
|