quantized models don't have .to

ValueError: `.to` is not supported for `4-bit` or `8-bit` bitsandbytes models. Please use the model as it is, since the model has already been set to the correct devices and casted to the correct `dtype`.
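This check is raised by transformers because bitsandbytes-quantized weights are already placed and cast during loading (via accelerate), so moving or recasting them afterward is not supported. A minimal sketch of the failing versus working pattern; the model id and compute dtype below are placeholders/assumptions, not values from this repo:

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# 4-bit quantization config; float16 compute dtype is a common choice (an assumption here)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,
)

# device_map="auto" lets accelerate place the quantized weights at load time
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m",  # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)

# model.to("cuda")  # would raise the ValueError above: quantized models can't be moved or cast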
pull/450/head
AlexanderLavelle 1 year ago committed by GitHub
parent 8f91a7dd9b
commit 78b1e83c3f

@@ -181,7 +181,7 @@ class HuggingfaceLLM(AbstractLLM):
                 quantization_config=bnb_config,
                 *args,
                 **kwargs,
-            ).to(self.device)
+            )
         else:
             self.model = AutoModelForCausalLM.from_pretrained(
                 self.model_id, *args, **kwargs
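Dropping the trailing .to(self.device) on the quantized branch is the whole fix: when a quantization_config is passed, from_pretrained already dispatches the 4-bit/8-bit weights to their devices, and they cannot be moved or recast afterward. The unquantized else branch is left unchanged.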
