# Available Models

| Model Name            | Description                                                                                             | Input Price  | Output Price | Use Cases                                                              |
|-----------------------|---------------------------------------------------------------------------------------------------------|--------------|--------------|------------------------------------------------------------------------|
| **Qwen-vl**            | Qwen VL for real-world multi-modal function calling.                                                   | $5/1M Tokens | $10/1M Tokens | Multi-modal interactions and function handling in complex environments.|
| **XComposer2-4khd-7b** | One of the highest performing VLMs (Video Language Models).                                            | $4/1M Tokens | $8/1M Tokens | High-resolution video processing and understanding.                     |


## What models should we add?
[Book a call with us to learn more about your needs:](https://calendly.com/swarm-corp/30min)