5.2 KiB

Raw Blame History

`vLLM` Documentation

Overview
Installation
vLLM Class
- Initialization
- Methods
  - run
Usage Examples
Common Issues and Troubleshooting
References and Resources

Overview

Welcome to the documentation for the vLLM (Variable-Length Language Model) library. vLLM is a powerful tool for generating text using pre-trained language models. This documentation will provide a comprehensive guide on how to use vLLM effectively.

Purpose

vLLM is designed to simplify the process of generating text using language models, specifically the Facebook opt-13b model. It allows you to fine-tune various parameters to achieve the desired text generation outcomes.

Key Features

Seamless integration with the Facebook opt-13b language model.
Flexible configuration options for model parameters.
Support for generating text for various natural language processing tasks.

Installation

Before using vLLM, you need to install swarms. You can install vLLM using pip:

pip install swarms

vLLM Class

The vLLM class is the core component of the vLLM library. It provides a high-level interface for generating text using the Facebook opt-13b language model.

Initialization

To initialize the vLLM class, you can use the following parameters:

model_name (str, optional): The name of the language model to use. Defaults to "facebook/opt-13b".
tensor_parallel_size (int, optional): The size of the tensor parallelism. Defaults to 4.
trust_remote_code (bool, optional): Whether to trust remote code. Defaults to False.
revision (str, optional): The revision of the language model. Defaults to None.
temperature (float, optional): The temperature parameter for text generation. Defaults to 0.5.
top_p (float, optional): The top-p parameter for text generation. Defaults to 0.95.

from swarms.models import vLLM

# Initialize vLLM with default parameters
vllm = vLLM()

# Initialize vLLM with custom parameters
custom_vllm = vLLM(
    model_name="custom/model",
    tensor_parallel_size=8,
    trust_remote_code=True,
    revision="abc123",
    temperature=0.7,
    top_p=0.8
)

Methods

run

The run method is used to generate text using the vLLM model. It takes a task parameter, which is a text prompt or description of the task you want the model to perform. It returns the generated text as a string.

# Generate text using vLLM
result = vllm.run("Generate a creative story about a dragon.")
print(result)

Usage Examples

Here are three usage examples demonstrating different ways to use vLLM:

Example 1: Basic Text Generation

from swarms.models import vLLM

# Initialize vLLM
vllm = vLLM()

# Generate text for a given task
generated_text = vllm.run("Generate a summary of a scientific paper.")
print(generated_text)

Example 2: Custom Model and Parameters

from swarms.models import vLLM

# Initialize vLLM with custom model and parameters
custom_vllm = vLLM(
    model_name="custom/model",
    tensor_parallel_size=8,
    trust_remote_code=True,
    revision="abc123",
    temperature=0.7,
    top_p=0.8
)

# Generate text with custom configuration
generated_text = custom_vllm.run("Create a poem about nature.")
print(generated_text)

Example 3: Batch Processing

from swarms.models import vLLM

# Initialize vLLM
vllm = vLLM()

# Generate multiple texts in batch
tasks = [
    "Translate the following sentence to French: 'Hello, world!'",
    "Write a short story set in a futuristic world.",
    "Summarize the main points of a news article about climate change."
]

for task in tasks:
    generated_text = vllm.run(task)
    print(generated_text)

Common Issues and Troubleshooting

ImportError: If you encounter an ImportError related to vLLM, make sure you have installed it using pip install vllm.
Model Configuration: Ensure that you provide valid model names and configurations when initializing vLLM. Invalid model names or parameters can lead to errors.
Text Generation: Be cautious with text generation parameters like temperature and top_p. Experiment with different values to achieve the desired text quality.

References and Resources

For more information and resources related to vLLM and language models, refer to the following:

This concludes the documentation for the vLLM library. We hope this guide helps you effectively use vLLM for text generation tasks. If you have any questions or encounter issues, please refer to the troubleshooting section or seek assistance from the vLLM community. Happy text generation!

5.2 KiB Raw Blame History

vLLM Documentation

Table of Contents