added Chatbot Server README.md

1 year ago · a116d886fb
parent 0534d08300
commit a116d886fb
1 changed files with 65 additions and 0 deletions
--- a/swarms/server/README.md
+++ b/swarms/server/README.md
@ -0,0 +1,65 @@
+# RAG Chatbot Server
+
+* This server is currently used to host a conversational RAG (Retrieval Augmented Generation) Chatbot.  
+
+* It is currently set up to use OpenAI or vLLM (or other OpenAI compatible APIs such as available via LMStudio or Ollama).  
+
+* Support for Metal is also available if configured in the .env file via the USE_METAL environment variable, but this doesn't apply when using vLLM.  
+
+* Switching from vLLM to another host like Olama requires commenting/uncommenting some code at this time, but will be dynamic later.  
+
+* Support for all the Swarms models will be added but currently the prompts are designed for Llama 2 and 3.  Some work needs to be done to support other prompt formats and models that are not compatible with Llama, such as ChatGPT.
+
+# Running vLLM
+
+Running vLLM in a docker container saves a lot of trouble.  Use dockerRunVllm.sh to set up and start vLLM.  This command will allow you to control vLLM using docker commands:
+
+`docker stop vllm`
+
+`docker start vllm`
+
+`docker attach vllm`
+
+Run the dockerRunVllm.sh command again to get a fresh copy of the latest vLLM docker image (you will be prompted to rename or remove the existing one if the name is the same.)
+
+#Starting the Chatbot API Server
+
+In order to start the server you have to run uvicorn or FastAPI CLI or use the following launch.json in VSCode/Cursor or whatever to debug it.
+
+## Start server with uvicorn
+
+Run the following shell cmd:
+
+`uvicorn swarms.server.server:app --port 8888`
+
+To debug using uvicorn use this launch.json configuration:
+
+`"configurations": [
+        {
+            "name": "Python: FastAPI",
+            "type": "debugpy",
+            "request": "launch",
+            "module": "uvicorn",
+            "args": [
+                "swarms.server.server:app",  // Use dot notation for module path
+                "--reload",
+                "--port",
+                "8888"
+            ],
+            "jinja": true,
+            "justMyCode": true,
+            "env": {
+                "PYTHONPATH": "${workspaceFolder}/swarms"
+            }
+        }
+    ]
+`
+## Start server using FastAPI CLI
+
+You can run the Chatbot server in production mode using FastAPI CLI:
+
+`fastapi run swarms/server/server.py --port 8888'
+
+To run in dev mode use this command:
+
+`fastapi dev swarms/server/server.py --port 8888'