You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
swarms/tests/README.md

3.9 KiB

The pseudocode for unit tests covering the WorkerNode and BossNode might look something like this:

  1. Initialize the WorkerNode and BossNode instances with the necessary dependencies.
  2. Test the create_agent method of the WorkerNode. Ensure it creates an agent as expected.
  3. Test the run_agent method of the WorkerNode. Check if it runs the agent as expected.
  4. Test the create_task method of the BossNode. Check if it creates a task as expected.
  5. Test the execute_task method of the BossNode. Ensure it executes the task as expected.

In Python, this would look something like:

import pytest

def test_WorkerNode_create_agent():
    # assuming llm, tools, and vectorstore are initialized properly
    worker_node = WorkerNode(llm, tools, vectorstore)
    worker_node.create_agent('test_agent', 'test_role', False, {})
    assert worker_node.agent is not None
    assert worker_node.agent.chain.verbose

def test_WorkerNode_run_agent():
    worker_node = WorkerNode(llm, tools, vectorstore)
    worker_node.create_agent('test_agent', 'test_role', False, {})
    worker_node.run_agent('test prompt')  # check it runs without error

def test_BossNode_create_task():
    # assuming llm, vectorstore, task_execution_chain are initialized properly
    boss_node = BossNode(llm, vectorstore, task_execution_chain, False, 3)
    task = boss_node.create_task('test task')
    assert task == {'objective': 'test task'}

def test_BossNode_execute_task():
    boss_node = BossNode(llm, vectorstore, task_execution_chain, False, 3)
    task = boss_node.create_task('test task')
    boss_node.execute_task(task)  # check it runs without error

You would run these tests with a testing tool such as pytest. This is just an example and does not cover all possible test cases. Ideally, your tests should be more comprehensive, and should include negative test cases as well, to check that your code handles errors correctly.

The code you have provided has quite a few interconnected components, so it would be good to design tests that examine not just the individual pieces but how well they integrate and interact. Here are three additional tests you could consider:

  1. Test that the tools in the WorkerNode are correctly instantiated and are working as expected: Since the tools are a key part of the functionality in the WorkerNode, it's important to verify they're initialized correctly. You could choose one tool to test in detail, or write a generic test that loops through all tools and verifies they're properly set up.

  2. Test that the AgentExecutor in the BossNode is correctly instantiated: This is an important component in the BossNode and it's important to make sure it's functioning correctly.

  3. Test that the LLMChain in the BossNode works as expected: This is another critical component of the BossNode, so it's worth having a test that specifically targets it.

Here is an example of what these tests could look like:

def test_WorkerNode_tools():
    worker_node = WorkerNode(llm, tools, vectorstore)
    worker_node.create_agent('test_agent', 'test_role', False, {})
    
    # Check that all tools are instantiated
    for tool in worker_node.tools:
        assert tool is not None

def test_BossNode_AgentExecutor():
    boss_node = BossNode(llm, vectorstore, task_execution_chain, False, 3)
    
    # Check that the AgentExecutor is correctly initialized
    assert boss_node.baby_agi.task_execution_chain is not None

def test_BossNode_LLMChain():
    boss_node = BossNode(llm, vectorstore, task_execution_chain, False, 3)
    
    # Check that the LLMChain in ZeroShotAgent is working
    assert boss_node.baby_agi.task_execution_chain.agent.llm_chain is not None

As before, these tests are somewhat simplistic and primarily check for existence and instantiation. Real-world testing would likely involve more complex and specific tests for functionality and error-handling.