Commit Graph

6 Commits (0f662d433015e43eb2c49105ad7ceb64a66fdda3)

Author SHA1 Message Date
thinhlpg 2fec4f2f42 refactor: change repo structure (move code from src/ to src/deepsearch) 2 months ago
thinhlpg 77f121662f test: add tests for reward_retry function scenarios 2 months ago
thinhlpg 3081d6e36b test: added tests for new reward functions: search strategy and search diversity 2 months ago
thinhlpg d0e6068055 fix: strengthen reward correctness logic to handle the case where the final message is not an answer from the assistant. Also update reward function logs for easier debugging 2 months ago
thinhlpg 1bd609dfae test: enhance reward correctness tests with validation logic 2 months ago
thinhlpg 3910ef343a test: add unit tests for agent, reward functions, and tokenizer adapters 3 months ago