ReZero-Search-LLM-Agent-Fork

Author	SHA1	Message	Date
thinhlpg	eebf914a81	refactor: moved modules from src/deepsearch to src/	6 months ago
thinhlpg	2fec4f2f42	refactor: change repo stucture (move code from src/ to src/deepsearch)	6 months ago
thinhlpg	77f121662f	test: add tests for reward_retry function scenarios	6 months ago
thinhlpg	3081d6e36b	test: added tests for new reward functions: search strategy and search diversity	6 months ago
thinhlpg	d0e6068055	fix: strengthen reward correctness logic to handle final message is not asnwer form assistant. Also update logs for reward functions for better debug - Added 'logs/' directory to .gitignore to exclude log files. - Introduced log_chat_state function to log chat states and rewards to JSONL files. - Updated reward functions to log chat states with validation results for better tracking and debugging.	6 months ago
thinhlpg	1bd609dfae	test: enhance reward correctness tests with validation logic - Updated test cases to include role and tag validation for assistant messages. - Ensured that only properly formatted messages with answer tags are accepted. - Added new test for validating various incorrect formats and their expected outcomes.	6 months ago
thinhlpg	133cb1ab90	test: add Qwen tokenizer adapter tests Implemented unit tests for the Qwen tokenizer adapter, including format handling, mask generation, and multi-turn conversation support	6 months ago
thinhlpg	3910ef343a	test: add unit tests for agent, reward functions, and tokenizer adapters	6 months ago