ReZero-Search-LLM-Agent-Fork

Author	SHA1	Message	Date
thinhlpg	504f0c6c8e	feat: update reward_em_chunk to match only the LAST required paragraph of the reasoning chain and adjust related tests	3 months ago
thinhlpg	358875a035	feat: enhance reward_em_chunk function to match multiple paragraphs, add test	3 months ago
thinhlpg	eebf914a81	refactor: moved modules from src/deepsearch to src/	3 months ago
thinhlpg	2fec4f2f42	refactor: change repo structure (move code from src/ to src/deepsearch)	3 months ago
thinhlpg	77f121662f	test: add tests for reward_retry function scenarios	3 months ago
thinhlpg	3081d6e36b	test: added tests for new reward functions: search strategy and search diversity	3 months ago
thinhlpg	d0e6068055	fix: strengthen reward correctness logic to handle the case where the final message is not an answer from the assistant. Also update logging in reward functions for easier debugging - Added 'logs/' directory to .gitignore to exclude log files. - Introduced log_chat_state function to log chat states and rewards to JSONL files. - Updated reward functions to log chat states with validation results for better tracking and debugging.	3 months ago
thinhlpg	1bd609dfae	test: enhance reward correctness tests with validation logic - Updated test cases to include role and tag validation for assistant messages. - Ensured that only properly formatted messages with answer tags are accepted. - Added new test for validating various incorrect formats and their expected outcomes.	3 months ago
thinhlpg	133cb1ab90	test: add Qwen tokenizer adapter tests - Implemented unit tests for the Qwen tokenizer adapter, including format handling, mask generation, and multi-turn conversation support.	3 months ago
thinhlpg	3910ef343a	test: add unit tests for agent, reward functions, and tokenizer adapters	3 months ago

10 Commits (7cd4d18ee6b0d475efd9c7e745dab5ff1508da68)