ReZero-Search-LLM-Agent-Fork

Commit Graph

Author	SHA1	Message	Date
thinhlpg	504f0c6c8e	feat: update reward_em_chunk to match only the LAST required paragraph of the reasoning chain and adjust related tests	8 months ago
thinhlpg	358875a035	feat: enhance reward_em_chunk function to match multiple paragraphs, add test	8 months ago
thinhlpg	eebf914a81	refactor: moved modules from src/deepsearch to src/	8 months ago
thinhlpg	2fec4f2f42	refactor: change repo stucture (move code from src/ to src/deepsearch)	8 months ago
thinhlpg	77f121662f	test: add tests for reward_retry function scenarios	8 months ago
thinhlpg	3081d6e36b	test: added tests for new reward functions: search strategy and search diversity	8 months ago
thinhlpg	d0e6068055	fix: strengthen reward correctness logic to handle final message is not asnwer form assistant. Also update logs for reward functions for better debug - Added 'logs/' directory to .gitignore to exclude log files. - Introduced log_chat_state function to log chat states and rewards to JSONL files. - Updated reward functions to log chat states with validation results for better tracking and debugging.	8 months ago
thinhlpg	1bd609dfae	test: enhance reward correctness tests with validation logic - Updated test cases to include role and tag validation for assistant messages. - Ensured that only properly formatted messages with answer tags are accepted. - Added new test for validating various incorrect formats and their expected outcomes.	8 months ago
thinhlpg	133cb1ab90	test: add Qwen tokenizer adapter tests Implemented unit tests for the Qwen tokenizer adapter, including format handling, mask generation, and multi-turn conversation support	8 months ago
thinhlpg	3910ef343a	test: add unit tests for agent, reward functions, and tokenizer adapters	8 months ago

10 Commits (504f0c6c8e137cc56806b3e84760df05bbf8e8f5)