ReZero-Search-LLM-Agent-Fork

Commit Graph

Author	SHA1	Message	Date
thinhlpg	9738b80353	feat: update max generations and output length in evaluation scripts, add memory fraction to server launch	4 weeks ago
thinhlpg	14ef79a4f5	feat: [WIP] add bench scripts	1 month ago
thinhlpg	7376f596a5	feat: add Gradio demo for DeepSearch and update configuration settings	1 month ago
thinhlpg	7ff3623102	chore: update .gitignore, modify Makefile for installation, and add pyproject.toml for project configuration	1 month ago
thinhlpg	d0e6068055	fix: strengthen reward correctness logic to handle final message is not asnwer form assistant. Also update logs for reward functions for better debug - Added 'logs/' directory to .gitignore to exclude log files. - Introduced log_chat_state function to log chat states and rewards to JSONL files. - Updated reward functions to log chat states with validation results for better tracking and debugging.	1 month ago
thinhlpg	1047e2fa1c	chore: update .gitignore and requirements for unsloth versions	1 month ago
thinhlpg	83f86869f6	chore: update .gitignore and add new toys data files	1 month ago
thinhlpg	d2f03b96ab	feat: enhance evaluation script and remove deprecated shell script - Updated eval.py to streamline model evaluation using vLLM and unsloth. - Deleted eval.sh as its functionality is now integrated into eval.py. - Updated .gitignore to exclude eval_logs directory.	1 month ago
thinhlpg	60233f2113	chore: update .gitignore	1 month ago
thinhlpg	fd32bcacfd	chores: update worklog and research progress	1 month ago
thinhlpg	3c2deaced9	refactor: restructure code base, better centralize logging logic	1 month ago
thinhlpg	7d4de89186	chore: update worklog 250324 - Added `train_autodidact_1B.py` for quick test. - Update `00_worklog.md`, `dataset.md`, and `reward-functions.md` to reflect new training strategies and reward functions.	2 months ago
thinhlpg	a58722e16f	feat: add initial project structure and core functionality - Added initial files from AutoDiact as starting point - Enhanced `README.md` with project overview and setup instructions. . - Removed `ugly_code_file.py` as part of cleanup. - Added various documentation files and assets for project clarity. - Included Jupyter notebooks for training and experimentation.	2 months ago
Thinh Le	bf32fdd897	Initial commit	2 months ago

14 Commits (0b4bf54833ef7294cf2ca9309351f9cd0df136fc)