- Observation: the model hallucinates search results. Added docs about debugging and adapting to the R1 distill base model, plus notebooks on the details of making training on R1 distill work.
- Added `train_autodidact_1B.py` for quick test.
- Update `00_worklog.md`, `dataset.md`, and `reward-functions.md` to reflect new training strategies and reward functions.
- Updated `00_worklog.md` to reflect optimizations for speed and quality in dataset generation.
- Introduced new documentation files: `choosing-llm-and-prompt-101.md`, `ds-pipeline-v0.md`, and `paraphrase-prompt.md` for better clarity on LLM choices and dataset pipeline.
- Added a Jupyter notebook `250324_generate_data_anatomy.ipynb` to explore the data generation process.
- Added initial files from AutoDidact as a starting point.
- Enhanced `README.md` with a project overview and setup instructions.
- Removed `ugly_code_file.py` as part of cleanup.
- Added various documentation files and assets for project clarity.
- Included Jupyter notebooks for training and experimentation.