7 Commits (bec864038ba4efb6a6dd771bb6604c9543cb46ac)

Author SHA1 Message Date
thinhlpg bec864038b feat: increase max tokens and new tokens in evaluation scripts
4 weeks ago
thinhlpg 424459d840 feat: update evaluation scripts to enhance model configuration and dataset loading, including increased max tokens and added logging
4 weeks ago
thinhlpg eebf914a81 refactor: moved modules from src/deepsearch to src/
1 month ago
thinhlpg 2fec4f2f42 refactor: change repo structure (move code from src/ to src/deepsearch)
1 month ago
thinhlpg 1a18cd7bfd feat: update training and evaluation configurations (editable agent generation scripts)
1 month ago
thinhlpg 6d994feeb2 feat: enhance evaluation scripts for base and LoRA models
1 month ago
thinhlpg 31dcbf5d8a feat: refactor whole code base, add logic for training R1 distil base models, change some template and reward logics
1 month ago