4 Commits (9738b80353c9be77e4b0ac2163274d29354dfd2a)

Author SHA1 Message Date
thinhlpg eebf914a81 refactor: moved modules from src/deepsearch to src/
1 month ago
thinhlpg 2fec4f2f42 refactor: change repo stucture (move code from src/ to src/deepsearch)
1 month ago
thinhlpg af7f38c792 feat: add code for qwen architecture
1 month ago
thinhlpg 31dcbf5d8a feat: refactor whole code base, add logic for training R1 distil base models, change some template and reward logics
1 month ago