9 Commits (04d56325bb1ec7c911adddf6059173e9f09249f2)
 

Author SHA1 Message Date
thinhlpg 04d56325bb feat: add new reward functions, add less dumb data generation logic, implement better logging
2 months ago
thinhlpg b22b02ea1d feat: changed `<reasoning>` tags to `<think>`
2 months ago
thinhlpg 7d4de89186 chore: update worklog 250324
2 months ago
thinhlpg 1bdee261b6 feat: add draft data generation and documentation
2 months ago
thinhlpg f19354a8c9 chore: clean up notebook output
2 months ago
thinhlpg f60ab499eb chore: update worklog
2 months ago
thinhlpg a58722e16f feat: add initial project structure and core functionality
2 months ago
Thinh Le 91c2476c28 chore: initial commit - the ugliest code i've ever written 💀
2 months ago
Thinh Le bf32fdd897 Initial commit
2 months ago