13 Commits (7ee65269fb2e6d3e0ce234d9111d44eeb662c391)

Author SHA1 Message Date
thinhlpg 14ef79a4f5 feat: [WIP] add bench scripts
1 month ago
thinhlpg 7376f596a5 feat: add Gradio demo for DeepSearch and update configuration settings
1 month ago
thinhlpg 7ff3623102 chore: update .gitignore, modify Makefile for installation, and add pyproject.toml for project configuration
1 month ago
thinhlpg d0e6068055 fix: strengthen reward correctness logic to handle final message is not asnwer form assistant. Also update logs for reward functions for better debug
1 month ago
thinhlpg 1047e2fa1c chore: update .gitignore and requirements for unsloth versions
1 month ago
thinhlpg 83f86869f6 chore: update .gitignore and add new toys data files
1 month ago
thinhlpg d2f03b96ab feat: enhance evaluation script and remove deprecated shell script
1 month ago
thinhlpg 60233f2113 chore: update .gitignore
1 month ago
thinhlpg fd32bcacfd chores: update worklog and research progress
1 month ago
thinhlpg 3c2deaced9 refactor: restructure code base, better centralize logging logic
1 month ago
thinhlpg 7d4de89186 chore: update worklog 250324
2 months ago
thinhlpg a58722e16f feat: add initial project structure and core functionality
2 months ago
Thinh Le bf32fdd897 Initial commit
2 months ago