You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
thinhlpg 1a18cd7bfd
feat: update training and evaluation configurations (editable agent generation scripts)
1 month ago
..
UnslothGRPOTrainerTemp.py chore: disable logging, enable torch complie 1 month ago
__init__.py feat: enhance evaluation script and remove deprecated shell script 1 month ago
agent.py feat: enhance evaluation script and remove deprecated shell script 1 month ago
config.py feat: refactor whole code base, add logic for training R1 distil base models, change some template and reward logics 1 month ago
embeddings.py refactor: restructure code base, better centralize logging logic 1 month ago
evaluation.py feat: update training and evaluation configurations (editable agent generation scripts) 1 month ago
prompts.py feat: refine user prompt logic for improved clarity and structure 1 month ago
rewards.py feat: enhance reward_retry function to handle missing answer tags 1 month ago
search_module.py feat: refactor whole code base, add logic for training R1 distil base models, change some template and reward logics 1 month ago
tokenizer_adapter.py feat: add code for qwen architecture 1 month ago