You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
thinhlpg 424459d840
feat: update evaluation scripts to enhance model configuration and dataset loading, including increased max tokens and added logging
4 weeks ago
..
UnslothGRPOTrainerTemp.py feat: update model configuration (longer context) and dataset loading logic for improved performance and flexibility 4 weeks ago
__init__.py refactor: moved modules from src/deepsearch to src/ 1 month ago
agent.py feat: update model configuration (longer context) and dataset loading logic for improved performance and flexibility 4 weeks ago
embeddings.py feat: add scripts for musique data processing 4 weeks ago
evaluation.py feat: update evaluation scripts to enhance model configuration and dataset loading, including increased max tokens and added logging 4 weeks ago
prompts.py refactor: moved modules from src/deepsearch to src/ 1 month ago
rewards.py feat: update reward_em_chunk to match only the LAST required paragraph of the reasoning chain and adjust related tests 4 weeks ago
search_module.py feat: update evaluation scripts to enhance model configuration and dataset loading, including increased max tokens and added logging 4 weeks ago
tokenizer_adapter.py refactor: moved modules from src/deepsearch to src/ 1 month ago