ReZero-Search-LLM-Agent-Fork/train.sh

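# Expose only GPU 0 to the training process.
export CUDA_VISIBLE_DEVICES=0
# Launch training; train_grpo.py is resolved relative to the current directory.
python train_grpo.py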