Name | Last updated | |
---|---|---|
250324_generate_data_anatomy.ipynb | 3 months ago | |
250325_fak_you_chattemplate.ipynb | 3 months ago | |
250325_inspect_qa_dataset.ipynb | 3 months ago | |
250325_visualize_reward_function.ipynb | 3 months ago | |
250329_saving_inference.ipynb | 3 months ago | |
250331_train_grpo_r1_distil.ipynb | 3 months ago | |
250331_train_grpo_r1_distil_llama3b.ipynb | 3 months ago | |
250402_inspect_mask.ipynb | 3 months ago | |
Llama3_1_(8B)_GRPO.ipynb | 3 months ago | |
Llama_3_1_8b_2x_faster_inference.ipynb | 3 months ago | |
Qwen2_5_(3B)_GRPO.ipynb | 3 months ago | |
README.md | 3 months ago | |
train_autodidact.ipynb | 3 months ago | |