|
|
8 months ago | |
|---|---|---|
| .. | ||
| 250324_generate_data_anatomy.ipynb | 8 months ago | |
| 250325_fak_you_chattemplate.ipynb | 8 months ago | |
| 250325_inspect_qa_dataset.ipynb | 8 months ago | |
| 250325_visualize_reward_function.ipynb | 8 months ago | |
| 250329_saving_inference.ipynb | 8 months ago | |
| 250331_train_grpo_r1_distil.ipynb | 8 months ago | |
| 250331_train_grpo_r1_distil_llama3b.ipynb | 8 months ago | |
| 250402_inspect_mask.ipynb | 8 months ago | |
| 250407_cook_vllm_sglang.ipynb | 8 months ago | |
| 250408_cook_gradio_agent_demo.ipynb | 8 months ago | |
| 250408_cook_search_api.ipynb | 8 months ago | |
| 250410_cook_better_data.ipynb | 8 months ago | |
| Llama3_1_(8B)_GRPO.ipynb | 8 months ago | |
| Llama_3_1_8b_2x_faster_inference.ipynb | 8 months ago | |
| Qwen2_5_(3B)_GRPO.ipynb | 8 months ago | |
| README.md | 8 months ago | |
| train_autodidact.ipynb | 8 months ago | |