This is the base SFT for the model; use the downstream -winton model.

SFT: https://wandb.ai/new-eden/AFM-SFT/runs/u8fj6r6o?nw=nwuserdeltavector

KTO: https://wandb.ai/new-eden/AFM-SFT/runs/fgkl4ijs?nw=nwuserdeltavector (early-stopped at around checkpoint 100)