This is the base SFT for the model. Use the downstream -winton model.

SFT: https://wandb.ai/new-eden/AFM-SFT/runs/u8fj6r6o?nw=nwuserdeltavector

KTO: https://wandb.ai/new-eden/AFM-SFT/runs/fgkl4ijs?nw=nwuserdeltavector (early-stopped at around checkpoint 100)