arxiv:2410.01769
Zhenting Qi PRO
zhenting
AI & ML interests
None yet
Organizations
models 125
zhenting/evolm-4B-320BT-cpt-MixedFW8FM42-sftep1-sampled500k_first100k_qwen7b-rlep8-last100k
4B • Updated
zhenting/evolm-4B-160BT-cpt-MixedFW8FM42-sftep32-sampled500k_first100k_qwen7b-rlep8-last100k
4B • Updated
• 1
zhenting/evolm-4B-160BT-cpt-MixedFW8FM42-sftep16-sampled500k_first100k_qwen7b-rlep8-last100k
4B • Updated
zhenting/evolm-4B-160BT-cpt-MixedFW8FM42-sftep8-sampled500k_first100k_qwen7b-rlep8-last100k
4B • Updated
• 1
zhenting/evolm-4B-160BT-cpt-MixedFW8FM42-sftep2-sampled500k_first100k_qwen7b-rlep8-last100k
4B • Updated
zhenting/evolm-4B-160BT-cpt-MixedFW8FM42-sftep1-sampled500k_first400k_qwen7b-rlep8-last100k
4B • Updated
zhenting/evolm-4B-160BT-cpt-MixedFW8FM42-sftep1-sampled500k_first200k_qwen7b-rlep8-last100k
4B • Updated
zhenting/evolm-4B-160BT-cpt-MixedFW8FM42-sftep1-sampled500k_first100k_qwen7b-rlep32-last100k
4B • Updated
• 2
zhenting/evolm-4B-160BT-cpt-MixedFW8FM42-sftep1-sampled500k_first100k_qwen7b-rlep8-last400k
4B • Updated
• 1
zhenting/evolm-4B-160BT-cpt-MixedFW8FM42-sftep1-sampled500k_first100k_qwen7b-rlep8-last300k
4B • Updated
datasets 0
None public yet