wxzhang
Models (14)
wxzhang/dpo-selective-buffer-spo-shift • Text Generation • 7B • Updated • 31
wxzhang/dpo-selective-redteaming • Text Generation • 7B • Updated • 9
wxzhang/dpo-selective-buffer-safeipo • Text Generation • 7B • Updated • 42
wxzhang/dpo-selective-alpaca • Text Generation • 7B • Updated • 10
wxzhang/dpo-selective-bufferdata • Text Generation • Updated • 16
wxzhang/dpo-selective-longerrun • Text Generation • 7B • Updated • 135
wxzhang/dpo-selective-mixdata • Text Generation • 7B • Updated • 8
wxzhang/zephyr-7b-dpo-full • Text Generation • 7B • Updated • 48
wxzhang/selective-pairrm-33079692-mt2 • Text Generation • 7B • Updated • 6
wxzhang/selective-pairrm-33076849-mt1 • Text Generation • 7B • Updated • 9