ProgramTrace

non-profit

AI & ML interests

None defined yet.

Collections 4

View 4 collections

models 8

PTPReasoning/Llama-3.1-8B-RL-Clean-V2

8B • Updated Jul 29, 2025

PTPReasoning/Llama-3.1-8B-RL-Baseline-V2

8B • Updated Jul 26, 2025

PTPReasoning/Llama-3.1-8B-SFT-Baseline

Text Generation • 8B • Updated Jul 25, 2025

PTPReasoning/Llama-3.1-8B-SFT-Clean-V2

Text Generation • 8B • Updated Jul 25, 2025

PTPReasoning/Qwen2.5-7B-Base-RL-Clean-V2

Text Generation • 8B • Updated May 3, 2025 • 1

PTPReasoning/Qwen2.5-7B-Base-RL-Baseline

Text Generation • 8B • Updated Apr 30, 2025 • 1

PTPReasoning/Qwen2.5-7B-Base-SFT-Clean-V2

Text Generation • 8B • Updated Apr 23, 2025 • 2

PTPReasoning/Qwen2.5-7B-Base-SFT-Baseline-V2

Text Generation • 8B • Updated Apr 23, 2025 • 1

datasets 12

PTPReasoning/finqa

Viewer • Updated Jul 27, 2025 • 1.15k • 67

PTPReasoning/hotpot_qa

Viewer • Updated Jul 27, 2025 • 500 • 38

PTPReasoning/PubMedQA

Viewer • Updated May 14, 2025 • 1.5k • 7

PTPReasoning/MedCalc-Bench-v1.0

Viewer • Updated May 14, 2025 • 22.5k • 7 • 2

PTPReasoning/PTP-RL-ITL-Final-Clean-V2

Viewer • Updated Apr 21, 2025 • 19k • 3

PTPReasoning/PTP-SFT-ITL-Final-Baseline-V2

Viewer • Updated Apr 21, 2025 • 4.12k • 6

PTPReasoning/PTP-SFT-ITL-Final-Clean-V2

Viewer • Updated Apr 21, 2025 • 4.21k • 6

PTPReasoning/PTP-RL-MedCalc-Bench

Viewer • Updated Apr 14, 2025 • 9.34k • 4

PTPReasoning/PTP-RL-DAPO-EN

Viewer • Updated Apr 14, 2025 • 14.1k • 4

PTPReasoning/mmlu_pro_biology

Viewer • Updated Apr 11, 2025 • 717 • 4

View 12 datasets