Pretrained LLMs from scratch.
Y. Yu
PursuitOfDataScience
AI & ML interests
LLM, GPU Computing, PyTorch
Recent Activity
updated
a dataset
about 11 hours ago
PursuitOfDataScience/0.5M-thinking
updated
a dataset
about 11 hours ago
PursuitOfDataScience/0.5M-thinking
updated
a dataset
about 11 hours ago
PursuitOfDataScience/0.5M-thinking
Organizations
None yet
Sandbox Models
Trial & Error models for various tasks.
-
PursuitOfDataScience/roberta-large-ner
Token Classification • 0.4B • Updated • 7 -
PursuitOfDataScience/distilbert-base-cased-ner
Token Classification • 65.2M • Updated • 8 -
PursuitOfDataScience/bert-base-ner
Token Classification • 0.1B • Updated • 11 -
PursuitOfDataScience/t5-large-summary-model
0.7B • Updated • 4
ArgonneAI
Pretrained LLMs from scratch.
Sandbox Models
Trial & Error models for various tasks.
-
PursuitOfDataScience/roberta-large-ner
Token Classification • 0.4B • Updated • 7 -
PursuitOfDataScience/distilbert-base-cased-ner
Token Classification • 65.2M • Updated • 8 -
PursuitOfDataScience/bert-base-ner
Token Classification • 0.1B • Updated • 11 -
PursuitOfDataScience/t5-large-summary-model
0.7B • Updated • 4
models
21
PursuitOfDataScience/llama3.2-1b-thinking
Text Generation
•
1B
•
Updated
•
1
PursuitOfDataScience/llama-3-2-1b-open-r1-mot-sft
Text Generation
•
1B
•
Updated
•
3
PursuitOfDataScience/qwen2.5-0.5b-r1-dpo
Text Generation
•
0.5B
•
Updated
•
5
PursuitOfDataScience/qwen2.5-0.5b-dpo
Text Generation
•
0.5B
•
Updated
•
9
PursuitOfDataScience/qwen2.5-0.5b-open-r1-mot-cot-sft
Text Generation
•
0.5B
•
Updated
•
5
PursuitOfDataScience/llama3.2-1b-dpo
Text Generation
•
1B
•
Updated
•
3
PursuitOfDataScience/qwen2.5-0.5b-ultrachat-sft-multi-turn
0.5B
•
Updated
•
3
PursuitOfDataScience/finetuned-llama-3.2-3b-math-reasoning
3B
•
Updated
•
3
PursuitOfDataScience/finetuned-llama-3.2-3b-dpo
Text Generation
•
3B
•
Updated
•
2
PursuitOfDataScience/Qwen2.5-1.5B-Instruct-Lora-Deepseek-R1
2B
•
Updated
•
6
datasets
41
PursuitOfDataScience/0.5M-thinking
Viewer
•
Updated
•
404k
•
40
PursuitOfDataScience/MiniMax-M2.1-Mixture-of-Thoughts
Viewer
•
Updated
•
349k
•
134
•
1
PursuitOfDataScience/gsm8k-thinking
Viewer
•
Updated
•
8.79k
•
5
PursuitOfDataScience/bbc-news-llama4-maverick-summary
Viewer
•
Updated
•
174k
•
25
PursuitOfDataScience/govreport-llama4-maverick-summary
Viewer
•
Updated
•
19.5k
•
29
•
1
PursuitOfDataScience/arxiv-llama4-maverick-abstract
Viewer
•
Updated
•
198k
•
64
PursuitOfDataScience/xsum-llama4-maverick-summary
Viewer
•
Updated
•
227k
•
17
PursuitOfDataScience/cnn-dailymail-llama4-maverick-summary
Viewer
•
Updated
•
312k
•
25
PursuitOfDataScience/earnings-call-llama4-maverick-summary
Viewer
•
Updated
•
191k
•
56
PursuitOfDataScience/mistral-awesome-chatgpt-prompts
Viewer
•
Updated
•
203
•
17