aayush garg PRO

garg-aayush

garg-aayush

AI & ML interests

Generative modelling

Recent Activity

upvoted an article 3 days ago

Transformers v5: Simple model definitions powering the AI ecosystem

upvoted an article 3 days ago

What I Learned Building SFT from the Ground Up

published an article 3 days ago

What I Learned Building SFT from the Ground Up

View all activity

Organizations

Articles 1

Article

What I Learned Building SFT from the Ground Up

Collections 4

View 4 collections

models 45

datasets 4

garg-aayush/sft-cs336-assign5-datasets

Preview • Updated 4 days ago • 87

garg-aayush/GPT4-LLM-Cleaned-10K

Viewer • Updated May 24, 2024 • 10k • 30

garg-aayush/ultrachat-refined-100K-2048

Viewer • Updated Apr 23, 2024 • 110k • 18

garg-aayush/mini-platypus-1K

Viewer • Updated Apr 18, 2024 • 1k • 15 • 1

aayush garg PRO

AI & ML interests

Recent Activity

Organizations

Articles 1

What I Learned Building SFT from the Ground Up

Collections 4

Qwen3 Technical Report

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Training language models to follow instructions with human feedback

Proximal Policy Optimization Algorithms

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Qwen3 Technical Report

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Training language models to follow instructions with human feedback

Proximal Policy Optimization Algorithms

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

models 45

garg-aayush/llama31-8b-sft-mask

garg-aayush/llama31-8b-sft-nomask

garg-aayush/ckpt-140

garg-aayush/ckpt-100

garg-aayush/test

garg-aayush/llama-2-7b-miniplatypus-1K

garg-aayush/zephyr-7b-sft-qlora

garg-aayush/wolf_plushie

garg-aayush/vase

garg-aayush/teapot

datasets 4

garg-aayush/sft-cs336-assign5-datasets

garg-aayush/GPT4-LLM-Cleaned-10K

garg-aayush/ultrachat-refined-100K-2048

garg-aayush/mini-platypus-1K

aayush garg PRO

AI & ML interests

Recent Activity

Organizations

Articles 1

What I Learned Building SFT from the Ground Up

Collections 4

models 45 Sort: Recently updated

datasets 4 Sort: Recently updated

models 45

datasets 4