Katherine Tieu

kthrn22

https://kthrn22.github.io

AI & ML interests

LLMs, Agents, RL, Multimodal Learning, GNNs

Recent Activity

upvoted a paper 14 days ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published 17 days ago • 190

upvoted a paper 21 days ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published 22 days ago • 149

upvoted 3 papers about 1 month ago

LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding

Paper • 2512.16229 • Published Dec 18, 2025 • 16

INTELLECT-3: Technical Report

Paper • 2512.16144 • Published Dec 18, 2025 • 20

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published Dec 22, 2025 • 64

upvoted 2 papers about 2 months ago

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10, 2025 • 17

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29, 2025 • 48

upvoted 3 papers 2 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published Nov 25, 2025 • 121

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 43

upvoted a paper 3 months ago

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published Nov 11, 2025 • 34

updated a model 3 months ago

kthrn22/anlp-hw2-outputs

Updated Nov 12, 2025

published a model 3 months ago

kthrn22/anlp-hw2-outputs

Updated Nov 12, 2025

liked a Space 3 months ago

The Smol Training Playbook 📚 • 2.95k

The secrets to building world-class LLMs

upvoted a paper 3 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 122

upvoted an article 3 months ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Aug 26, 2024

updated a model 4 months ago

kthrn22/adv-nlp-hw1-kt42

Text Classification • 22.7M • Updated Oct 14, 2025 • 2

published a model 4 months ago

kthrn22/adv-nlp-hw1-kt42

Text Classification • 22.7M • Updated Oct 14, 2025 • 2

upvoted 2 papers 4 months ago

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published Oct 13, 2025 • 32

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190