wangchenglong

wangclnlp

https://wangclnlp.github.io/wangchenglong.github.io/

Recent Activity

upvoted a paper 21 days ago

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published 22 days ago • 26

upvoted a paper 28 days ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published about 1 month ago • 125

updated a collection 3 months ago

Probing-RM

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models • 2 items • Updated Nov 20, 2025

New activity in ifnoc/MRMBench 3 months ago

Update README.md

#2 opened 3 months ago by

updated a collection 3 months ago

Probing-RM

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models • 2 items • Updated Nov 20, 2025

upvoted 3 papers 6 months ago

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 104

GRAM-R^2: Self-Training Generative Foundation Reward Models for Reward Reasoning

Paper • 2509.02492 • Published Sep 2, 2025 • 1

GRAM: A Generative Foundation Reward Model for Reward Generalization

Paper • 2506.14175 • Published Jun 17, 2025 • 1

updated a collection 6 months ago

GRAM-RR

Self-Training Generative Foundation Reward Models for Reward Reasoning • 4 items • Updated Nov 8, 2025

updated 2 models 6 months ago

wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel

Text Generation • 3B • Updated Sep 4, 2025 • 1

wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel

Text Generation • 8B • Updated Sep 4, 2025 • 4 • 2

updated a dataset 6 months ago

wangclnlp/GRAM-RR-TrainingData

Updated Sep 4, 2025 • 6

published a dataset 6 months ago

wangclnlp/GRAM-RR-TrainingData

Updated Sep 4, 2025 • 6

published 2 models 6 months ago

wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel

Text Generation • 3B • Updated Sep 4, 2025 • 1

wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel

Text Generation • 8B • Updated Sep 4, 2025 • 4 • 2

updated a collection 6 months ago

GRAM-RR

Self-Training Generative Foundation Reward Models for Reward Reasoning • 4 items • Updated Nov 8, 2025

upvoted a collection 7 months ago

GRAM

Generative Foundation Reward Models for Reward Generalization • 8 items • Updated Jun 19, 2025 • 1