Shang Hong Sim's picture

19 1

Shang Hong Sim

shanghong

·

https://shanghongsim.github.io/

AI & ML interests

Neural decoding, neuroengineering, signal processing

Recent Activity

upvoted an article 22 days ago

Welcome GPT OSS, the new open-source model family from OpenAI!

updated a collection about 1 month ago

updated a collection about 1 month ago

View all activity

Organizations

upvoted an article 22 days ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

+10

Aug 5

•

509

upvoted 4 papers about 2 months ago

Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics

Paper • 2510.17797 • Published Oct 20 • 10

LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published Oct 22 • 61

Establishing Best Practices for Building Rigorous Agentic Benchmarks

Paper • 2507.02825 • Published Jul 3 • 1

Towards General Agentic Intelligence via Environment Scaling

Paper • 2509.13311 • Published Sep 16 • 71

upvoted an article 4 months ago

Article

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?

Aug 9

•

11

upvoted a collection 5 months ago

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 253

upvoted 3 articles 5 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8

•

735

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024

•

303

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

+1

Mar 20, 2024

•

105

upvoted an article 9 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

•

255

upvoted a paper 10 months ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13 • 37

upvoted 3 collections 10 months ago

DeepSeek-R1

10 items • Updated 12 days ago • 821

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7 • 66

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 175

upvoted an article 10 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

Jan 28

•

887

upvoted a collection 10 months ago

Trust-Align

12 items • Updated Feb 11 • 3

upvoted a paper 11 months ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published Jan 18 • 15

upvoted a paper about 1 year ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published Nov 9, 2024 • 45