2 19 6

Xiaohan Xu

Tebmer

https://tebmer.github.io/

tebmer

AI & ML interests

Text-to-SQL

Recent Activity

updated a dataset 18 days ago

birdsql/mini-interact

updated a dataset 18 days ago

birdsql/bird-interact-full

updated a dataset 18 days ago

birdsql/bird-interact-lite

View all activity

Organizations

upvoted 4 papers about 2 months ago

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Paper • 2510.13809 • Published Oct 15 • 37

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28 • 173

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published May 2, 2024 • 64

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published Oct 9 • 26

upvoted 2 papers 2 months ago

BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions

Paper • 2510.05318 • Published Oct 6 • 21

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Paper • 2509.26490 • Published Sep 30 • 19

upvoted a paper 3 months ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27 • 36

upvoted a paper 5 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 79

upvoted 2 papers 6 months ago

SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications

Paper • 2506.18951 • Published Jun 23 • 21

SHARE: An SLM-based Hierarchical Action CorREction Assistant for Text-to-SQL

Paper • 2506.00391 • Published May 31 • 9

upvoted a paper 10 months ago

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27, 2024 • 144

upvoted a paper 12 months ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 63

upvoted 2 papers about 1 year ago

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization

Paper • 2411.06208 • Published Nov 9, 2024 • 21

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published Oct 31, 2024 • 63

upvoted a paper over 1 year ago

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs

Paper • 2305.03111 • Published May 4, 2023 • 11

upvoted 4 papers almost 2 years ago

Xiaohan Xu

AI & ML interests

Recent Activity

Organizations

Tebmer's activity