Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Visual Haystacks

community
https://visual-haystacks.github.io/
visual-haystacks
Activity Feed

AI & ML interests

None defined yet.

Patrick (Tsung-Han) Wu's profile picture Rohan Gulati's profile picture

tsunghanwu 
authored a paper 3 months ago

Are Large Reasoning Models Interruptible?

Paper • 2510.11713 • Published Oct 13, 2025 • 4
tsunghanwu 
authored 2 papers 7 months ago

Search Arena: Analyzing Search-Augmented LLMs

Paper • 2506.05334 • Published Jun 5, 2025 • 17

Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint

Paper • 2505.23759 • Published May 29, 2025 • 5
tsunghanwu 
authored a paper 8 months ago

LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery

Paper • 2505.02829 • Published May 5, 2025
tsunghanwu 
authored 5 papers 9 months ago

Self-correcting LLM-controlled Diffusion Models

Paper • 2311.16090 • Published Nov 27, 2023 • 1

See, Say, and Segment: Teaching LMMs to Overcome False Premises

Paper • 2312.08366 • Published Dec 13, 2023

Visual Haystacks: Answering Harder Questions About Sets of Images

Paper • 2407.13766 • Published Jul 18, 2024 • 2

CLAIR-A: Leveraging Large Language Models to Judge Audio Captions

Paper • 2409.12962 • Published Sep 19, 2024 • 2

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17, 2025 • 39
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs