[EMNLP'25] A Benchmark for Assessing VLM Safety with Real-World Memes
DongGeon Lee
oneonlee
AI & ML interests
Data-centric natural language processing, AI Safety
Recent Activity
upvoted a collection 3 days ago
COMPASS
authored a paper 3 days ago
Everyday Physics in Korean Contexts: A Culturally Grounded Physical Reasoning Benchmark
authored a paper 3 days ago
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures