LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA Paper • 2510.13494 • Published Oct 15, 2025 • 2
LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA Paper • 2510.13494 • Published Oct 15, 2025 • 2
Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering Paper • 2503.14996 • Published Mar 19, 2025 • 3