ConflictScore: Measuring How Language Models Handle Conflicting Evidence

Published in In submission, 2026

Recommended citation: Siyi Liu, Patrick Xia, et al. "ConflictScore: Measuring How Language Models Handle Conflicting Evidence." In submission.
Download Paper