Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Pages

Posts

ConflictScore: Measuring How Language Models Handle Conflicting Evidence

4 minute read

Published:

TL;DR: Existing “factuality/faithfulness” metrics usually ask: is the answer supported by the evidence?
ConflictScore asks a sharper question: what if the evidence set itself disagrees—and the model acts overconfident anyway?
We introduce a claim-level metric (CS-C, CS-R), a benchmark (ConflictBench), and show conflict-aware regeneration improves truthfulness on TruthfulQA.

portfolio

News Framing

Detecting frames in news headlines and analyzing framing trends surrounding US gun violence.

publications

Design Challenges for a Multi-Perspective Search Engine

Published in NAACL 2022 Findings, 2022

Designing a search engine that presents multiple perspectives on controversial topics.

Recommended citation: Sihao Chen*, Siyi Liu*, Xander Uyttendaele, Yi Zhang, William Bruno, Dan Roth. "Design Challenges for a Multi-Perspective Search Engine." NAACL 2022 Findings.
Download Paper

talks

teaching