Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance

Zhou, Kaitlyn; Hwang, Jena D.; Ren, Xiang; Dziri, Nouha; Jurafsky, Dan; Sap, Maarten

Computer Science > Computation and Language

arXiv:2407.07950 (cs)

[Submitted on 10 Jul 2024 (v1), last revised 3 Oct 2024 (this version, v2)]

Title:Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance

Authors:Kaitlyn Zhou, Jena D. Hwang, Xiang Ren, Nouha Dziri, Dan Jurafsky, Maarten Sap

View PDF HTML (experimental)

Abstract:The ability to communicate uncertainty, risk, and limitation is crucial for the safety of large language models. However, current evaluations of these abilities rely on simple calibration, asking whether the language generated by the model matches appropriate probabilities. Instead, evaluation of this aspect of LLM communication should focus on the behaviors of their human interlocutors: how much do they rely on what the LLM says? Here we introduce an interaction-centered evaluation framework called Rel-A.I. (pronounced "rely"}) that measures whether humans rely on LLM generations. We use this framework to study how reliance is affected by contextual features of the interaction (e.g, the knowledge domain that is being discussed), or the use of greetings communicating warmth or competence (e.g., "I'm happy to help!"). We find that contextual characteristics significantly affect human reliance behavior. For example, people rely 10% more on LMs when responding to questions involving calculations and rely 30% more on LMs that are perceived as more competent. Our results show that calibration and language quality alone are insufficient in evaluating the risks of human-LM interactions, and illustrate the need to consider features of the interactional context.

Comments:	Preprint
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2407.07950 [cs.CL]
	(or arXiv:2407.07950v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.07950

Submission history

From: Kaitlyn Zhou [view email]
[v1] Wed, 10 Jul 2024 18:00:05 UTC (9,052 KB)
[v2] Thu, 3 Oct 2024 16:54:59 UTC (9,638 KB)

Computer Science > Computation and Language

Title:Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators