Bridging HCI and AI Research for the Evaluation of Conversational SE Assistants

Authors: ...
11th Feb 2025
Posted by Alumni
April 17, 2025

Software Engineering

As Large Language Models (LLMs) are increasingly adopted in software engineering (SE), most recently in the form of conversational assistants, ensuring that these technologies align with developers' needs is essential. Traditional human-centered methods for evaluating LLM-based tools have limitations at scale, which raises the need for automatic evaluation. In this paper, we advocate combining insights from human-computer interaction (HCI) and artificial intelligence (AI) research to enable human-centered automatic evaluation of LLM-based conversational SE assistants. We identify requirements for such evaluation and the challenges ahead, working towards a framework that ensures these assistants are designed and deployed in line with user needs.