Large Language Models (LLMs) are increasingly embedded into software products across diverse industries, enhancing user experiences, but at the same time introducing numerous challenges for developers. Unique characteristics of LLMs force developers, who are accustomed to traditional software development and evaluation, out of their comfort zones as the LLM components shatter standard assumptions about software systems. This study explores the emerging solutions that software developers are adopting to navigate the encountered challenges. Leveraging a mixed-method research, including 26 interviews and a survey with 332 responses, the study identifies 19 emerging solutions regarding quality assurance that practitioners across several product teams at Microsoft are exploring. The findings provide valuable insights that can guide the development and evaluation of LLM-based products more broadly in the face of these challenges.
翻译:大语言模型正日益广泛地嵌入各行业软件产品中,在提升用户体验的同时,也为开发者带来了诸多挑战。LLM组件打破了关于软件系统的传统假设,其独特特性迫使习惯于传统软件开发与评估模式的开发者走出舒适区。本研究通过混合研究方法(包括26次访谈及332份问卷调查),系统探讨了软件开发者为应对这些挑战所采用的新兴解决方案。研究识别出微软多个产品团队正在探索的19项关于质量保证的新兴解决方案。这些发现为更广泛地指导基于LLM产品的开发与评估提供了重要见解,有助于应对当前面临的挑战。