In recent years, open-source software (OSS) has become increasingly prevalent in developing software products. While OSS documentation is the primary source of information provided by the developers' community about a product, its role in the industry's adoption process has yet to be examined. We conducted semi-structured interviews and an online survey to provide insight into this area. Based on interviews and survey insights, we developed a topic model to collect relevant information from OSS documentation automatically. Additionally, according to our survey responses regarding challenges associated with OSS documentation, we propose a novel information augmentation approach, DocMentor, by combining OSS documentation corpus TF-IDF scores and ChatGPT. Through explaining technical terms and providing examples and references, our approach enhances the documentation context and improves practitioners' understanding. Our tool's effectiveness is assessed by surveying practitioners.
翻译:近年来,开源软件(OSS)在软件开发产品中日益普及。尽管OSS文档是开发者社区提供的产品主要信息来源,但其在工业界采纳过程中的作用仍有待探究。我们通过半结构化访谈和在线调查深入研究了这一领域。基于访谈和调查的见解,我们开发了一个主题模型来自动收集OSS文档中的相关信息。此外,针对调查中发现的OSS文档相关挑战,我们提出了一种新型信息增强方法——DocMentor,该方法结合了OSS文档语料库的TF-IDF分数与ChatGPT。通过解释技术术语、提供示例和参考资料,我们的方法增强了文档上下文并提升了从业者的理解能力。我们通过调查从业者评估了该工具的有效性。