Recent advances in deep learning methods for natural language processing (NLP) have created new business opportunities and made NLP research critical for industry development. As one of the big players in the field of NLP, together with governments and universities, it is important to track the influence of industry on research. In this study, we seek to quantify and characterize industry presence in the NLP community over time. Using a corpus with comprehensive metadata of 78,187 NLP publications and 701 resumes of NLP publication authors, we explore the industry presence in the field since the early 90s. We find that industry presence among NLP authors has been steady before a steep increase over the past five years (180% growth from 2017 to 2022). A few companies account for most of the publications and provide funding to academic researchers through grants and internships. Our study shows that the presence and impact of the industry on natural language processing research are significant and fast-growing. This work calls for increased transparency of industry influence in the field.
翻译:深度学习在自然语言处理领域的最新进展创造了新的商业机会,并使NLP研究对产业发展至关重要。作为NLP领域的主要参与者之一,与政府和大学共同推动该领域发展,追踪产业界对研究的影响具有重要意义。本研究旨在量化和描述产业界在NLP学术界随时间推移的存在状况。通过使用包含78,187篇NLP出版物完整元数据的语料库以及701位NLP出版物作者的履历,我们探索了自90年代初以来产业界在该领域的存在情况。研究发现,NLP作者中的产业界参与度在过去五年急剧增长之前保持稳定(2017年至2022年间增长180%)。少数公司贡献了大部分出版物,并通过资助和实习项目为学术研究者提供支持。研究表明,产业界对自然语言处理研究的存在和影响显著且快速增长。这项工作呼吁提高该领域产业界影响的透明度。