ESGReveal is an innovative method proposed for efficiently extracting and analyzing Environmental, Social, and Governance (ESG) data from corporate reports, catering to the critical need for reliable ESG information retrieval. This approach utilizes Large Language Models (LLM) enhanced with Retrieval Augmented Generation (RAG) techniques. The ESGReveal system includes an ESG metadata module for targeted queries, a preprocessing module for assembling databases, and an LLM agent for data extraction. Its efficacy was appraised using ESG reports from 166 companies across various sectors listed on the Hong Kong Stock Exchange in 2022, ensuring comprehensive industry and market capitalization representation. Utilizing ESGReveal unearthed significant insights into ESG reporting with GPT-4, demonstrating an accuracy of 76.9% in data extraction and 83.7% in disclosure analysis, which is an improvement over baseline models. This highlights the framework's capacity to refine ESG data analysis precision. Moreover, it revealed a demand for reinforced ESG disclosures, with environmental and social data disclosures standing at 69.5% and 57.2%, respectively, suggesting a pursuit for more corporate transparency. While current iterations of ESGReveal do not process pictorial information, a functionality intended for future enhancement, the study calls for continued research to further develop and compare the analytical capabilities of various LLMs. In summary, ESGReveal is a stride forward in ESG data processing, offering stakeholders a sophisticated tool to better evaluate and advance corporate sustainability efforts. Its evolution is promising in promoting transparency in corporate reporting and aligning with broader sustainable development aims.
翻译:ESGReveal是一种创新方法,旨在从企业报告中高效提取和分析环境、社会及治理(ESG)数据,以满足对可靠ESG信息检索的迫切需求。该方法利用结合检索增强生成(RAG)技术的大语言模型(LLM)实现。ESGReveal系统包含用于定向查询的ESG元数据模块、用于构建数据库的预处理模块以及用于数据提取的LLM智能体。其效能通过2022年香港证券交易所上市、涵盖各行业和市值的166家公司ESG报告进行了评估。利用ESGReveal结合GPT-4,揭示了ESG报告中的关键洞见,数据提取准确率达76.9%,披露分析准确率达83.7%,优于基线模型,凸显了该框架提升ESG数据分析精度的能力。此外,研究显示对强化ESG披露的需求,其中环境数据和社会数据披露率分别为69.5%和57.2%,表明企业透明度有待提升。尽管当前版ESGReveal无法处理图像信息(该功能拟于后续版本增强),但本研究呼吁持续探索,以进一步开发并比较不同LLM的分析能力。总之,ESGReveal是ESG数据处理领域的一大进步,为利益相关者提供了更优评估和推进企业可持续发展的精密工具,其演进有望提升企业报告透明度,并与更广泛的可持续发展目标相契合。