Large language models (LLMs) have demonstrated significant potential in the realm of natural language understanding and programming code processing tasks. Their capacity to comprehend and generate human-like code has spurred research into harnessing LLMs for code analysis purposes. However, the existing body of literature falls short in delivering a systematic evaluation and assessment of LLMs' effectiveness in code analysis, particularly in the context of obfuscated code. This paper seeks to bridge this gap by offering a comprehensive evaluation of LLMs' capabilities in performing code analysis tasks. Additionally, it presents real-world case studies that employ LLMs for code analysis. Our findings indicate that LLMs can indeed serve as valuable tools for automating code analysis, albeit with certain limitations. Through meticulous exploration, this research contributes to a deeper understanding of the potential and constraints associated with utilizing LLMs in code analysis, paving the way for enhanced applications in this critical domain.
翻译:大型语言模型(LLM)在自然语言理解与编程代码处理任务中展现出显著潜力。其理解与生成类人代码的能力,催生了利用LLM进行代码分析的相关研究。然而,现有文献缺乏对LLM在代码分析中有效性的系统评估,尤其在混淆代码场景下。本文通过全面评估LLM执行代码分析任务的能力,并辅以运用LLM进行代码分析的真实案例研究,旨在弥合这一空白。研究结果表明,尽管存在一定局限,LLM确实可作为自动化代码分析的有效工具。通过严谨探索,本研究深化了对LLM在代码分析中潜力与约束的理解,为该关键领域的优化应用铺平了道路。