Large Language Models (LLMs) demonstrate ever-increasing abilities in mathematical and algorithmic tasks, yet their geometric reasoning skills are underexplored. We investigate LLMs' abilities in constructive geometric problem-solving one of the most fundamental steps in the development of human mathematical reasoning. Our work reveals notable challenges that the state-of-the-art LLMs face in this domain despite many successes in similar areas. LLMs exhibit biases in target variable selection and struggle with 2D spatial relationships, often misrepresenting and hallucinating objects and their placements. To this end, we introduce a framework that formulates an LLMs-based multi-agents system that enhances their existing reasoning potential by conducting an internal dialogue. This work underscores LLMs' current limitations in geometric reasoning and improves geometric reasoning capabilities through self-correction, collaboration, and diverse role specializations.
翻译:大型语言模型(LLMs)在数学和算法任务中展现出日益增强的能力,但其几何推理技能仍未被充分探索。我们研究了LLMs在构造性几何问题求解中的能力,这是人类数学推理发展中最基础的环节之一。我们的工作揭示了最先进的LLMs在这一领域中存在的显著挑战,尽管其在类似领域取得了诸多成功。LLMs在目标变量选择上表现出偏见,并且在处理二维空间关系时存在困难,常常错误表征和幻觉化物体及其位置。为此,我们引入了一个框架,通过构建基于LLMs的多智能体系统,利用内部对话来增强其现有的推理潜力。这项工作强调了LLMs目前在几何推理中的局限性,并通过自我纠正、协作和多样化角色分工提升了其几何推理能力。