In recent advancements within the domain of Large Language Models (LLMs), there has been a notable emergence of agents capable of addressing Robotic Process Automation (RPA) challenges through enhanced cognitive capabilities and sophisticated reasoning. This development heralds a new era of scalability and human-like adaptability in goal attainment. In this context, we introduce AUTONODE (Autonomous User-interface Transformation through Online Neuro-graphic Operations and Deep Exploration). AUTONODE employs advanced neuro-graphical techniques to facilitate autonomous navigation and task execution on web interfaces, thereby obviating the necessity for predefined scripts or manual intervention. Our engine empowers agents to comprehend and implement complex workflows, adapting to dynamic web environments with unparalleled efficiency. Our methodology synergizes cognitive functionalities with robotic automation, endowing AUTONODE with the ability to learn from experience. We have integrated an exploratory module, DoRA (Discovery and mapping Operation for graph Retrieval Agent), which is instrumental in constructing a knowledge graph that the engine utilizes to optimize its actions and achieve objectives with minimal supervision. The versatility and efficacy of AUTONODE are demonstrated through a series of experiments, highlighting its proficiency in managing a diverse array of web-based tasks, ranging from data extraction to transaction processing.
翻译:在大型语言模型(LLMs)领域的最新进展中,显著涌现出能够通过增强的认知能力和复杂推理来应对机器人流程自动化(RPA)挑战的智能体。这一发展标志着在目标实现方面实现了可扩展性与类人适应性的新时代。在此背景下,我们提出AUTONODE(自主用户界面转换通过在线神经图形操作与深度探索)。AUTONODE采用先进的神经图形技术,促进在网页界面上的自主导航与任务执行,从而消除了对预定义脚本或人工干预的需求。我们的引擎使智能体能够理解并实施复杂工作流,以前所未有的效率适应动态网络环境。我们的方法将认知功能与机器人自动化协同结合,赋予AUTONODE从经验中学习的能力。我们整合了一个探索性模块DoRA(面向图检索智能体的发现与映射操作),该模块在构建知识图谱中发挥关键作用,引擎利用该知识图谱优化其行动并在最小监督下达成目标。通过一系列实验,AUTONODE的多样性与有效性得到验证,突出了其在管理从数据提取到事务处理等各类网页任务中的熟练能力。