SignalLLM：一种面向自动化信号处理的通用LLM智能体框架 (SignalLLM: A General-Purpose LLM Agent Framework for Automated Signal Processing)

Modern signal processing (SP) pipelines, whether model-based or data-driven, often constrained by complex and fragmented workflow, rely heavily on expert knowledge and manual engineering, and struggle with adaptability and generalization under limited data. In contrast, Large Language Models (LLMs) offer strong reasoning capabilities, broad general-purpose knowledge, in-context learning, and cross-modal transfer abilities, positioning them as powerful tools for automating and generalizing SP workflows. Motivated by these potentials, we introduce SignalLLM, the first general-purpose LLM-based agent framework for general SP tasks. Unlike prior LLM-based SP approaches that are limited to narrow applications or tricky prompting, SignalLLM introduces a principled, modular architecture. It decomposes high-level SP goals into structured subtasks via in-context learning and domain-specific retrieval, followed by hierarchical planning through adaptive retrieval-augmented generation (RAG) and refinement; these subtasks are then executed through prompt-based reasoning, cross-modal reasoning, code synthesis, model invocation, or data-driven LLM-assisted modeling. Its generalizable design enables the flexible selection of problem solving strategies across different signal modalities, task types, and data conditions. We demonstrate the versatility and effectiveness of SignalLLM through five representative tasks in communication and sensing, such as radar target detection, human activity recognition, and text compression. Experimental results show superior performance over traditional and existing LLM-based methods, particularly in few-shot and zero-shot settings.

翻译：现代信号处理（SP）流程，无论是基于模型还是数据驱动，常受限于复杂且碎片化的工作流，严重依赖专家知识与人工工程，并在数据有限的情况下难以实现适应性与泛化性。相比之下，大型语言模型（LLMs）具备强大的推理能力、广泛的通用知识、上下文学习以及跨模态迁移能力，使其成为自动化与泛化SP工作流的有力工具。基于这些潜力，我们提出了SignalLLM，首个面向通用SP任务的基于LLM的通用智能体框架。与先前局限于狭窄应用或复杂提示的基于LLM的SP方法不同，SignalLLM引入了一种原则化、模块化的架构。它通过上下文学习和领域特定检索将高层SP目标分解为结构化子任务，随后通过自适应检索增强生成（RAG）与精炼进行分层规划；这些子任务随后通过基于提示的推理、跨模态推理、代码合成、模型调用或数据驱动的LLM辅助建模来执行。其可泛化的设计使得能够灵活选择跨不同信号模态、任务类型和数据条件的问题解决策略。我们通过在通信与感知领域的五个代表性任务（如雷达目标检测、人体活动识别和文本压缩）中展示了SignalLLM的多功能性与有效性。实验结果表明，其性能优于传统方法及现有基于LLM的方法，尤其在少样本和零样本设置下表现突出。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日