Handwriting decoding as a challenging motor task for EEG Foundation Models

Recent attempts at creating Foundation Models (FMs) for Electroencephalography (EEG) have achieved state-of-the-art (SOTA) performance on multiple tasks, including Motor Imagery (MI). These MI tasks have typically involved coarse classification between imagined limb movements. However, the development of foundation models requires diverse datasets, both for pretraining and for evaluating the progress of these models. In this work, we propose handwriting decoding as a challenging motor task for FMs. We show that several existing datasets are potentially confounded, and introduce a dataset that evaluates models more rigorously. On this dataset, we find that current FMs, despite showing SOTA performance on multiple MI datasets, are outperformed by smaller task-specific models. We also highlight challenges specific to EEG-based handwriting decoding to inform future work. In our 4-letter classification task, we show that (a) knowledge of movement onset is crucial to the decoding performance reported in prior works, with average performance across subjects dropping from $41.3\%$ to $32.4\%$ without it; (b) increasing test-time signal quality provides significant performance improvements ($45\%$ to $78\%$ in our best subject) compared to scaling training data with single-trial EEG; and (c) while scaling training data steadily improves decoding performance, existing FMs do not outperform specialist models in handwriting decoding. We make our code available at https://anonymous.4open.science/r/EEG-Handwriting-BCI-DFCD/
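To put the reported accuracies in context, it can help to express them relative to chance. The sketch below is illustrative arithmetic only, not part of the released code: it assumes the 4-letter task has balanced classes (so chance is $25\%$), and the function name `above_chance` is hypothetical.

```python
def above_chance(accuracy_pct: float, n_classes: int = 4) -> float:
    """Return accuracy minus chance level, in percentage points.

    Assumes balanced classes, so chance is 100 / n_classes.
    This balance is an assumption; the abstract does not state it.
    """
    chance_pct = 100.0 / n_classes
    return accuracy_pct - chance_pct


# Average accuracies across subjects, as reported in the abstract.
with_onset = above_chance(41.3)      # movement onset known
without_onset = above_chance(32.4)   # movement onset not known

# Fraction of the above-chance margin that remains without onset knowledge.
retained = without_onset / with_onset

print(f"above chance with onset:    {with_onset:.1f} points")
print(f"above chance without onset: {without_onset:.1f} points")
print(f"margin retained:            {retained:.0%}")
```

Under this assumption, the drop from $41.3\%$ to $32.4\%$ removes more than half of the margin over chance, which is one way to read the claim that onset knowledge is crucial.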

