Auditory spatial attention detection (ASAD) aims to decode the attended spatial location with EEG in a multiple-speaker setting. ASAD methods are inspired by the brain lateralization of cortical neural responses during the processing of auditory spatial attention, and show promising performance for the task of auditory attention decoding (AAD) with neural recordings. In the previous ASAD methods, the spatial distribution of EEG electrodes is not fully exploited, which may limit the performance of these methods. In the present work, by transforming the original EEG channels into a two-dimensional (2D) spatial topological map, the EEG data is transformed into a three-dimensional (3D) arrangement containing spatial-temporal information. And then a 3D deep convolutional neural network (DenseNet-3D) is used to extract temporal and spatial features of the neural representation for the attended locations. The results show that the proposed method achieves higher decoding accuracy than the state-of-the-art (SOTA) method (94.3% compared to XANet's 90.6%) with 1-second decision window for the widely used KULeuven (KUL) dataset, and the code to implement our work is available on Github: https://github.com/xuxiran/ASAD_DenseNet
翻译:听觉空间注意力检测(ASAD)旨在多说话人场景下通过脑电图(EEG)解码注意的空间位置。ASAD方法受听觉空间注意力处理过程中大脑皮层神经响应偏侧化的启发,在基于神经记录的听觉注意力解码(AAD)任务中展现出良好性能。以往ASAD方法未能充分利用EEG电极的空间分布特性,这可能限制了其性能表现。本研究通过将原始EEG通道转换为二维空间拓扑图,将EEG数据重构为包含时空信息的三维结构,进而采用三维深度卷积神经网络(DenseNet-3D)提取注意位置神经表征的时空特征。实验结果表明,在广泛使用的KULeuven(KUL)数据集上,采用1秒决策窗口时,所提方法的解码准确率(94.3%)优于当前最优方法(XANet的90.6%)。实现本研究的代码已开源至Github:https://github.com/xuxiran/ASAD_DenseNet