"Task-relevant autoencoding" enhances machine learning for human neuroscience

Seyedmehdi Orouji,Vincent Taschereau-Dumouchel,Aurelio Cortese,Brian Odegaard,Cody Cushing,Mouslim Cherkaoui,Mitsuo Kawato,Hakwan Lau,Megan A. K. Peters

from arxiv, 41 pages, 11 figures, 5 tables including supplemental material

In human neuroscience, machine learning can help reveal lower-dimensional neural representations relevant to subjects' behavior. However, state-of-the-art models typically require large datasets to train, so are prone to overfitting on human neuroimaging data that often possess few samples but many input dimensions. Here, we capitalized on the fact that the features we seek in human neuroscience are precisely those relevant to subjects' behavior. We thus developed a Task-Relevant Autoencoder via Classifier Enhancement (TRACE), and tested its ability to extract behaviorally-relevant, separable representations compared to a standard autoencoder, a variational autoencoder, and principal component analysis for two severely truncated machine learning datasets. We then evaluated all models on fMRI data from 59 subjects who observed animals and objects. TRACE outperformed all models nearly unilaterally, showing up to 12% increased classification accuracy and up to 56% improvement in discovering "cleaner", task-relevant representations. These results showcase TRACE's potential for a wide variety of data related to human behavior.

翻译：在人类神经科学中，机器学习有助于揭示与受试者行为相关的低维神经表征。然而，当前最先进的模型通常需要大规模数据集进行训练，因此在面对样本数量少但输入维度高的人类神经影像数据时容易过拟合。在此，我们利用了人类神经科学中寻求的特征恰好与受试者行为相关这一事实，进而开发了一种通过分类器增强的任务相关自编码器（TRACE），并与标准自编码器、变分自编码器及主成分分析进行对比，测试其在两个严重截断的机器学习数据集上提取行为相关、可分表征的能力。随后，我们使用59名观察动物和物件的受试者的功能磁共振成像数据对所有模型进行了评估。TRACE几乎在所有方面均优于其他模型，分类准确率提升高达12%，在发现"更清晰"的任务相关表征方面改进幅度达56%。这些结果展示了TRACE在涉及人类行为的各类数据中的巨大潜力。

相关内容

Machine Learning

关注 2251

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

Nat. Biotechnol. | 机器学习为生物库驱动的药物发现提供动力

专知会员服务

11+阅读 · 2022年9月12日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

【AI应用】Facebook-利用神经网络求解高等数学方程, Using neural networks to solve advanced mathematics equations

专知会员服务

34+阅读 · 2020年1月15日