Brain-Diffuser: Natural scene reconstruction from fMRI signals using generative latent diffusion - 专知论文

会员服务 ·

0

fMRI · MoDELS · 潜在 · 解码 · 多峰值 ·

2023 年 3 月 9 日

Brain-Diffuser: Natural scene reconstruction from fMRI signals using generative latent diffusion

翻译：脑扩散器：基于功能磁共振信号利用生成式潜在扩散进行自然场景重建

Furkan Ozcelik,Rufin VanRullen

In neural decoding research, one of the most intriguing topics is the reconstruction of perceived natural images based on fMRI signals. Previous studies have succeeded in re-creating different aspects of the visuals, such as low-level properties (shape, texture, layout) or high-level features (category of objects, descriptive semantics of scenes) but have typically failed to reconstruct these properties together for complex scene images. Generative AI has recently made a leap forward with latent diffusion models capable of generating high-complexity images. Here, we investigate how to take advantage of this innovative technology for brain decoding. We present a two-stage scene reconstruction framework called ``Brain-Diffuser''. In the first stage, starting from fMRI signals, we reconstruct images that capture low-level properties and overall layout using a VDVAE (Very Deep Variational Autoencoder) model. In the second stage, we use the image-to-image framework of a latent diffusion model (Versatile Diffusion) conditioned on predicted multimodal (text and visual) features, to generate final reconstructed images. On the publicly available Natural Scenes Dataset benchmark, our method outperforms previous models both qualitatively and quantitatively. When applied to synthetic fMRI patterns generated from individual ROI (region-of-interest) masks, our trained model creates compelling ``ROI-optimal'' scenes consistent with neuroscientific knowledge. Thus, the proposed methodology can have an impact on both applied (e.g. brain-computer interface) and fundamental neuroscience.

翻译：在神经解码研究中，最引人入胜的课题之一是基于功能磁共振信号重建感知的自然图像。以往研究虽能成功再现视觉的不同维度，例如低层级属性（形状、纹理、布局）或高层级特征（物体类别、场景描述语义），但通常难以针对复杂场景图像同时重建这些属性。近年来，生成式AI取得突破性进展，潜在扩散模型已能生成高复杂度图像。本研究探索如何利用这一创新技术进行脑解码。我们提出名为"Brain-Diffuser"的两阶段场景重建框架：第一阶段从功能磁共振信号出发，采用VDVAE（极深变分自编码器）模型重建捕获低层级属性与整体布局的图像；第二阶段使用基于预测多模态（文本与视觉）特征条件约束的潜在扩散模型（Versatile Diffusion）图像到图像框架，生成最终重建图像。在公开的Natural Scenes Dataset基准测试中，本方法在定性和定量评估上均超越既有模型。当应用于由个体ROI（感兴趣区域）掩膜生成的合成功能磁共振模式时，训练后的模型能够创建与神经科学认知一致的"ROI最优"场景。因此，本方法对应用性（如脑机接口）和基础神经科学均具有重要影响。

0

相关内容

fMRI

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

一类稳态Schödinger-Poisson-Slater方程标准化解的研究

国家自然科学基金

1+阅读 · 2015年12月31日

LncRNA-AV310809在腹膜透析相关腹膜纤维化中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

乏氧应激上调GOLM1促进肝细胞癌恶性潜能的作用及其机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

LncRNA调控CREB对抑郁症作用机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

蛋白复合体新组分SCMC5的鉴定、功能和分子作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

伤害厌恶对道德判断的影响及其神经机制

国家自然科学基金

0+阅读 · 2013年12月31日

NiTiAl基合金中析出相的结构和强化机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

成体神经干细胞静息和激活的REST和miRNA17-92负反馈调控

国家自然科学基金

0+阅读 · 2012年12月31日

Sam68在舌癌中的分子致癌特征及其调控舌癌淋巴结转移的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

干预periostin表达对瘢痕疙瘩和正常皮肤成纤维细胞功能的影响

国家自然科学基金

0+阅读 · 2009年12月31日

Reconstructing seen images from human brain activity via guided stochastic search

Arxiv

0+阅读 · 2023年4月30日

Blended Latent Diffusion

Arxiv

0+阅读 · 2023年4月30日

Generative Diffusion Models on Graphs: Methods and Applications

Arxiv

1+阅读 · 2023年4月28日

Artificial Intelligence in Material Engineering: A review on applications of AI in Material Engineering

Arxiv

0+阅读 · 2023年4月27日

Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings

Arxiv

0+阅读 · 2023年4月27日

Physics-informed Guided Disentanglement in Generative Networks

Arxiv

0+阅读 · 2023年4月27日

Multimodal Composite Association Score: Measuring Gender Bias in Generative Multimodal Models

Arxiv

0+阅读 · 2023年4月26日

Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model

Arxiv

0+阅读 · 2023年4月24日

A Survey on Generative Diffusion Model

Arxiv

46+阅读 · 2022年9月6日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

VIP会员

文章信息

相关主题

最新内容

ECCV 2026 | MIMFlow：MIM与归一化流统一图像生成

ECCV 2026 | MIMFlow：MIM与归一化流统一图像生成

专知会员服务

6+阅读 · 6月25日

超越自回归边界：扩散模型、世界模型与SSM如何重塑代码智能

超越自回归边界：扩散模型、世界模型与SSM如何重塑代码智能

专知会员服务

5+阅读 · 6月25日

重塑决策优势：美军作战艺术与多域作战中联盟联合全域指挥控制（CJADC2）体系的融合

重塑决策优势：美军作战艺术与多域作战中联盟联合全域指挥控制（CJADC2）体系的融合

专知会员服务

7+阅读 · 6月25日

网状网络及其在军事领域的运用

网状网络及其在军事领域的运用

专知会员服务

7+阅读 · 6月25日

《意识即战场——全球安全体系中认知战的演进：乌克兰构建认知作战体系的展望》

《意识即战场——全球安全体系中认知战的演进：乌克兰构建认知作战体系的展望》

专知会员服务

7+阅读 · 6月25日

无美国参与的欧洲战争方式（万字长文）

无美国参与的欧洲战争方式（万字长文）

专知会员服务

8+阅读 · 6月25日

重构“下一场战争”的制胜理论：超越兰彻斯特方程与现代系统

重构“下一场战争”的制胜理论：超越兰彻斯特方程与现代系统

专知会员服务

9+阅读 · 6月25日

《国防工业中基于模型定义的实施：产品定义数字化转型的战略路径》90页

《国防工业中基于模型定义的实施：产品定义数字化转型的战略路径》90页

专知会员服务

9+阅读 · 6月25日

《国防领域敏感性分析白皮书》

《国防领域敏感性分析白皮书》

专知会员服务

8+阅读 · 6月25日

综述 | 从问答到任务完成：Agent系统与Harness设计

综述 | 从问答到任务完成：Agent系统与Harness设计

专知会员服务

9+阅读 · 6月24日

Agentic RL：框架、实践与长程智能体训练

Agentic RL：框架、实践与长程智能体训练

专知会员服务

10+阅读 · 6月24日

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

专知会员服务

11+阅读 · 6月24日

重新思考无人机时代的生存能力

重新思考无人机时代的生存能力

专知会员服务

10+阅读 · 6月24日

装甲突击旅：现代战争思考、战斗与组织

装甲突击旅：现代战争思考、战斗与组织

专知会员服务

7+阅读 · 6月24日

在人工智能加速决策环境中拓展OODA循环

在人工智能加速决策环境中拓展OODA循环

专知会员服务

10+阅读 · 6月24日

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

超越自回归边界：扩散模型、世界模型与SSM如何重塑代码智能

网状网络及其在军事领域的运用

ECCV 2026 | MIMFlow：MIM与归一化流统一图像生成

重塑决策优势：美军作战艺术与多域作战中联盟联合全域指挥控制（CJADC2）体系的融合

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Reconstructing seen images from human brain activity via guided stochastic search

Arxiv

0+阅读 · 2023年4月30日

Blended Latent Diffusion

Arxiv

0+阅读 · 2023年4月30日

Generative Diffusion Models on Graphs: Methods and Applications

Arxiv

1+阅读 · 2023年4月28日

Artificial Intelligence in Material Engineering: A review on applications of AI in Material Engineering

Arxiv

0+阅读 · 2023年4月27日

Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings

Arxiv

0+阅读 · 2023年4月27日

Physics-informed Guided Disentanglement in Generative Networks

Arxiv

0+阅读 · 2023年4月27日

Multimodal Composite Association Score: Measuring Gender Bias in Generative Multimodal Models

Arxiv

0+阅读 · 2023年4月26日

Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model

Arxiv

0+阅读 · 2023年4月24日

A Survey on Generative Diffusion Model

Arxiv

46+阅读 · 2022年9月6日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

相关基金

一类稳态Schödinger-Poisson-Slater方程标准化解的研究

国家自然科学基金

1+阅读 · 2015年12月31日

LncRNA-AV310809在腹膜透析相关腹膜纤维化中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

乏氧应激上调GOLM1促进肝细胞癌恶性潜能的作用及其机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

LncRNA调控CREB对抑郁症作用机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

蛋白复合体新组分SCMC5的鉴定、功能和分子作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

伤害厌恶对道德判断的影响及其神经机制

国家自然科学基金

0+阅读 · 2013年12月31日

NiTiAl基合金中析出相的结构和强化机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

成体神经干细胞静息和激活的REST和miRNA17-92负反馈调控

国家自然科学基金

0+阅读 · 2012年12月31日

Sam68在舌癌中的分子致癌特征及其调控舌癌淋巴结转移的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

干预periostin表达对瘢痕疙瘩和正常皮肤成纤维细胞功能的影响

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员