二值化Mamba-Transformer：一种用于轻量化四拜耳混合事件视觉传感器去马赛克的网络 (Binarized Mamba-Transformer for Lightweight Quad Bayer HybridEVS Demosaicing)

Quad Bayer demosaicing is the central challenge for enabling the widespread application of Hybrid Event-based Vision Sensors (HybridEVS). Although existing learning-based methods that leverage long-range dependency modeling have achieved promising results, their complexity severely limits deployment on mobile devices for real-world applications. To address these limitations, we propose a lightweight Mamba-based binary neural network designed for efficient and high-performing demosaicing of HybridEVS RAW images. First, to effectively capture both global and local dependencies, we introduce a hybrid Binarized Mamba-Transformer architecture that combines the strengths of the Mamba and Swin Transformer architectures. Next, to significantly reduce computational complexity, we propose a binarized Mamba (Bi-Mamba), which binarizes all projections while retaining the core Selective Scan in full precision. Bi-Mamba also incorporates additional global visual information to enhance global context and mitigate precision loss. We conduct quantitative and qualitative experiments to demonstrate the effectiveness of BMTNet in both performance and computational efficiency, providing a lightweight demosaicing solution suited for real-world edge devices. Our codes and models are available at https://github.com/Clausy9/BMTNet.

翻译：四拜耳去马赛克是实现混合事件视觉传感器广泛应用的核心挑战。尽管现有基于学习的方法通过利用长程依赖建模已取得良好效果，但其复杂性严重限制了在实际移动设备上的部署应用。为应对这些限制，我们提出了一种基于Mamba的轻量化二值神经网络，旨在高效且高性能地完成混合事件视觉传感器RAW图像的去马赛克任务。首先，为有效捕捉全局与局部依赖关系，我们引入了一种混合的二值化Mamba-Transformer架构，该架构融合了Mamba与Swin Transformer的优势。其次，为显著降低计算复杂度，我们提出了二值化Mamba模块，该模块将所有投影操作二值化，同时保留核心的选择性扫描机制为全精度计算。Bi-Mamba还整合了额外的全局视觉信息，以增强全局上下文感知并缓解精度损失。我们通过定量与定性实验验证了BMTNet在性能与计算效率方面的有效性，为实际边缘设备提供了一种轻量化的去马赛克解决方案。我们的代码与模型已开源：https://github.com/Clausy9/BMTNet。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日