AxWin Transformer: A Context-Aware Vision Transformer Backbone with Axial Windows - 专知论文

会员服务 ·

0

Attention · 变换 · Backbone · Vision · Microsoft Windows ·

2023 年 5 月 2 日

AxWin Transformer: A Context-Aware Vision Transformer Backbone with Axial Windows

翻译：暂无翻译

Fangjian Lin,Yizhe Ma,Sitong Wu,Long Yu,Shengwei Tian

Recently Transformer has shown good performance in several vision tasks due to its powerful modeling capabilities. To reduce the quadratic complexity caused by the attention, some outstanding work restricts attention to local regions or extends axial interactions. However, these methos often lack the interaction of local and global information, balancing coarse and fine-grained information. To address this problem, we propose AxWin Attention, which models context information in both local windows and axial views. Based on the AxWin Attention, we develop a context-aware vision transformer backbone, named AxWin Transformer, which outperforming the state-of-the-art methods in both classification and downstream segmentation and detection tasks.

翻译：暂无翻译

0

相关内容

Attention

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

专知会员服务

85+阅读 · 2023年6月19日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

专知

31+阅读 · 2018年6月4日

【论文推荐】最新六篇知识图谱相关论文—Zero-shot识别、卷积二维知识图谱、变分知识图谱推理、张量分解、推荐

【论文推荐】最新六篇知识图谱相关论文—Zero-shot识别、卷积二维知识图谱、变分知识图谱推理、张量分解、推荐

专知

50+阅读 · 2018年4月25日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

Fe基块体非晶合金中异质非晶结构及纳米晶形成演变机理

国家自然科学基金

0+阅读 · 2015年12月31日

适用于无线传感器网络SOC的低功耗低成本SAR型A/D转换器设计研究

国家自然科学基金

0+阅读 · 2013年12月31日

硅基和TiC基纳米体系结构和性能预测

国家自然科学基金

0+阅读 · 2013年12月31日

熔盐堆环境下结构材料辐照损伤机制及其高温熔盐腐蚀特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Ti3AlC2增强锌基复合材料的界面结构与摩擦学特性研究

国家自然科学基金

0+阅读 · 2011年12月31日

Block-State Transformer

Arxiv

0+阅读 · 2023年6月15日

SepViT: Separable Vision Transformer

Arxiv

0+阅读 · 2023年6月15日

EDTER: Edge Detection with Transformer

Arxiv

11+阅读 · 2022年3月16日

A Survey on Vision Transformer

Arxiv

17+阅读 · 2022年2月23日

SiT: Self-supervised vIsion Transformer

Arxiv

19+阅读 · 2021年4月8日

VIP会员

文章信息

相关主题

Microsoft Windows

相关VIP内容

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

专知会员服务

85+阅读 · 2023年6月19日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

面向现代战场的特种作战无人机网络

《面向无GPS及复杂环境的鲁棒自主探索导航框架》350页

《网络化部队中的任务式指挥：近期美海军与空军条令及作战概念对任务式指挥的采纳》最新报告

《俄乌战场：俄罗斯“沙希德”-136无人机部署月度分析（2025.12）》

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

专知

31+阅读 · 2018年6月4日

【论文推荐】最新六篇知识图谱相关论文—Zero-shot识别、卷积二维知识图谱、变分知识图谱推理、张量分解、推荐

【论文推荐】最新六篇知识图谱相关论文—Zero-shot识别、卷积二维知识图谱、变分知识图谱推理、张量分解、推荐

专知

50+阅读 · 2018年4月25日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

相关论文

Block-State Transformer

Arxiv

0+阅读 · 2023年6月15日

SepViT: Separable Vision Transformer

Arxiv

0+阅读 · 2023年6月15日

EDTER: Edge Detection with Transformer

Arxiv

11+阅读 · 2022年3月16日

A Survey on Vision Transformer

Arxiv

17+阅读 · 2022年2月23日

SiT: Self-supervised vIsion Transformer

Arxiv

19+阅读 · 2021年4月8日

相关基金

Fe基块体非晶合金中异质非晶结构及纳米晶形成演变机理

国家自然科学基金

0+阅读 · 2015年12月31日

适用于无线传感器网络SOC的低功耗低成本SAR型A/D转换器设计研究

国家自然科学基金

0+阅读 · 2013年12月31日

硅基和TiC基纳米体系结构和性能预测

国家自然科学基金

0+阅读 · 2013年12月31日

熔盐堆环境下结构材料辐照损伤机制及其高温熔盐腐蚀特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Ti3AlC2增强锌基复合材料的界面结构与摩擦学特性研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员