Image Coding for Machines (ICM) has become increasingly important with the rapid integration of computer vision technology into real-world applications. However, most neural network-based ICM frameworks operate at a fixed rate, thus requiring individual training for each target bitrate. This limitation may restrict their practical usage. Existing variable rate image compression approaches mitigate this issue but often rely on additional training, which increases computational costs and complicates deployment. Moreover, variable rate control has not been thoroughly explored for ICM. To address these challenges, we propose a training-free framework for quantization strength control that enables flexible bitrate adjustment. By exploiting the scale parameter predicted by the hyperprior network, the proposed method adaptively modulates quantization step sizes across both the channel and spatial dimensions. This allows the model to preserve semantically important regions while coarsely quantizing less critical areas. Our architectural design further enables continuous bitrate control through a single parameter. Experimental results demonstrate the effectiveness of the proposed method, which achieves up to 11.07% BD-rate savings over the non-adaptive variable rate baseline. The code is available at https://github.com/qwert-top/AQVR-ICM.
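The core idea of scale-adaptive quantization can be illustrated with a minimal sketch. Note that this is an assumption-laden illustration, not the paper's actual implementation: the function name `adaptive_quantize`, the normalization scheme, and the specific modulation formula `step = q_base * (1 + alpha * (1 - s))` are all hypothetical. It only demonstrates the general principle that elements with a large hyperprior scale (typically carrying more information) receive a finer quantization step, while low-scale elements are quantized more coarsely, and that a single scalar `alpha` sweeps the operating bitrate.

```python
import numpy as np

def adaptive_quantize(latent, scale, q_base=1.0, alpha=0.5):
    """Hypothetical sketch of scale-adaptive quantization.

    latent: latent tensor to quantize (any shape).
    scale:  hyperprior-predicted scale, same shape as `latent`.
    q_base: base quantization step size.
    alpha:  single continuous rate-control knob; larger alpha coarsens
            low-scale regions more aggressively, lowering the bitrate.
    """
    # Normalize the scale to [0, 1] so the modulation is relative,
    # not dependent on the absolute magnitude of the scale map.
    s = (scale - scale.min()) / (scale.max() - scale.min() + 1e-8)
    # Low-scale (less informative) elements get a larger step;
    # high-scale (semantically important) elements keep a fine step.
    step = q_base * (1.0 + alpha * (1.0 - s))
    quantized = np.round(latent / step) * step
    return quantized, step

# Usage: alpha=0 reduces to uniform quantization with step q_base;
# increasing alpha coarsens only the low-scale positions.
latent = np.array([[0.3, 1.7], [2.2, -0.6]])
scale = np.array([[0.1, 2.0], [1.5, 0.2]])
q0, step0 = adaptive_quantize(latent, scale, alpha=0.0)
q1, step1 = adaptive_quantize(latent, scale, alpha=1.0)
```

With this formulation a decoder only needs the (already transmitted) scale map and the scalar `alpha` to reproduce the step sizes, which is consistent with a training-free, single-parameter rate control scheme.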