保护您的视频内容：破坏基于视频的LLM自动标注 (Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations)

Recently, video-based large language models (video-based LLMs) have achieved impressive performance across various video comprehension tasks. However, this rapid advancement raises significant privacy and security concerns, particularly regarding the unauthorized use of personal video data in automated annotation by video-based LLMs. These unauthorized annotated video-text pairs can then be used to improve the performance of downstream tasks, such as text-to-video generation. To safeguard personal videos from unauthorized use, we propose two series of protective video watermarks with imperceptible adversarial perturbations, named Ramblings and Mutes. Concretely, Ramblings aim to mislead video-based LLMs into generating inaccurate captions for the videos, thereby degrading the quality of video annotations through inconsistencies between video content and captions. Mutes, on the other hand, are designed to prompt video-based LLMs to produce exceptionally brief captions, lacking descriptive detail. Extensive experiments demonstrate that our video watermarking methods effectively protect video data by significantly reducing video annotation performance across various video-based LLMs, showcasing both stealthiness and robustness in protecting personal video content. Our code is available at https://github.com/ttthhl/Protecting_Your_Video_Content.

翻译：近年来，基于视频的大语言模型（video-based LLMs）在各种视频理解任务中取得了令人瞩目的性能。然而，这一快速发展引发了严重的隐私和安全担忧，尤其是在基于视频的LLMs未经授权使用个人视频数据进行自动标注方面。这些未经授权的视频-文本标注对随后可用于提升下游任务（如文本到视频生成）的性能。为了保护个人视频免遭未经授权的使用，我们提出了两种具有不可察觉对抗扰动的保护性视频水印系列，分别命名为Ramblings和Mutes。具体而言，Ramblings旨在误导基于视频的LLMs为视频生成不准确的描述，从而通过视频内容与描述之间的不一致性降低视频标注的质量。另一方面，Mutes则被设计为促使基于视频的LLMs产生异常简短的描述，缺乏细节信息。大量实验表明，我们的视频水印方法通过显著降低多种基于视频的LLMs的视频标注性能，有效保护了视频数据，在保护个人视频内容方面展现了良好的隐蔽性和鲁棒性。我们的代码可在https://github.com/ttthhl/Protecting_Your_Video_Content获取。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日