A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering

Chaoning Zhang,Fachrina Dewi Puspitasari,Sheng Zheng,Chenghao Li,Yu Qiao,Taegoo Kang,Xinru Shan,Chenshuang Zhang,Caiyan Qin,Francois Rameau,Lik-Hang Lee,Sung-Ho Bae,Choong Seon Hong

from arxiv, First survey on Segment Anything Model (SAM), work under progress

Segment anything model (SAM) developed by Meta AI Research has recently attracted significant attention. Trained on a large segmentation dataset of over 1 billion masks, SAM is capable of segmenting any object on a certain image. In the original SAM work, the authors turned to zero-short transfer tasks (like edge detection) for evaluating the performance of SAM. Recently, numerous works have attempted to investigate the performance of SAM in various scenarios to recognize and segment objects. Moreover, numerous projects have emerged to show the versatility of SAM as a foundation model by combining it with other models, like Grounding DINO, Stable Diffusion, ChatGPT, etc. With the relevant papers and projects increasing exponentially, it is challenging for the readers to catch up with the development of SAM. To this end, this work conducts the first yet comprehensive survey on SAM. This is an ongoing project and we intend to update the manuscript on a regular basis. Therefore, readers are welcome to contact us if they complete new works related to SAM so that we can include them in our next version.

翻译：Meta AI 研究团队提出的分割一切模型（SAM）近期引起了广泛关注。该模型在包含超过10亿个掩膜的大规模分割数据集上训练，能够对任意图像中的任意对象进行分割。在原始SAM工作中，作者将零样本迁移任务（如边缘检测）作为评估指标。近期大量研究开始探索SAM在不同场景下识别与分割对象的性能表现。此外，众多项目展示了SAM作为基础模型的通用性，将其与其他模型（如Grounding DINO、Stable Diffusion、ChatGPT等）相结合。随着相关论文与项目呈指数级增长，读者难以全面追踪SAM的发展脉络。为此，本文首次对SAM进行系统性综述。本工作为持续性项目，我们将定期更新文稿。欢迎读者将SAM相关新成果联系我们，以便纳入后续版本。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日