Automated video analysis is critical for wildlife conservation. A foundational task in this domain is multi-animal tracking (MAT), which underpins applications such as individual re-identification and behavior recognition. However, existing datasets are limited in scale, constrained to a few species, or lack sufficient temporal and geographical diversity - leaving no suitable benchmark for training general-purpose MAT models applicable across wild animal populations. To address this, we introduce SA-FARI, the largest open-source MAT dataset for wild animals. It comprises 11,609 camera trap videos collected over approximately 10 years (2014-2024) from 741 locations across 4 continents, spanning 99 species categories. Each video is exhaustively annotated culminating in ~46 hours of densely annotated footage containing 16,224 masklet identities and 942,702 individual bounding boxes, segmentation masks, and species labels. Alongside the task-specific annotations, we publish anonymized camera trap locations for each video. Finally, we present comprehensive benchmarks on SA-FARI using state-of-the-art vision-language models for detection and tracking, including SAM 3, evaluated with both species-specific and generic animal prompts. We also compare against vision-only methods developed specifically for wildlife analysis. SA-FARI is the first large-scale dataset to combine high species diversity, multi-region coverage, and high-quality spatio-temporal annotations, offering a new foundation for advancing generalizable multianimal tracking in the wild. The dataset is available at https://www.conservationxlabs.com/sa-fari.


翻译:自动化视频分析对野生动物保护至关重要。该领域的基础任务是多动物追踪(MAT),其为个体重识别和行为识别等应用提供支撑。然而,现有数据集在规模上受限,仅涵盖少数物种,或缺乏足够的时空多样性——目前尚无适用于训练跨野生动物种群的通用MAT模型的合适基准。为此,我们推出SA-FARI,这是目前最大的开源野生动物MAT数据集。该数据集包含11,609段相机陷阱视频,采集时间跨度约10年(2014-2024年),覆盖4大洲的741个地点,涵盖99个物种类别。每段视频均经过详尽标注,总计约46小时的高密度标注影像,包含16,224个掩码标识、942,702个独立边界框、分割掩码及物种标签。除任务专用标注外,我们还公开了每段视频的匿名化相机陷阱地理位置信息。最后,我们基于SA-FARI使用最先进的视觉语言模型(包括SAM 3)建立了全面的检测与追踪基准测试,并通过物种专用提示词和通用动物提示词进行评估。同时,我们还与专为野生动物分析开发的纯视觉方法进行了对比。SA-FARI是首个融合高物种多样性、多区域覆盖和高质量时空标注的大规模数据集,为推进野外通用多动物追踪研究提供了新基础。数据集可通过https://www.conservationxlabs.com/sa-fari获取。

0
下载
关闭预览

相关内容

数据集,又称为资料集、数据集合或资料集合,是一种由数据所组成的集合。
Data set(或dataset)是一个数据的集合,通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量,如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数,该数据集的数据可能包括一个或多个成员。
Top
微信扫码咨询专知VIP会员