Event-based Action Recognition (EAR) offers the advantages of high-temporal-resolution capture and privacy preservation over traditional action recognition. Current leading EAR solutions typically follow one of two regimes: project unstructured event streams into dense, structured event frames and adopt powerful frame-specific networks, or employ lightweight point-specific networks to handle sparse, unstructured event points directly. However, both regimes overlook a fundamental issue: they fail to accommodate the uniquely dense temporal and sparse spatial properties of asynchronous event data. In this article, we present a synergy-aware framework, EventCrab, that adeptly integrates "lighter" frame-specific networks for dense event frames with "heavier" point-specific networks for sparse event points, balancing accuracy and efficiency. Furthermore, we establish a joint frame-text-point representation space to bridge distinct event frames and points. Specifically, to better exploit the unique spatiotemporal relationships inherent in asynchronous event points, we devise two strategies for the "heavier" point-specific embedding: i) a Spiking-like Context Learner (SCL) that extracts contextualized event points from raw event streams, and ii) an Event Point Encoder (EPE) that further explores long-range spatiotemporal features of event points in a Hilbert-scan order. Experiments on four datasets demonstrate the superior performance of the proposed EventCrab, with accuracy gains of 5.17% on SeAct and 7.01% on HARDVS.
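To make the "Hilbert-scan" serialization concrete, the sketch below is a minimal, hypothetical illustration of the idea (not the authors' implementation): sparse event points are ordered by their 2D Hilbert-curve index so that spatially neighboring points stay close in the 1D sequence handed to a point encoder. The function names (`hilbert_index`, `hilbert_scan`), the 128x128 sensor resolution, and the event layout are all assumptions made for this example.

```python
import numpy as np

def hilbert_index(order, x, y):
    """Map integer grid coordinates (x, y) in [0, 2**order) to a 1D
    position along a Hilbert curve of the given order (the classic
    xy-to-d conversion)."""
    d = 0
    s = 1 << (order - 1)
    while s > 0:
        rx = 1 if (x & s) > 0 else 0
        ry = 1 if (y & s) > 0 else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/flip the quadrant so the sub-curve is oriented correctly.
        if ry == 0:
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        s >>= 1
    return d

def hilbert_scan(events, order=7):
    """Reorder an (N, 4) array of events [x, y, t, polarity] so that
    spatially adjacent points become adjacent in the sequence, with
    timestamps breaking ties at each location."""
    keys = np.array([hilbert_index(order, int(x), int(y))
                     for x, y in events[:, :2]])
    # np.lexsort sorts by the last key first: Hilbert index primary,
    # timestamp secondary.
    perm = np.lexsort((events[:, 2], keys))
    return events[perm]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 1000
    events = np.column_stack([
        rng.integers(0, 128, n),   # x on a 128x128 sensor grid
        rng.integers(0, 128, n),   # y
        np.sort(rng.random(n)),    # monotonically increasing timestamps
        rng.integers(0, 2, n),     # polarity (0/1)
    ]).astype(np.float64)
    serialized = hilbert_scan(events, order=7)
    print(serialized.shape)  # (1000, 4): ready for a sequence encoder
```

The appeal of a Hilbert ordering over a plain raster scan is locality preservation: points that are close on the sensor plane tend to remain close in the serialized sequence, which helps a sequence model capture long-range spatiotemporal structure among sparse event points.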