We propose Lighthouse, a user-friendly library for reproducible video moment retrieval and highlight detection (MR-HD). Although researchers proposed various MR-HD approaches, the research community holds two main issues. The first is a lack of comprehensive and reproducible experiments across various methods, datasets, and video-text features. This is because no unified training and evaluation codebase covers multiple settings. The second is user-unfriendly design. Because previous works use different libraries, researchers set up individual environments. In addition, most works release only the training codes, requiring users to implement the whole inference process of MR-HD. Lighthouse addresses these issues by implementing a unified reproducible codebase that includes six models, three features, and five datasets. In addition, it provides an inference API and web demo to make these methods easily accessible for researchers and developers. Our experiments demonstrate that Lighthouse generally reproduces the reported scores in the reference papers. The code is available at https://github.com/line/lighthouse.
翻译:我们提出了Lighthouse,一个用于可复现视频片段检索与高光检测(MR-HD)的用户友好库。尽管研究者已提出多种MR-HD方法,但该研究领域仍存在两个主要问题。其一是缺乏跨方法、数据集与视频-文本特征的全面可复现实验,这是因为目前没有统一的训练与评估代码库能够覆盖多种设置。其二是用户友好性不足。由于已有工作使用不同的程序库,研究者需要分别搭建独立环境。此外,大多数工作仅发布训练代码,要求用户自行实现完整的MR-HD推理流程。Lighthouse通过实现统一的复现代码库解决了这些问题,该库涵盖六个模型、三种特征与五个数据集。同时,我们提供了推理API与网页演示,使研究者和开发者能够便捷地使用这些方法。实验表明,Lighthouse能够基本复现原始论文中报告的性能指标。代码已发布于https://github.com/line/lighthouse。