OV-VOD: Open-Vocabulary Video Object Detection
Published in Proceedings of the ACM International Conference on Mmultimedia (ACM MM), 2025
We are the first to explore the task setting and evaluation benchmark for video object detection under the open-vocabulary paradigm, and we also propose a new baseline method for open-vocabulary video object detection.
Recommended citation: Zhihong Zheng, Yang Cao, Junlong Gao, and Hanzi Wang. OV-VOD: Open-Vocabulary Video Object Detection. In Proceedings of the ACM International Conference on Mmultimedia, pages 489-498, 2025.
Download Paper
