[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
video-understanding
weakly-supervised-learning
video-captioning
multimodal-learning
vision-and-language
dense-video-captioning
pre-training
temporal-language-grounding
video-chapter-generation
vid2seq
Updated Nov 13, 2023 - Jupyter Notebook