Video Grounding
Dataset and Code
Structure of Video Grounding Dataset
- Annotations are stored in .txt format
- Annotations:
clipsegment_start_timesegment_end_time##"annnotation_sentence"
Evaluation Metrics
- We follow VSLNet and use both Recall@1 and Mean IoU
- For the evaluation code, please refer to https://github.com/IsaacChanghau/VSLNet/blob/master/util/runner_utils_t7.py
Instructions to run Video Grounding models
- To prepare the environment, please refer to
-
For feature extraction, we used the I3D model and frame rate of 24 fps.
- Move and replace files according to the directories in
$DIR_AK_VG/video_grounding/code/code_new:- Helper script to move / create symbolic links to files
- Remember to change the root directory
$DIR_ROOTin$DIR_AK_VG/video_grounding/code/code_new/prepare_dir_VG.sh bash $DIR_AK_VG/video_grounding/code/code_new/prepare_dir_VG.sh
- Remember to change the root directory
- Helper script to move / create symbolic links to files