# VTG-GPT

This is our implementation of the paper *VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT*.

VTG-GPT leverages frozen GPTs to enable zero-shot inference without training.
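As the section headings below suggest, the method captions video frames with MiniGPT-v2 and debiases the language query with Baichuan2, then localizes the moment whose frame captions best match the debiased query (our reading of the pipeline). The sketch below is a hypothetical illustration of that matching step, with `embed` standing in for any frozen sentence encoder; it is not the repository's actual inference code.

```python
import numpy as np

def embed(text: str) -> np.ndarray:
    """Placeholder for any frozen sentence encoder (hypothetical)."""
    raise NotImplementedError

def ground(query: str, captions: list[tuple[float, str]],
           threshold: float = 0.5) -> tuple[float, float]:
    """Return the span covering the first contiguous run of frames
    whose captions are cosine-similar to the (debiased) query."""
    q = embed(query)
    q /= np.linalg.norm(q)
    run = []
    for t, caption in captions:
        c = embed(caption)
        if float(q @ c) / np.linalg.norm(c) >= threshold:
            run.append(t)
        elif run:
            break  # the first above-threshold run has ended
    if not run:
        raise ValueError("no caption matched the query")
    return run[0], run[-1]
```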


## Preparation

1. Install dependencies

```bash
conda create -n vtg-gpt python=3.10
conda activate vtg-gpt
pip install -r requirements.txt
```

2. Unzip caption files

```bash
cd data/qvhighlights/caption/
unzip val.zip
```
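To sanity-check the extraction, you can list what was unpacked; this snippet only assumes the directory path shown above:

```python
from pathlib import Path

caption_dir = Path("data/qvhighlights/caption")
files = sorted(p for p in caption_dir.rglob("*") if p.is_file())
print(f"{len(files)} caption files under {caption_dir}")
print(*files[:5], sep="\n")  # peek at the first few entries
```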

## Inference on QVHighlights val split

```bash
# inference
python infer_qvhighlights.py val

# evaluation
bash standalone_eval/eval.sh
```

Running the above commands should reproduce the following results:

| Metrics | R1@0.5 | R1@0.7 | mAP@0.5 | mAP@0.75 | mAP@avg |
|---------|--------|--------|---------|----------|---------|
| Values  | 59.03  | 38.90  | 56.11   | 35.44    | 35.57   |
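Here, R1@0.5 and R1@0.7 are Recall@1 at temporal IoU (tIoU) thresholds of 0.5 and 0.7, and the mAP columns are mean average precision at the given tIoU thresholds (mAP@avg averages over a range of thresholds). For reference, a minimal sketch of the tIoU computation underlying these metrics; the repository's actual evaluation lives in `standalone_eval/`:

```python
def temporal_iou(pred: tuple[float, float], gt: tuple[float, float]) -> float:
    """Temporal IoU between two (start, end) spans in seconds."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0

# A prediction counts toward R1@0.5 when its top-ranked span reaches
# tIoU >= 0.5 with the ground-truth moment.
print(temporal_iou((10.0, 25.0), (12.0, 30.0)))  # 0.65
```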

## MiniGPT-v2 for image captioning

TODO
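While this component is still marked TODO, here is a hypothetical sketch of the frame-captioning stage: sample frames with OpenCV and hand each one to a captioner. `caption_image` is a placeholder for the eventual MiniGPT-v2 call, and the 0.5 fps sampling rate is our assumption, not the paper's setting.

```python
import cv2  # pip install opencv-python

def caption_image(frame) -> str:
    """Placeholder for the MiniGPT-v2 captioner; swap in the real
    model call once this component is released."""
    raise NotImplementedError

def caption_video(video_path: str, sample_fps: float = 0.5):
    """Sample frames at `sample_fps` and caption each one, yielding
    (timestamp_in_seconds, caption) pairs."""
    cap = cv2.VideoCapture(video_path)
    native_fps = cap.get(cv2.CAP_PROP_FPS) or 30.0
    step = max(1, round(native_fps / sample_fps))
    idx = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if idx % step == 0:
            yield idx / native_fps, caption_image(frame)
        idx += 1
    cap.release()
```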

## Baichuan2 for query debiasing

TODO
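Likewise a rough sketch of query debiasing, assuming the chat interface exposed by the Baichuan2 checkpoints on Hugging Face; the checkpoint name and prompt wording are illustrative, not the ones used in the paper.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed checkpoint; any Baichuan2 chat model should behave similarly.
MODEL = "baichuan-inc/Baichuan2-7B-Chat"
tokenizer = AutoTokenizer.from_pretrained(MODEL, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    MODEL, torch_dtype=torch.float16, device_map="auto",
    trust_remote_code=True,
)

def debias_query(query: str) -> str:
    # Illustrative instruction; the paper's actual prompt may differ.
    prompt = ("Rewrite this video search query as a neutral, concrete "
              f"description of what is visible on screen: {query}")
    # Baichuan2 chat checkpoints expose a `chat` helper via remote code.
    return model.chat(tokenizer, [{"role": "user", "content": prompt}])

print(debias_query("chef makes pizza and cuts it up"))
```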

## Acknowledgement

We thank Youyao Jia for helpful discussions.

This code builds on Moment-DETR and SeViLA, and uses resources from MiniGPT-4, Baichuan2, and Llama 2. We thank the authors for their awesome open-source contributions.

## Citation

If you find this project useful for your research, please cite our paper:

```bibtex
@article{xu2024vtg,
  title={VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT},
  author={Xu, Yifang and Sun, Yunzhuo and Xie, Zien and Zhai, Benxiang and Du, Sidan},
  journal={Applied Sciences},
  volume={14},
  number={5},
  pages={1894},
  year={2024},
  publisher={MDPI}
}
```
