VT-SSum

VT-SSum is a benchmark dataset with spoken language for video transcript segmentation and summarization

Statistics of VT-SSum

Source	train	dev	test
Vedio	7,692	962	962

Baselines

Summarization

Evaluation results on the test data of VT-SSum with models fine-tuned on different datasets

Models	Top3-Precision	Top3-Recall	Top3-F1	Top5-Precision	Top5-Recall	Top5-F1
CNN/DM	31.49	58.41	40.92	27.10	74.46	39.74
VT-SSum	37.86	69.04	48.90	29.79	80.79	43.53
CNN/DM→VT-SSum	38.10	69.48	49.22	29.88	81.02	43.66

Evaluation results on the test data of AMI with models fine-tuned on different datasets

Models	Top3-Precision	Top3-Recall	Top3-F1	Top5-Precision	Top5-Recall	Top5-F1
CNN/DM	45.30	59.03	51.26	39.03	72.61	50.77
VT-SSum	51.80	67.96	58.79	42.72	79.26	55.51
CNN/DM→VT-SSum	52.66	68.72	59.62	42.99	79.78	55.87

Segmentation

Evaluation result of the segmentation on the test data of VT-SSum

Models	Accuracy
LSTM	90.33
UniLMv2_base	92.14
UniLMv2_large	93.00

Format of the data

Each file(*.json) consists of 6 fields:

id: The id of video.
title: The title of the current video.
info: Some information of current video, such as time of published/recorded.
url: The link to the current video.
segmentation: The segmentation part of the VT-SSum. This field consists of a list:
```
  [
      [sent_0 in seg_0, sent_1 in seg_0, ..., sent_n in seg_0],
       ..., 
      [sent_0 in seg_k, sent_1 in seg_k, ..., sent_m in seg_k]
  ]
```
where k is the number of segments in the current video, and n/m is the number of sentences in the segment.

summarization: The summarization part of the VT-SSum. This field consists of a dict:

  {
      "clip_0":
          {
              "is_summarization_sample": true/false,
              "summarization_data": [
                  {
                      "sent": "sent_0",
                      "label": 0/1,
                  },
                  ...
                  {
                      "sent": "sent_n",
                      "label": 0/1,
                  },
              ]
          }
      ...,
      "clip_k":
          {
              ...
          }
  }

where is_summarization_sample indicates that the current segment has summary and be used in the training/evaluation of the summarization task.

Paper and Citation

VT-SSum: A Benchmark Dataset for Video Transcript Segmentation and Summarization [Preprint]

If you find VT-SSum useful in your research, please cite the following paper:

@article{lv2021vt,
  title={VT-SSum: A Benchmark Dataset for Video Transcript Segmentation and Summarization},
  author={Lv, Tengchao and Cui, Lei and Vasilijevic, Momcilo and Wei, Furu},
  journal={arXiv preprint arXiv:2106.05606},
  year={2021}
}

License

This project is licensed under CC BY-NC-ND 4.0

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
dev		dev
test		test
train		train
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dev

dev

test

test

train

train

README.md

README.md

Repository files navigation

VT-SSum

Statistics of VT-SSum

Baselines

Summarization

Segmentation

Format of the data

Paper and Citation

License

About

Releases

Packages

Contributors 2

Dod-o/VT-SSum

Folders and files

Latest commit

History

Repository files navigation

VT-SSum

Statistics of VT-SSum

Baselines

Summarization

Segmentation

Format of the data

Paper and Citation

License

About

Resources

Stars

Watchers

Forks