SVO

Official Repository for the SIGIR 2024 paper: Short Video Ordering via Position Decoding and Successor Prediction.

The code and the dataset will be released soon after the paper is published officially.

2024.7.15: The code has been released. 2024.12.18: The SVO dataset has been released.

About

Short video collection is an easy way for users to consume coherent content on various online short video platforms, such as TikTok, YouTube, Douyin, and WeChat Channel. However, short video creators occasionally publish videos in a disorganized manner due to various reasons, such as revisions, secondary creations, deletions, and reissues, which often result in a poor browsing experience for users. Therefore, accurately reordering videos within a collection based on their content coherence is a vital task that can enhance user experience and presents an intriguing research problem in the field of video narrative reasoning. In this work, we curate a dedicated multimodal dataset for this Short Video Ordering (SVO) task and present the performance of some benchmark methods on the dataset. In addition, we further propose an advanced SVO framework with the aid of position decoding and successor prediction. Extensive experiments demonstrate that our method achieves the best performance on our open SVO dataset, and each component of the framework contributes to the final performance.

Dataset

The data required to re-implement our work has been uploaded to the following URL: https://drive.google.com/drive/folders/11FcKWZBlPoBOMVMQOcSqD0DuisZbkgD7?usp=sharing

Important Note: This dataset is intended for research purposes only and may not be used for commercial purposes. Due to considerations regarding copyright and the protection of user privacy, we are unable to directly open-source the original video data. Therefore, we have made available the video frame features extracted using the CLIP model. This data can not be used technically, nor should anyone attempt to use it, to reconstruct the original videos. However, it is sufficient for reproducing our methods or for further research that does not infringe upon user privacy.

License

This work is licensed under CC BY-NC 4.0

Citation

If you find our paper and code useful in your research, please consider giving a star ⭐ and citation 📝 :)

@inproceedings{ge2024short,
  title={Short Video Ordering via Position Decoding and Successor Prediction},
  author={Ge, Shiping and Chen, Qiang and Jiang, Zhiwei and Yin, Yafeng and Chen, Ziyao and Gu, Qing},
  booktitle={Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval},
  pages={2167--2176},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
Figure		Figure
src		src
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SVO

About

Dataset

License

Citation

About

Releases

Packages

Languages

License

ShipingGe/SVO

Folders and files

Latest commit

History

Repository files navigation

SVO

About

Dataset

License

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages