[TMM 2023] TransGMC: Gated Multi-Scale Transformer for Temporal Action Localization

Introduction

This repository implements TransGMC, published in IEEE Transactions on Multimedia (TMM) in 2023. TransGMC achieves an average mAP of 67.5% on THUMOS14, 36.1% on ActivityNet v1.3, 24.9% on EPIC-Kitchens 100, and 23.2% on Ego4D.

Environment

Component	Version/Model
Python	3.10.9
PyTorch	1.11.0
CUDA	11.3
GPU	NVIDIA GeForce RTX 3090
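To compare a local setup against the versions listed above, a small check like the following may help. It is only a sketch: the helper names (`matches`, `collect_versions`, `report`) are hypothetical and not part of this repository, and other versions may well work even if they are reported as different.

```python
import platform

# Versions copied from the environment table above.
EXPECTED = {"python": "3.10.9", "torch": "1.11.0", "cuda": "11.3"}


def matches(found, expected):
    """Return True if `found` equals `expected` once a local build tag
    such as '+cu113' is stripped. A missing value (None) never matches."""
    if found is None:
        return False
    return found.split("+")[0] == expected


def collect_versions():
    """Gather installed versions; the torch and cuda entries stay None
    if PyTorch is not installed."""
    found = {"python": platform.python_version(), "torch": None, "cuda": None}
    try:
        import torch

        found["torch"] = torch.__version__
        found["cuda"] = torch.version.cuda  # None for CPU-only builds
    except ImportError:
        pass
    return found


def report(found, expected=EXPECTED):
    """Build one human-readable line per component."""
    lines = []
    for name, want in expected.items():
        status = "ok" if matches(found.get(name), want) else "differs"
        lines.append(f"{name}: found {found.get(name)}, listed {want} ({status})")
    return lines


if __name__ == "__main__":
    print("\n".join(report(collect_versions())))
```

A "differs" line is informational only; it shows where the local environment departs from the one listed here.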

Data Preparation/Installation/More Details

For data preparation, installation, and further details, please refer to ActionFormer.

Training and Evaluation

Training (using epic_slowfast_verb as an example)

python ./train.py --config ./configs/epic_slowfast_verb.yaml --output model

Evaluation (using epic_slowfast_verb as an example)

python ./eval.py ./configs/epic_slowfast_verb.yaml ./ckpt/epic_slowfast_verb_model/
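The two commands above can be generated for a given config name with a short helper. This is a sketch under assumptions: `build_commands` is a hypothetical function, not part of this repository, and the checkpoint folder pattern `./ckpt/<config>_<output>/` is inferred only from the example pair above (`epic_slowfast_verb` with `--output model` giving `epic_slowfast_verb_model`).

```python
import shlex


def build_commands(config_name, output_tag="model"):
    """Return (train_cmd, eval_cmd) as argument lists.

    Assumes the checkpoint folder is ./ckpt/<config_name>_<output_tag>/,
    as suggested by the example commands in this README.
    """
    config = f"./configs/{config_name}.yaml"
    ckpt_dir = f"./ckpt/{config_name}_{output_tag}/"
    train_cmd = ["python", "./train.py", "--config", config, "--output", output_tag]
    eval_cmd = ["python", "./eval.py", config, ckpt_dir]
    return train_cmd, eval_cmd


if __name__ == "__main__":
    train_cmd, eval_cmd = build_commands("epic_slowfast_verb")
    print(shlex.join(train_cmd))
    print(shlex.join(eval_cmd))
    # To actually run them from the repository root:
    # import subprocess
    # subprocess.run(train_cmd, check=True)
    # subprocess.run(eval_cmd, check=True)
```

Only `epic_slowfast_verb` is confirmed by this README; any other config name passed to the helper should correspond to a file that actually exists under `./configs/`.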

Model

Dataset	Model
EPIC-Kitchens 100 (verb)	Google Drive download link
EPIC-Kitchens 100 (noun)	Google Drive download link
Ego4D (S+O+E)	Google Drive download link
THUMOS14	Google Drive download link
ActivityNet v1.3 (I3D)	Google Drive download link

References

If you use our code, please consider citing the following papers.

@article{yang2023gated,
  title={Gated Multi-Scale Transformer for Temporal Action Localization},
  author={Yang, Jin and Wei, Ping and Ren, Ziyang and Zheng, Nanning},
  journal={IEEE Transactions on Multimedia},
  year={2023},
  publisher={IEEE}
}

@inproceedings{zhang2022actionformer,
  title={ActionFormer: Localizing Moments of Actions with Transformers},
  author={Zhang, Chen-Lin and Wu, Jianxin and Li, Yin},
  booktitle={European Conference on Computer Vision},
  series={LNCS},
  volume={13664},
  pages={492-510},
  year={2022}
}

The implementation of this code is built upon ActionFormer, and we would like to express gratitude for the open-source contribution of ActionFormer.
