Easily implement SOTA video understanding methods with PyTorch on multiple machines and GPUs
X-Temporal is an open-source video understanding codebase from the SenseTime X-Lab group. It provides state-of-the-art video classification models, including implementations of the papers "Temporal Segment Networks", "Temporal Interlacing Network", "Temporal Shift Module", "ResNet 3D", "SlowFast Networks for Video Recognition", and "Non-local Neural Networks".
This repo includes all the models and code used in our 1st place solution in the ICCV19 Multi-Moments in Time Challenge.
Supports popular video understanding frameworks: SlowFast, R(2+1)D, R3D, TSN, TIN, and TSM.
Supports various datasets (Kinetics, Something-Something, Multi-Moments in Time, ...). Input can be raw video, RGB frames, or optical-flow frames, and multi-label datasets are supported.
The high-performance, modular design helps with rapid implementation and evaluation of novel video research ideas.
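As an illustration of how frame-based input is commonly prepared for segment-based models such as TSN, here is a minimal sketch of sparse segment sampling: the video is split into equal-length temporal segments and one frame index is drawn from each. The function name `sample_segment_indices` and its parameters are hypothetical and chosen for this example only; this is not X-Temporal's actual API, and the codebase's own sampling logic may differ.

```python
import random


def sample_segment_indices(num_frames, num_segments, train=False, seed=None):
    """Pick one frame index per equal-length temporal segment.

    Hypothetical helper for illustration; not part of X-Temporal.
    In training mode a random frame is drawn from each segment;
    otherwise the middle frame of each segment is used.
    """
    if num_frames <= 0 or num_segments <= 0:
        raise ValueError("num_frames and num_segments must be positive")

    rng = random.Random(seed)
    seg_len = num_frames / num_segments  # may be fractional
    indices = []
    for k in range(num_segments):
        start = int(seg_len * k)
        # Exclusive end; keep at least one frame per segment for short videos.
        end = max(start + 1, int(seg_len * (k + 1)))
        if train:
            idx = rng.randrange(start, end)
        else:
            idx = (start + end - 1) // 2
        # Clamp in case the video has fewer frames than segments.
        indices.append(min(idx, num_frames - 1))
    return indices


if __name__ == "__main__":
    # Deterministic (evaluation-style) sampling of 8 frames from a 32-frame clip.
    print(sample_segment_indices(32, 8))
```

The returned indices would then be used to load the corresponding RGB or flow frames. When the video is shorter than the requested number of segments, this sketch repeats frames rather than failing.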