Self-Supervised Action Recognition Linear
1 paper with code • 2 benchmarks • 2 datasets
Most implemented papers
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning
First, masked data reconstruction is performed to learn modality-specific representations from audio and visual streams.
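To make the idea of masked data reconstruction concrete, here is a minimal sketch. The summary above does not say how patches are masked or which loss is used, so this assumes random patch masking and a mean squared error computed only on the hidden patches. The function names, the 75% mask ratio, and the toy data are all hypothetical and are not taken from the XKD implementation.

```python
import random


def random_mask(num_patches, mask_ratio, rng):
    """Return a sorted list of patch indices to hide from the encoder."""
    num_masked = int(round(num_patches * mask_ratio))
    return sorted(rng.sample(range(num_patches), num_masked))


def masked_reconstruction_loss(patches, reconstructed, masked_idx):
    """Mean squared error computed over the masked patches only."""
    total, count = 0.0, 0
    for i in masked_idx:
        for target, pred in zip(patches[i], reconstructed[i]):
            total += (target - pred) ** 2
            count += 1
    return total / count if count else 0.0


rng = random.Random(0)

# Toy "stream": 8 patches of 4 values each, standing in for
# video frame patches or audio spectrogram patches.
patches = [[rng.random() for _ in range(4)] for _ in range(8)]

# Assumed mask ratio; the source does not specify one.
masked_idx = random_mask(len(patches), mask_ratio=0.75, rng=rng)

# Hypothetical decoder output: an all-zero prediction as a trivial baseline.
reconstructed = [[0.0] * 4 for _ in patches]
loss = masked_reconstruction_loss(patches, reconstructed, masked_idx)
```

In a setup like this, the same routine would be run separately on the audio and visual streams, so that each encoder learns a representation specific to its own modality before any cross-modal step.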