1 code implementation • 8 Aug 2023 • Dianze Li, Jianing Li, Yonghong Tian
Then, we design a spatiotemporal Transformer architecture to detect objects via an end-to-end sequence prediction problem, where the novel temporal Transformer module leverages rich temporal cues from two visual streams to improve the detection performance.