Generative Video Models

FuseFormer is a Transformer-based model for video inpainting that performs fine-grained feature fusion using two novel operations, Soft Split and Soft Composition. Soft Split divides a feature map into many patches with a given overlapping interval, while Soft Composition stitches the patches back into a whole feature map, summing the values of pixels in overlapping regions. FuseFormer also builds Soft Composition and Soft Split into its feed-forward network to further enhance sub-patch-level feature fusion.
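The two operations can be illustrated with a minimal sketch. It assumes a single-channel 2-D feature map stored as nested lists, no padding, and a stride smaller than the patch size to create overlap. The function names `soft_split` and `soft_composition` and their parameters are hypothetical and chosen for illustration; this is not the authors' implementation, which operates on multi-channel tensors.

```python
def soft_split(feat, patch, stride):
    """Cut a 2-D feature map (list of rows) into patch x patch tiles.

    Tiles overlap whenever stride < patch. Returns the tiles and the
    (top, left) position of each tile so they can be stitched back.
    """
    h, w = len(feat), len(feat[0])
    tiles, positions = [], []
    for top in range(0, h - patch + 1, stride):
        for left in range(0, w - patch + 1, stride):
            tiles.append([row[left:left + patch] for row in feat[top:top + patch]])
            positions.append((top, left))
    return tiles, positions


def soft_composition(tiles, positions, h, w):
    """Stitch tiles back into an h x w map, summing values where tiles overlap."""
    out = [[0.0] * w for _ in range(h)]
    for tile, (top, left) in zip(tiles, positions):
        for i, row in enumerate(tile):
            for j, value in enumerate(row):
                out[top + i][left + j] += value
    return out


# Toy example (assumed sizes): a 4x4 map of ones, 2x2 patches, stride 1.
feat = [[1.0] * 4 for _ in range(4)]
tiles, positions = soft_split(feat, patch=2, stride=1)
merged = soft_composition(tiles, positions, 4, 4)
```

Because overlaps are summed rather than averaged, as the description above states, each output value in this toy example equals the number of tiles covering that pixel: corners are covered once, edges twice, and interior pixels four times. A tensor-based version would more likely use a library's unfold and fold operations than explicit loops.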

Source: FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

Tasks


Task                       Papers  Share
Seeing Beyond the Visible  1       50.00%
Video Inpainting           1       50.00%

Categories