W.A.L.T Video Diffusion

License model

  • Free
  • Proprietary

Platforms

  • Self-Hosted

Features

  1.  Image to Image Generation
  2.  AI-Powered

W.A.L.T Video Diffusion information

  • Licensing

    Proprietary and Free product.
  • Alternatives

    22 alternatives listed
  • Supported Languages

    • English

AlternativeTo Categories

AI Tools & Services, Photos & Graphics

W.A.L.T Video Diffusion has 4 likes and no user comments or reviews yet.

W.A.L.T Video Diffusion was added to AlternativeTo by Mauricio B. Holguin on Dec 11, 2023 and this page was last updated Dec 11, 2023.

What is W.A.L.T Video Diffusion?

W.A.L.T is a transformer-based method for photorealistic video generation via diffusion modeling. It uses a causal encoder to compress images and videos into a unified latent space, and a window attention architecture for joint spatial and spatiotemporal generative modeling.

According to the authors, this design achieves top performance on video (UCF-101 and Kinetics-600) and image (ImageNet) generation benchmarks without using classifier-free guidance. The authors also train a cascade of three models for text-to-video generation, which produces 512 x 896 resolution videos at 8 frames per second.
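
To make the window attention idea above concrete, here is a minimal NumPy sketch of self-attention restricted to non-overlapping windows of a video latent. It is an illustration only, not the authors' implementation: the function name window_attention, the window shapes, the latent size, and the omission of learned query/key/value projections are all assumptions made for this example. A window that spans one frame gives purely spatial attention, while a window that spans several frames gives spatiotemporal attention.

```python
import numpy as np

def window_attention(x, window):
    """Self-attention restricted to non-overlapping (t, h, w) windows.

    x      : latent tokens of shape (T, H, W, C)
    window : (wt, wh, ww), which must evenly divide (T, H, W)

    Illustrative assumption: queries, keys, and values are the tokens
    themselves (no learned projections, single head).
    """
    T, H, W, C = x.shape
    wt, wh, ww = window
    assert T % wt == 0 and H % wh == 0 and W % ww == 0

    # Split each axis into (number of windows, window size).
    x = x.reshape(T // wt, wt, H // wh, wh, W // ww, ww, C)
    # Bring the window-index axes to the front.
    x = x.transpose(0, 2, 4, 1, 3, 5, 6)
    # One row of tokens per window: (num_windows, tokens_per_window, C).
    tokens = x.reshape(-1, wt * wh * ww, C)

    # Scaled dot-product attention, computed independently per window.
    scores = tokens @ tokens.transpose(0, 2, 1) / np.sqrt(C)
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    out = weights @ tokens

    # Undo the window partition to recover the (T, H, W, C) layout.
    out = out.reshape(T // wt, H // wh, W // ww, wt, wh, ww, C)
    out = out.transpose(0, 3, 1, 4, 2, 5, 6)
    return out.reshape(T, H, W, C)

# Hypothetical latent: 4 frames of 8 x 8 tokens with 16 channels.
rng = np.random.default_rng(0)
latents = rng.normal(size=(4, 8, 8, 16))

spatial = window_attention(latents, (1, 8, 8))         # each frame attends only to itself
spatiotemporal = window_attention(latents, (4, 4, 4))  # local patches attend across all frames
```

With the spatial window, changing one frame leaves the outputs for every other frame untouched. With the spatiotemporal window, information flows between frames, but only inside each local patch, which keeps the cost well below that of full attention over all tokens.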