telexed ~ c / 21caf802-bcaradar:70 · generative_mediaLIVE
← back
NO.
#21caf802
Topic
GENERATIVE MEDIA
Source
GitHub Trending Weekly
Published
2026-05-27 07:19:12
Importance
★ 7/10 — radar 70
`SANA` expands into image, video, and controllable world-model generation
FIG-0211:1

`SANA` expands into image, video, and controllable world-model generation

A research repo has become a full training/inference stack for high-res media. Useful for custom pipelines, but heavy for quick SaaS integration.

[ KEY POINTS ]
  1. SANA-WM adds 720p, 1-minute video with 6-DoF camera control; strong for simulation and controllable scene generation ideas.
  2. SANA-Video supports text-to-video and image-to-video, with LTX-VAE and LTX2 Refiner paths up to 2K output.
  3. SGLang support exposes high-performance serving through an OpenAI-compatible API, making product integration less painful.
  4. ComfyUI, Hugging Face, diffusers, and training recipes are all mentioned; the repo is closer to a platform than a single model drop.
Originalgithub.com/NVlabs/SanaRead original →

// related