AI Research

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Medium Severity Global
Date Occurred May 14, 2026 17:58 UTC
Event Type AI Research
Source arXiv
Recorded May 15, 2026
Full Description

arXiv: SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer We introduce SANA-WM, an efficient 2.6B-parameter open-source world model natively trained for one-minute generation, synthesizing high-fidelity, 720p, minute-scale videos with precise camera control. SANA-WM achieves visual quality comparable to large-scale industrial baselines such as LingBot-World and HY-WorldPlay, while significantly improving efficiency. Four core designs drive our architecture: (1) Hybrid Linear Attention combines frame-wise Gated DeltaNet (GDN) with softmax attention for

AI Intelligence Layer

AI Categories

performance
Event Metadata
  • ID #1314
  • Type AI Research
  • Region Global
  • Severity Medium
  • Indexed May 15, 2026