EX-4D
Turn any video into a controllable 4D experience
2025-07-04

EX-4D is an open-source framework by Pico/ByteDance that turns a single video into a camera-controllable 4D experience. Its novel Depth Watertight Mesh keeps geometry consistent even at extreme viewpoints.
EX-4D is an open-source framework from Pico/ByteDance that transforms ordinary videos into dynamic 4D experiences with full camera control. Its standout feature is the Depth Watertight Mesh, a novel geometric representation that models both visible and occluded regions, so the rendered views stay consistent even from extreme viewpoints.
The framework addresses two common problems, geometric inconsistencies and occlusion artifacts, with a simulated masking strategy that removes the need for multi-view training datasets. A lightweight LoRA-based video diffusion adapter keeps training efficient: only 1% of the parameters are trainable, yet the output remains high-quality and temporally coherent.
EX-4D works in three steps: building the mesh, generating training masks, and synthesizing the video. The result is smooth transitions and realistic dynamics, which makes it well suited to immersive content creation.
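To give a feel for why a LoRA adapter can leave so few parameters trainable, here is a minimal arithmetic sketch. It is not EX-4D's code: the layer size (4096 x 4096) and the rank (16) are hypothetical values chosen only for illustration, and the function name is made up for this example. It counts the parameters of one frozen weight matrix against the two small low-rank matrices that LoRA trains in its place.

```python
def lora_trainable_fraction(d_in: int, d_out: int, rank: int) -> float:
    """Share of trainable parameters for one linear layer with a LoRA update.

    The original d_in x d_out weight stays frozen. LoRA adds two small
    matrices, A (d_in x rank) and B (rank x d_out), and trains only those.
    """
    frozen = d_in * d_out
    trainable = rank * (d_in + d_out)
    return trainable / (frozen + trainable)


# Hypothetical layer size and rank, not taken from EX-4D.
fraction = lora_trainable_fraction(d_in=4096, d_out=4096, rank=16)
print(f"{fraction:.2%}")  # prints "0.78%"
```

With these illustrative numbers the trainable share of the layer is below 1%, which is the same order of magnitude as the figure quoted above. The actual share in any given model depends on its layer sizes, the chosen rank, and which layers receive adapters.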
Open Source
Artificial Intelligence
Video