MiZhenxing/One4D

One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control

Zhenxing Mi, Yuxin Wang, Dan Xu
The Hong Kong University of Science and Technology (HKUST)

arXiv Paper | Hugging Face | Website

cut_1.mp4

πŸ“ Abstract

One4D is a unified framework for 4D generation and reconstruction that can seamlessly transition between 4D generation from a single image, 4D reconstruction from a full video, mixed generation and reconstruction from sparse frames, and 4D generation from a text prompt via Unified Masked Conditioning (UMC). With Decoupled LoRA Control (DLC), which employs two modality-specific LoRA adapters to form decoupled computation branches for RGB frames and pointmaps, connected by lightweight, zero-initialized control links that gradually learn mutual pixel-level consistency, One4D produces high-quality RGB frames and accurate pointmaps across both generation and reconstruction tasks.

🧠 Methodology

Unified Framework

One4D Framework

Figure 1: The One4D Unified Framework architecture.

  • πŸŽ›οΈ Unified Masked Conditioning (UMC): Enables seamlessly transition between 4D generation from a single image, 4D reconstruction from a full video, and mixed generation and reconstruction from sparse frames using a single unified model.
  • 🧩 Decoupled LoRA Control (DLC): Decouples RGB and XYZ computation to minimize interference while maintaining pixel-wise cross-modal control.
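The task modes that UMC covers can be pictured as different per-frame conditioning masks fed to the same model. The snippet below is a minimal sketch of that idea only: the function names, the task labels, and the frame-level 0/1 mask layout are hypothetical illustrations, not the repository's API, and the actual mask resolution and injection mechanism are not described here.

```python
from typing import Dict, List, Sequence


def build_condition_mask(num_frames: int, observed: Sequence[int]) -> List[int]:
    """Per-frame mask: 1 = frame is given as a condition, 0 = frame must be generated.

    Hypothetical helper for illustration; not part of the One4D codebase.
    """
    mask = [0] * num_frames
    for idx in observed:
        if not 0 <= idx < num_frames:
            raise ValueError(f"frame index {idx} out of range for {num_frames} frames")
        mask[idx] = 1
    return mask


def task_masks(num_frames: int, sparse: Sequence[int]) -> Dict[str, List[int]]:
    """Map each task mode to the mask it would use (task labels are assumptions)."""
    return {
        "text_to_4d": build_condition_mask(num_frames, []),          # nothing observed
        "image_to_4d": build_condition_mask(num_frames, [0]),        # first frame only
        "sparse_to_4d": build_condition_mask(num_frames, sparse),    # a few frames
        "video_to_4d": build_condition_mask(num_frames, range(num_frames)),  # all frames
    }


masks = task_masks(num_frames=8, sparse=[0, 3, 7])
for task, mask in masks.items():
    print(f"{task:13s} {mask}")
```

Under this view, switching between generation and reconstruction changes only the mask and the observed frames, which is what lets a single model serve every mode.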

Architecture Comparison

Figure 2: Comparison of Decoupled LoRA Control against other architectures.
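To make the DLC idea concrete, here is a toy sketch of one layer with two LoRA branches over a shared frozen weight and zero-initialized cross-branch links. Everything in it is an assumption made for illustration: the class and function names, the single-vector "token", the additive form of the link, and the random adapter values standing in for trained weights. It is not the authors' implementation.

```python
import random
from typing import List, Tuple

Vector = List[float]
Matrix = List[List[float]]


def matvec(W: Matrix, x: Vector) -> Vector:
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def add(a: Vector, b: Vector) -> Vector:
    return [p + q for p, q in zip(a, b)]


def rand_matrix(rows: int, cols: int, rng: random.Random) -> Matrix:
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def zeros(rows: int, cols: int) -> Matrix:
    return [[0.0] * cols for _ in range(rows)]


class LoRABranch:
    """Shared frozen base weight plus a modality-specific low-rank update."""

    def __init__(self, base: Matrix, rank: int, rng: random.Random) -> None:
        dim = len(base)
        self.base = base                        # shared, frozen
        self.down = rand_matrix(rank, dim, rng)
        self.up = rand_matrix(dim, rank, rng)   # non-zero to mimic a trained adapter

    def __call__(self, x: Vector) -> Vector:
        return add(matvec(self.base, x), matvec(self.up, matvec(self.down, x)))


def dlc_layer(rgb_branch: LoRABranch, xyz_branch: LoRABranch,
              link_xyz_to_rgb: Matrix, link_rgb_to_xyz: Matrix,
              rgb: Vector, xyz: Vector) -> Tuple[Vector, Vector]:
    """Run both branches, then exchange information through the control links."""
    h_rgb, h_xyz = rgb_branch(rgb), xyz_branch(xyz)
    out_rgb = add(h_rgb, matvec(link_xyz_to_rgb, h_xyz))
    out_xyz = add(h_xyz, matvec(link_rgb_to_xyz, h_rgb))
    return out_rgb, out_xyz


rng = random.Random(0)
dim, rank = 4, 2
base = rand_matrix(dim, dim, rng)
rgb_branch, xyz_branch = LoRABranch(base, rank, rng), LoRABranch(base, rank, rng)
rgb = [rng.uniform(-1, 1) for _ in range(dim)]
xyz = [rng.uniform(-1, 1) for _ in range(dim)]

# Zero-initialized links: each branch initially behaves as if the other were absent.
out_rgb, out_xyz = dlc_layer(rgb_branch, xyz_branch, zeros(dim, dim), zeros(dim, dim), rgb, xyz)
print("rgb unchanged at init:", out_rgb == rgb_branch(rgb))
print("xyz unchanged at init:", out_xyz == xyz_branch(xyz))
```

The zero initialization means the links contribute nothing at the start of training, so the two branches begin decoupled and cross-modal coupling is introduced only as the link weights move away from zero.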

🎨 Results Showcase

1. Single Image to 4D

Generating a consistent 4D scene from a single input image.

cut_2.mp4
cut_6.mp4
cut_5.mp4
cut_4.mp4
cut_3.mp4

2. Sparse Frames to 4D

Reconstructing the 4D scene given only a few sparse frames.

cut_13.mp4
cut_12.mp4
cut_11.mp4
cut_8.mp4
cut_7.mp4

3. Full Video to 4D

High-fidelity reconstruction from a full video input.

cut_16.mp4
cut_15.mp4
cut_14.mp4
cut_10.mp4
cut_9.mp4

4. Text to 4D

Generating a consistent 4D scene from a pure text prompt.

cut_19.mp4
cut_20.mp4

📖 BibTeX

If you find our work useful for your research, please consider citing us:

@article{mione4d2025,
  title={One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control},
  author={Mi, Zhenxing and Wang, Yuxin and Xu, Dan},
  journal={arXiv preprint arXiv:2511.18922},
  year={2025}
}
