Zhenxing Mi, Yuxin Wang, Dan Xu
The Hong Kong University of Science and Technology (HKUST)
(Teaser video: cut_1.mp4)
One4D is a unified framework for 4D generation and reconstruction. Through Unified Masked Conditioning (UMC), it seamlessly transitions between 4D generation from a single image, 4D reconstruction from a full video, mixed generation and reconstruction from sparse frames, and 4D generation from a text prompt. Its Decoupled LoRA Control (DLC) employs two modality-specific LoRA adapters that form decoupled computation branches for RGB frames and pointmaps, connected by lightweight, zero-initialized control links that gradually learn mutual pixel-level consistency. Together, these designs enable One4D to produce high-quality RGB frames and accurate pointmaps across both generation and reconstruction tasks.
Figure 1: The One4D Unified Framework architecture.
- Unified Masked Conditioning (UMC): Enables seamless transitions between 4D generation from a single image, 4D reconstruction from a full video, mixed generation and reconstruction from sparse frames, and 4D generation from a text prompt, all within a single unified model (see the sketch after this list).
- Decoupled LoRA Control (DLC): Decouples the RGB and XYZ (pointmap) computation branches to minimize cross-modal interference while maintaining pixel-wise cross-modal control (see the sketch after Figure 2).
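As a rough illustration of UMC, the conditioning can be pictured as masking out unobserved frames and concatenating the mask itself as an extra channel, so that all four tasks differ only in which frames are marked as observed. The sketch below is a hypothetical layout under that assumption; the function name, channel ordering, and mask encoding are illustrative, not the released implementation.

```python
import torch

def build_umc_condition(frames: torch.Tensor, known: list) -> torch.Tensor:
    """Hypothetical sketch of Unified Masked Conditioning (UMC).

    frames: (T, C, H, W) RGB video clip.
    known:  indices of observed frames -- all of them for full-video
            reconstruction, [0] for image-to-4D, a sparse subset for
            mixed generation/reconstruction, and [] for text-to-4D.
    """
    T, C, H, W = frames.shape
    mask = torch.zeros(T, 1, H, W)
    mask[known] = 1.0                      # mark observed frames
    cond = frames * mask                   # zero out unobserved frames
    return torch.cat([cond, mask], dim=1)  # (T, C+1, H, W) conditioning

# The four tasks differ only in the mask:
# video = torch.rand(16, 3, 64, 64)
# build_umc_condition(video, list(range(16)))  # full-video reconstruction
# build_umc_condition(video, [0])              # image-to-4D generation
# build_umc_condition(video, [0, 7, 15])       # mixed sparse-frame input
# build_umc_condition(video, [])               # text-to-4D generation
```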
Figure 2: Comparison of Decoupled LoRA Control against other architectures.
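To make the DLC idea concrete, here is a minimal PyTorch sketch of how two modality-specific LoRA branches with zero-initialized control links could wrap one frozen linear layer of the video backbone. The class and parameter names, the rank, and the exact placement of the links are assumptions for illustration; the actual layer design in the paper may differ.

```python
import torch
import torch.nn as nn

class DecoupledLoRAControl(nn.Module):
    """Hypothetical sketch: two modality-specific LoRA branches around one
    frozen linear layer, joined by zero-initialized control links."""

    def __init__(self, base: nn.Linear, rank: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)          # pretrained backbone stays frozen
        d_in, d_out = base.in_features, base.out_features
        # Decoupled LoRA adapters, one per modality.
        self.rgb_down = nn.Linear(d_in, rank, bias=False)
        self.rgb_up = nn.Linear(rank, d_out, bias=False)
        self.xyz_down = nn.Linear(d_in, rank, bias=False)
        self.xyz_up = nn.Linear(rank, d_out, bias=False)
        # Control links between the branches, zero-initialized so the
        # module reduces to the pretrained layer at the start of training.
        self.rgb_to_xyz = nn.Linear(d_out, d_out, bias=False)
        self.xyz_to_rgb = nn.Linear(d_out, d_out, bias=False)
        for layer in (self.rgb_up, self.xyz_up,
                      self.rgb_to_xyz, self.xyz_to_rgb):
            nn.init.zeros_(layer.weight)

    def forward(self, h_rgb: torch.Tensor, h_xyz: torch.Tensor):
        rgb = self.base(h_rgb) + self.rgb_up(self.rgb_down(h_rgb))
        xyz = self.base(h_xyz) + self.xyz_up(self.xyz_down(h_xyz))
        # Lightweight links gradually learn cross-modal consistency.
        return rgb + self.xyz_to_rgb(xyz), xyz + self.rgb_to_xyz(rgb)

# layer = DecoupledLoRAControl(nn.Linear(64, 64))
# out_rgb, out_xyz = layer(torch.rand(2, 64), torch.rand(2, 64))
```

At initialization the links contribute nothing, so each modality starts from the pretrained backbone undisturbed; during fine-tuning they gradually open up to exchange pixel-aligned information between the RGB and pointmap branches.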
Generating a consistent 4D scene from a single input image.
(Videos: cut_2.mp4–cut_6.mp4)
Reconstructing the 4D scene given only a few sparse frames.
(Videos: cut_7.mp4, cut_8.mp4, cut_11.mp4–cut_13.mp4)
High-fidelity reconstruction from a full video input.
(Videos: cut_9.mp4, cut_10.mp4, cut_14.mp4–cut_16.mp4)
Generating a consistent 4D scene from a pure text prompt.
(Videos: cut_19.mp4, cut_20.mp4)
If you find our work useful for your research, please consider citing us:
@article{mione4d2025,
  title={One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control},
  author={Mi, Zhenxing and Wang, Yuxin and Xu, Dan},
  journal={arXiv preprint arXiv:2511.18922},
  year={2025}
}
