Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE

Xu, Yucheng; Nanbo, Li; Goel, Arushi; Guo, Zijian; Yao, Zonghai; Kasaei, Hamidreza; Kasaei, Mohammadreze; Li, Zhibin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.05323 (cs)

[Submitted on 9 Mar 2023 (v1), last revised 4 Apr 2023 (this version, v2)]

Title:Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE

Authors:Yucheng Xu, Li Nanbo, Arushi Goel, Zijian Guo, Zonghai Yao, Hamidreza Kasaei, Mohammadreze Kasaei, Zhibin Li

View PDF

Abstract:Videos depict the change of complex dynamical systems over time in the form of discrete image sequences. Generating controllable videos by learning the dynamical system is an important yet underexplored topic in the computer vision community. This paper presents a novel framework, TiV-ODE, to generate highly controllable videos from a static image and a text caption. Specifically, our framework leverages the ability of Neural Ordinary Differential Equations~(Neural ODEs) to represent complex dynamical systems as a set of nonlinear ordinary differential equations. The resulting framework is capable of generating videos with both desired dynamics and content. Experiments demonstrate the ability of the proposed method in generating highly controllable and visually consistent videos, and its capability of modeling dynamical systems. Overall, this work is a significant step towards developing advanced controllable video generation models that can handle complex and dynamic scenes.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.05323 [cs.CV]
	(or arXiv:2303.05323v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.05323

Submission history

From: Yucheng Xu [view email]
[v1] Thu, 9 Mar 2023 15:13:51 UTC (2,131 KB)
[v2] Tue, 4 Apr 2023 10:59:43 UTC (2,139 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators