MotionFix: Text-Driven 3D Human Motion Editing

Athanasiou, Nikos; Ceske, Alpár; Diomataris, Markos; Black, Michael J.; Varol, Gül


arXiv:2408.00712 (cs)

[Submitted on 1 Aug 2024 (v1), last revised 19 Sep 2024 (this version, v2)]



Abstract: The focus of this paper is on 3D motion editing. Given a 3D human motion and a textual description of the desired modification, our goal is to generate an edited motion as described by the text. The key challenges include the scarcity of training data and the need to design a model that accurately edits the source motion. In this paper, we address both challenges. We propose a methodology to semi-automatically collect a dataset of triplets comprising (i) a source motion, (ii) a target motion, and (iii) an edit text, introducing the new MotionFix dataset. Access to this data allows us to train a conditional diffusion model, TMED, that takes both the source motion and the edit text as input. We develop several baselines to evaluate our model, comparing it against models trained solely on text-motion pair datasets, and demonstrate the superior performance of our model trained on triplets. We also introduce new retrieval-based metrics for motion editing, establishing a benchmark on the evaluation set of MotionFix. Our results are promising, paving the way for further research in fine-grained motion generation. Code, models, and data are available at this https URL.

Comments:	SIGGRAPH Asia 2024 Camera Ready, Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Cite as:	arXiv:2408.00712 [cs.CV]
	(or arXiv:2408.00712v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.00712

Submission history

From: Nikos Athanasiou
[v1] Thu, 1 Aug 2024 16:58:50 UTC (4,906 KB)
[v2] Thu, 19 Sep 2024 17:28:40 UTC (7,436 KB)
