Mar 22, 2024 · In this study, we tackle the complex task of generating 3D human-object interactions (HOI) from textual descriptions in a zero-shot text-to-3D manner.
InterFusion is a two-stage framework that transforms textual descriptions into detailed 3D human-object interactions.
InterFusion involves human pose estimations derived from text as geometric priors, which simplifies the text-to-3D conversion process and introduces ...
InterFusion can generate diverse 3D scenes of human-object interaction (3D HOI) from text prompts.
Nov 27, 2024 · InterFusion involves human pose estimations derived from text as geometric priors, which simplifies the text-to-3D conversion process and ...
HOI-Diff can generate realistic motions for 3D human-object interactions given a text prompt and object geometry.