Stable Diffusion@lemmy.dbzer0.com · English · 2 years ago

MoMask: Generative Masked Modeling of 3D Human Motions

ericguo5513.github.io

Posted by Even_Adder@lemmy.dbzer0.com [M] to Stable Diffusion@lemmy.dbzer0.com

Abstract

We introduce MoMask, a novel masked modeling framework for text-driven 3D human motion generation. In MoMask, a hierarchical quantization scheme is employed to represent human motion as multi-layer discrete motion tokens with high-fidelity details. Starting at the base layer, with a sequence of motion tokens obtained by vector quantization, the residual tokens of increasing orders are derived and stored at the subsequent layers of the hierarchy. This is followed by two distinct bidirectional transformers. For the base-layer motion tokens, a Masked Transformer is designated to predict randomly masked motion tokens conditioned on text input at the training stage. During the generation (i.e., inference) stage, starting from an empty sequence, our Masked Transformer iteratively fills in the missing tokens. Subsequently, a Residual Transformer learns to progressively predict the next-layer tokens based on the results from the current layer. Extensive experiments demonstrate that MoMask outperforms the state-of-the-art methods on the text-to-motion generation task, with an FID of 0.045 (vs. e.g. 0.141 for T2M-GPT) on the HumanML3D dataset, and 0.228 (vs. 0.514) on KIT-ML. MoMask can also be seamlessly applied to related tasks, such as text-guided temporal inpainting, without further model fine-tuning.
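
For anyone unfamiliar with the "residual tokens" part of the abstract, here is a minimal sketch of residual quantization in general. This is not the authors' code: the codebooks are random rather than learned, and every name (`nearest`, `residual_quantize`, `reconstruct`, `make_codebooks`) and every size is made up for illustration. The base layer picks the closest codebook entry for a vector, and each later layer quantizes whatever error the earlier layers left behind.

```python
import random


def nearest(codebook, vec):
    """Return the index of the codebook entry closest to vec (squared L2)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )


def residual_quantize(vec, codebooks):
    """Return one token per layer; each layer encodes the residual left so far."""
    tokens = []
    residual = list(vec)
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        tokens.append(idx)
        # Subtract the chosen entry; the next layer sees only the remaining error.
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return tokens


def reconstruct(tokens, codebooks):
    """Sum the selected entry from each layer to approximate the original vector."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(tokens, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


def make_codebooks(num_layers=3, size=8, dim=4, seed=0):
    """Build toy random codebooks (assumption: a real model learns these)."""
    rng = random.Random(seed)
    codebooks = []
    for layer in range(num_layers):
        scale = 1.0 / (2 ** layer)  # finer entries at deeper layers
        entries = [[rng.uniform(-scale, scale) for _ in range(dim)]
                   for _ in range(size)]
        # An all-zero entry guarantees an extra layer never increases the error.
        entries[0] = [0.0] * dim
        codebooks.append(entries)
    return codebooks


if __name__ == "__main__":
    codebooks = make_codebooks()
    vec = [0.3, -0.7, 0.5, 0.1]
    tokens = residual_quantize(vec, codebooks)
    print("tokens per layer:", tokens)
    print("reconstruction:", reconstruct(tokens, codebooks))
```

In the paper's setup, as described above, the first token of each stack would be the base-layer motion token and the rest would be the residual tokens handled by the Residual Transformer. The sketch only covers the quantization step, not either transformer.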

Paper: https://arxiv.org/abs/2312.00063

Code: https://github.com/EricGuo5513/momask-codes (coming Dec. 15)

Project Page: https://ericguo5513.github.io/momask/
