questions about Loss Functions and Representation #196

XueYing126 · 2024-03-26T12:29:44Z

Thank you for the great work!!!

I have a question regarding the choice of loss functions and data representation. Specifically, I noticed the use of a 263-D representation with L2 loss during training for the text-to-motion task with the HumanML3D dataset.

I'm curious about the role of foot contact loss and velocity loss, which seem optimized independently of joint positions. Can you clarify how these contribute to the final motion prediction?
(As far as I understand, the final output motion uses only part of the 263-D representation: recover_from_ric() takes the root rotation/position and the local joint positions and computes the global joint positions from them. Did I miss something here?)

Additionally, considering the availability and effectiveness of models like SMPL, could you explain why it wasn't utilized for this task? Do you think SMPL would also be a suitable representation for this task?

Thank you for your time and insights.

GuyTevet · 2024-05-07T08:36:32Z

Thanks @XueYing126 ! Please check out #19 and let me know if you have more questions. Thanks:)

XueYing126 · 2024-05-07T12:13:53Z

Thank you for checking the issue. However, I still have a question.

Are we primarily concerned with the final 22 human joints?

From what I understand, these 22 joints are obtained by accumulating the root velocity and adding the local joint positions, so they depend only on the first 4 + 21 * 3 = 67 entries (out of 263).
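To make the arithmetic above concrete, here is a minimal sketch of how the 263 entries could be split per frame. The group names and their order are an assumption based on the commonly described HumanML3D layout, not something confirmed in this thread, so please check them against the dataset code before relying on them.

```python
import numpy as np

JOINTS = 22  # skeleton size discussed in this thread

# Assumed per-frame feature layout as (name, width); names are illustrative.
LAYOUT = [
    ("root_rot_velocity", 1),
    ("root_linear_velocity_xz", 2),
    ("root_height", 1),
    ("local_joint_positions", (JOINTS - 1) * 3),  # 21 non-root joints
    ("joint_rotations_6d", (JOINTS - 1) * 6),
    ("local_joint_velocities", JOINTS * 3),
    ("foot_contacts", 4),
]


def feature_slices(layout):
    """Turn (name, width) pairs into named slices over the feature axis."""
    slices, start = {}, 0
    for name, width in layout:
        slices[name] = slice(start, start + width)
        start += width
    return slices, start


SLICES, TOTAL_DIM = feature_slices(LAYOUT)

# Entries that feed the joint-position reconstruction under this layout:
# the 4 root entries plus the 21 * 3 local joint positions.
POSITION_DIM = SLICES["local_joint_positions"].stop

motion = np.zeros((60, TOTAL_DIM), dtype=np.float32)  # dummy 60-frame clip
position_part = motion[:, :POSITION_DIM]
print(TOTAL_DIM, POSITION_DIM, position_part.shape)
```

Under this assumed layout, the remaining 196 entries (rotations, velocities, foot contacts) are the ones the question is about.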

The local joint velocities, rotations, and foot contacts do not affect these final 22 joints.

This leaves me confused about how the 263-dimensional representation is evaluated. Shouldn't it be based on the predicted 22 joints rather than the entire 263 dimensions?
For instance, shouldn't the foot contact loss be determined by thresholding the predicted joints (as was done to compute the ground truth), rather than relying solely on the last binary features of the 263-dimensional prediction?
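As an illustration of what I mean by thresholding the predicted joints, here is a rough sketch that derives binary contact labels from global joint positions. The foot joint indices and the threshold value are assumptions chosen for the example, and the function name is hypothetical; this is not the repository's loss.

```python
import numpy as np


def foot_contacts_from_joints(joints, foot_ids=(7, 10, 8, 11), threshold=0.002):
    """Label a foot joint as in contact when it barely moves between frames.

    joints:    array of shape (T, J, 3) with global joint positions.
    foot_ids:  assumed indices of the ankle/toe joints (illustrative).
    threshold: assumed bound on the squared per-frame displacement.
    Returns an array of shape (T - 1, len(foot_ids)) with values 0.0 or 1.0.
    """
    feet = joints[:, list(foot_ids), :]
    # Squared displacement of each foot joint between consecutive frames.
    sq_disp = np.sum((feet[1:] - feet[:-1]) ** 2, axis=-1)
    return (sq_disp < threshold).astype(np.float32)


# A motionless clip: every foot joint counts as being in contact.
static_clip = np.zeros((5, 22, 3), dtype=np.float32)
static_contacts = foot_contacts_from_joints(static_clip)

# A clip sliding one unit per frame along x: no foot joint is in contact.
sliding_clip = static_clip.copy()
sliding_clip[:, :, 0] = np.arange(5, dtype=np.float32)[:, None]
sliding_contacts = foot_contacts_from_joints(sliding_clip)
```

Labels computed this way from the predicted joints could then be compared with the ground-truth contacts, instead of reading the contact entries directly from the prediction.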

Thank you again!

GuyTevet · 2024-05-11T17:29:59Z

Indeed, the visualization is based on the 22 joint locations, yet the evaluation is performed using all 263 entries.

Jocker9527 · 2024-05-24T09:28:23Z

Can anyone tell me where the loss files are?

GuyTevet · 2024-05-24T12:24:10Z

motion-diffusion-model/diffusion/gaussian_diffusion.py

Line 1231 in dd0d003

    def training_losses(self, model, x_start, t, model_kwargs=None, noise=None, dataset=None):

GuyTevet closed this as completed May 24, 2024
