
Questions about Mimi: loss balancing and bfloat16/mixed precision training. #178

Open · 1 task done
SarthakYadav opened this issue Jan 1, 2025 · 3 comments
Labels: question (Further information is requested)

Comments

@SarthakYadav

Due diligence

  • I have done my due diligence in trying to find the answer myself.

Topic

The paper

Question

Thanks for the great work. I'm trying to reproduce Mimi and had the following questions:

  1. Does Mimi use a loss balancer such as the one used in Encodec for training? The paper points to the default Encodec configuration in AudioCraft, which uses loss balancing, so I was wondering if that's the case for Mimi as well (a rough sketch of the kind of balancer I mean is below).
  2. Was Mimi trained in bfloat16? Or did the actual training happen in full precision and the weights were exported in bfloat16?
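
For clarity, here's roughly the kind of balancer I mean: a minimal, hand-written sketch of an Encodec/AudioCraft-style gradient balancer, where each loss is rescaled so it contributes a fixed share of the gradient norm at the decoder output regardless of its raw scale. The `balanced_backward` name and signature are just for illustration, not from AudioCraft or the Mimi codebase:

```python
# Minimal sketch of an Encodec/AudioCraft-style loss balancer (illustrative only;
# not Mimi's actual training code, which is unreleased).
import torch

def balanced_backward(losses: dict[str, torch.Tensor],
                      weights: dict[str, float],
                      output: torch.Tensor,
                      total_norm: float = 1.0) -> None:
    """losses: per-loss scalars, weights: desired relative shares,
    output: the tensor the losses were computed from (e.g. the decoder output)."""
    grads, norms = {}, {}
    for name, loss in losses.items():
        # Gradient of each individual loss w.r.t. the model output only.
        (grad,) = torch.autograd.grad(loss, [output], retain_graph=True)
        grads[name] = grad
        norms[name] = grad.norm(p=2)

    weight_sum = sum(weights[n] for n in losses)
    combined = torch.zeros_like(output)
    for name in losses:
        # Rescale so each loss contributes weights[name]/weight_sum of total_norm,
        # independently of how large its raw gradient happens to be.
        share = total_norm * weights[name] / weight_sum
        combined += grads[name] * (share / (norms[name] + 1e-12))

    # Backpropagate the balanced gradient through the rest of the model.
    output.backward(combined)
```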

Thanks!

SarthakYadav added the question label (Further information is requested) on Jan 1, 2025
@akshatvishu commented Jan 8, 2025

Hey @SarthakYadav 👋, it seems the training code for Mimi has not yet been released, but they plan to do so in the near future, as mentioned in their FAQ section. However, here's what I could gather from the current resources. In their README, they state:

Finally, and similarly to EBEN, Mimi uses only an adversarial training loss, along with feature matching, showing strong improvements in terms of subjective quality despite its low bitrate.

So I personally don't think they've used a loss balancer (see the feature-matching sketch below for what the README refers to instead).

As for Q2, I think only the official team could answer, as they haven't released the training code yet!
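
Going back to Q1 for a second: the feature matching the README mentions is usually an L1 distance between the discriminator's intermediate activations for the real and the reconstructed audio. A minimal, illustrative sketch (not taken from the Mimi/Moshi code; the function name and signature are mine):

```python
# Hedged sketch of a feature-matching loss as commonly used in adversarial
# codec training (illustrative only; not from the Mimi training code).
import torch
import torch.nn.functional as F

def feature_matching_loss(real_feats: list[torch.Tensor],
                          fake_feats: list[torch.Tensor]) -> torch.Tensor:
    """real_feats/fake_feats: intermediate discriminator activations for the
    reference and reconstructed audio, one tensor per layer."""
    loss = torch.zeros(())
    for r, f in zip(real_feats, fake_feats):
        # L1 distance between feature maps; the real features are detached so
        # only the generator receives this gradient.
        loss = loss + F.l1_loss(f, r.detach())
    return loss / max(len(real_feats), 1)
```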

@LaurentMazare (Member)

2. Was Mimi trained in bfloat16? Or did the actual training happen in full precision and the weights were exported in bfloat16?

Which weights are you referring to? Looking at the model.safetensors file on our Hugging Face repo, the weights should actually be in fp32 rather than bf16 (and this should also be the case in our other repos).
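
If it helps, one way to check locally (a small sketch; the file path is a placeholder for a downloaded checkpoint):

```python
# Quick dtype check on a downloaded safetensors checkpoint (path is a placeholder).
from collections import Counter
from safetensors import safe_open

with safe_open("model.safetensors", framework="pt") as f:
    dtypes = Counter(str(f.get_tensor(k).dtype) for k in f.keys())
print(dtypes)  # e.g. Counter({'torch.float32': ...}) if the weights are fp32
```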

@SarthakYadav (Author)

@LaurentMazare Thanks, the tokenizer weights are indeed fp32; it's only the Moshi weights that are in bf16.
