Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wav2Vec-U 2.0: could not training with fp16 #5092

Open
xiabingquan opened this issue Apr 28, 2023 · 1 comment
Open

Wav2Vec-U 2.0: could not training with fp16 #5092

xiabingquan opened this issue Apr 28, 2023 · 1 comment

Comments

@xiabingquan
Copy link

xiabingquan commented Apr 28, 2023

❓ Questions and Help

Before asking:

  1. search the issues.
  2. search the docs.

What is your question?

When training Wav2Vec-U 2.0 models following the official configuration, I tried training with fp16 but leads to errors. The losses will be NaN.

common:
fp16: false
fp16_no_flatten_grads: true
log_format: json
log_interval: 100
tensorboard_logdir: tb
reset_logging: false
suppress_crashes: false

Code

No code needed.

What have you tried?

Kind of stuck on debugging. Have no idea.

What's your environment?

  • fairseq Version: '0.12.2'
  • PyTorch Version: '1.13.0+cu117'
  • OS: Linux avsu-ESC8000-G4 5.15.0-69-generic
  • How you installed fairseq (pip, source): pip -e install
  • Build command you used (if compiling from source):
  • Python version: Python 3.8.13
  • CUDA/cuDNN version: 11.7
  • GPU models and configuration: 4 GeForce RTX 3090
  • Any other relevant information:
@XR1988
Copy link

XR1988 commented Dec 19, 2024

Thanks for your work. What's the current status? I'm not getting good results; my UER is stuck around 90.
I've cloned this repo (it might have environment problems new): https://github.com/oneapi-src/ai-transcribe
Others cloned it with a virtual environment: https://github.com/voidful/wav2vec-u-exp
I'm having trouble with this: #5572
@xiabingquan

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants