RuntimeError: Detected that PyTorch and torchvision were compiled with different CUDA versions. PyTorch has CUDA Version=11.7 and torchvision has CUDA Version=11.6. Please reinstall the torchvision that matches your PyTorch install. #37
Comments
Where are you running the script? If you are using the notebook, does the error occur when you launch the training, or somewhere before that? I only have access to Google Colab, where the CUDA versions seem to match.
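For anyone who wants to run the same check locally, here is a rough sketch. It reads the installed wheel versions from package metadata rather than importing torchvision, since the import itself is what raises the RuntimeError. It assumes the wheels carry a local tag such as `+cu117`; wheels without a tag are reported as not comparable, in which case `torch.version.cuda` is the value to inspect. The helper names and the version strings in the comments are illustrative only.

```python
import re
from importlib import metadata


def cuda_tag(version):
    """Return the CUDA tag of a wheel version such as '1.13.1+cu117', or None."""
    match = re.search(r"\+cu(\d+)", version)
    return match.group(1) if match else None


def builds_match(torch_version, torchvision_version):
    """True only when both versions carry a CUDA tag and the tags are equal."""
    torch_tag = cuda_tag(torch_version)
    vision_tag = cuda_tag(torchvision_version)
    return torch_tag is not None and torch_tag == vision_tag


def installed_version(package):
    """Look up an installed package version without importing the package."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    torch_v = installed_version("torch")
    vision_v = installed_version("torchvision")
    print("torch:", torch_v, "| torchvision:", vision_v)
    if torch_v and vision_v:
        print("CUDA tags match:", builds_match(torch_v, vision_v))
```

If the tags differ, the error message's own advice applies: reinstall a torchvision build that targets the same CUDA version as the installed PyTorch.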
That's odd. Yeah, it happens when the actual training cell is launched. Maybe I have an outdated notebook; I will try the recent one.
Ok, I actually haven't tried the notebook on the main branch for a while. I will test tonight. Thanks for reporting.
I think it needs updating and tweaking. I'm getting error after error from the training cell, and nothing seems to be linked back to the previous cells where the parameters are chosen.
Are you referring to the notebook on the main branch?
Ok thanks. I'll have a look today.
So I've fixed a couple of things and checked that the dependencies are all ok (at least on Google Colab). Please try the Notebook linked below. Two things:
Ok thanks, will give it a go.
For some reason I got an out-of-memory error, although fp16 and 8-bit Adam are enabled, as is gradient checkpointing.
Are train_batch_size and sample_batch_size both equal to 1? Can you post the args.json output here (it will be in your output_dir)? It OOMed at a weird step, so I'm not sure what caused it.
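If it is easier than pasting the whole file, a small sketch like the one below can pull out just those two values. It assumes args.json is a flat JSON object that stores `train_batch_size` and `sample_batch_size` as top-level keys; the function names are hypothetical and not part of the repo.

```python
import json
from pathlib import Path


def load_batch_sizes(output_dir):
    """Read args.json from output_dir and return the two batch-size settings."""
    args = json.loads((Path(output_dir) / "args.json").read_text())
    # Assumed layout: both settings are top-level keys; missing keys come back as None.
    return {key: args.get(key) for key in ("train_batch_size", "sample_batch_size")}


def not_equal_to_one(batch_sizes):
    """Names of the settings that are missing or set to something other than 1."""
    return [name for name, value in batch_sizes.items() if value != 1]
```

Calling `not_equal_to_one(load_batch_sizes(output_dir))` returns an empty list when both settings are 1, which rules out batch size as the cause of the OOM.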
They were, yes. I had already deleted the runtime by the time I saw this, so I lost my output dir.
The error occurs when launching training.
This seems to be a common error, so it is probably not specific to this repo. Any ideas on how to fix it?