Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(新人求教)请问有没有人遇到过这个问题:THCudaCheck FAIL file=..\aten\src\THC\THCCachingHostAllocator.cpp line=296 error=30 : unknown error #125

Open
caizhangjie opened this issue Apr 17, 2024 · 0 comments

Comments

@caizhangjie
Copy link

在训练模型的过程中出现下面的报错:
Epoch 12/200: 29%|██████▉ | 116/404 [01:31<03:42, 1.30it/s, loss=0.27, lr=0.000299]THCudaCheck FAIL file=..\aten\src\THC\THCCachingHostAllocator.cpp line=296 error=30 : unknown error
Traceback (most recent call last):
File "c:/Users/lenovo/Desktop/yolov5-pytorch-main/yolov5-pytorch-main/train.py", line 570, in
fit_one_epoch(model_train, model, ema, yolo_loss, loss_history, eval_callback, optimizer, epoch, epoch_step, epoch_step_val, gen, gen_val, UnFreeze_Epoch, Cuda, fp16, scaler, save_period, save_dir, local_rank)
File "c:\Users\lenovo\Desktop\yolov5-pytorch-main\yolov5-pytorch-main\utils\utils_fit.py", line 74,
in fit_one_epoch
ema.update(model_train)
File "c:\Users\lenovo\Desktop\yolov5-pytorch-main\yolov5-pytorch-main\nets\yolo_training.py", line 397, in update
v *= d
RuntimeError: CUDA error: unknown error
Epoch 12/200: 29%|██████▉ | 116/404 [01:34<03:54, 1.23it/s, loss=0.27, lr=0.000299]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant