I trained a model for 50 epochs but the AP was still 0.002 #323
Comments
I have the same problem.
Is someone tracking these issues? @ersheng-ai, @Tianxiaomo, @KelvinCPChiu, could you maybe chip in?
If you are using Cfg.use_darknet_cfg = True, you should try this solution: #138 (comment).
Did you find any solution to this? I am also trying to train with a custom dataset. I tried both Cfg.use_darknet_cfg = True and False. No matter what I do, I get AP = 0.000 in evaluation after one epoch of 5000 images, and my loss is in the thousands.
Nope, still waiting...
I have the same problem. My initial loss is 200,000, and although the loss keeps decreasing, it was still at 12,000 at epoch 140, so convergence is quite slow. I also found that the loss is dominated by the regression loss, so I think there may be a problem with how the loss is calculated.
Hello, did you solve this problem? I have trained for hundreds of epochs, and AP and AR are almost zero when I evaluate the model. But when I test a single image using models.py, refer to #413 (comment). Could you give me some advice, please? Thank you.
Nope, I've given up.
Nope, I've given up.
Maybe this: in the function Yolo_loss.build_target(), the label does not match the pred; one is an offset value, the other is not.
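The kind of mismatch suggested here can be illustrated with a small standalone sketch. This is not the repository's Yolo_loss.build_target() code; the helper names encode_box, decode_box and squared_error, and all numeric values, are hypothetical. It assumes the usual YOLO parameterisation, where a box is described by offsets relative to a grid cell and an anchor, and shows that comparing offsets against absolute values gives a large error even when both describe the same box.

```python
import math


def encode_box(cx, cy, w, h, anchor_w, anchor_h):
    """Encode an absolute box (in grid units) as YOLO-style offsets."""
    col, row = int(cx), int(cy)          # grid cell that contains the centre
    tx, ty = cx - col, cy - row          # centre offset inside the cell, in [0, 1)
    tw = math.log(w / anchor_w)          # log-scale size relative to the anchor
    th = math.log(h / anchor_h)
    return (col, row), (tx, ty, tw, th)


def decode_box(cell, offsets, anchor_w, anchor_h):
    """Inverse of encode_box: turn offsets back into an absolute box."""
    col, row = cell
    tx, ty, tw, th = offsets
    return (col + tx, row + ty, anchor_w * math.exp(tw), anchor_h * math.exp(th))


def squared_error(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


# Hypothetical ground-truth box and anchor, both in grid units.
box = (7.3, 4.6, 2.0, 3.5)
anchor = (1.5, 3.0)

cell, offsets = encode_box(*box, *anchor)
decoded = decode_box(cell, offsets, *anchor)

# Same representation on both sides: the error is (numerically) zero.
consistent_error = squared_error(decoded, box)
# Offsets compared against absolute values: a large error for the same box.
mismatched_error = squared_error(offsets, box)
print(f"consistent: {consistent_error:.3g}, mismatched: {mismatched_error:.3g}")
```

If the targets and predictions in a loss use different representations like this, the regression term can stay large no matter how long training runs, which would be consistent with the loss values reported above. Whether that is what happens in this repository would need to be checked in the code itself.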
I had the same problem.
I have the same problem, has anyone solved it?
Hi,
I just abandoned this repo altogether and used the original darknet yolov4 instead. Got it to work much quicker and better.
I changed the learning rate and batch size, it works~
Hi,
I also have the same problem with my custom dataset: after training for 300 epochs the AP is zero, and when running inference with the obtained model I cannot detect anything, even in the training set.
Any ideas?
@Pigdrum
@devedse
When debugging, the bbox loss keeps reporting iou=0. I think there is an issue in the anchor code, or the custom data and the anchor box sizes don't match.
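One way to test the anchor-size hypothesis is to compare the widths and heights of the labelled boxes with the anchors directly. The sketch below is a standalone check, not code from this repository. The anchor list is what I believe the stock darknet yolov4.cfg uses, so treat it as an assumption and substitute the anchors from your own config. The sample box sizes are made up, and the names wh_iou and best_anchor are hypothetical.

```python
def wh_iou(box_wh, anchor_wh):
    """IoU of two boxes sharing the same centre, so only width/height matter."""
    inter = min(box_wh[0], anchor_wh[0]) * min(box_wh[1], anchor_wh[1])
    union = box_wh[0] * box_wh[1] + anchor_wh[0] * anchor_wh[1] - inter
    return inter / union


# Assumed anchors (width, height) in network-input pixels; replace with your own.
ANCHORS = [(12, 16), (19, 36), (40, 28), (36, 75), (76, 55),
           (72, 146), (142, 110), (192, 243), (459, 401)]


def best_anchor(box_wh, anchors=ANCHORS):
    """Return (index, IoU) of the anchor whose shape fits the box best."""
    ious = [wh_iou(box_wh, anchor) for anchor in anchors]
    best = max(range(len(anchors)), key=ious.__getitem__)
    return best, ious[best]


# Hypothetical labelled box sizes, already scaled to the network input size.
sample_boxes = {"car": (150, 100), "license plate": (20, 8)}
for name, wh in sample_boxes.items():
    index, iou = best_anchor(wh)
    print(f"{name}: best anchor #{index} {ANCHORS[index]} with IoU {iou:.2f}")
```

If most boxes in a dataset only reach a low best-anchor IoU, recomputing anchors for that dataset is a reasonable next step. A shape-only IoU is never exactly zero, though, so this check alone would not explain an iou=0 reading in the loss.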
I have the same problem, has anyone solved it?
Candidate groups are well drawn between gt and anchor, but nothing matches (iou>0) between pred and anchor. There seems to be a problem with the model output.
Introduction
For a personal project I'd like to detect cars, buses, trucks and their license plates. To accomplish this, I wanted to train a custom model using YoloV4 with this repository.
I followed the guidance described on this page: https://github.com/Tianxiaomo/pytorch-YOLOv4/blob/master/Use_yolov4_to_train_your_own_data.md
Based on this I've created a train.txt and a val.txt file with the following content:
train.txt (255 lines)
val.txt (64 lines)
When we look at the first image, it's tagged as follows:
There are 4 cars tagged and 1 license plate.
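Before training, it can help to sanity-check every line of the annotation files. The sketch below assumes the line layout I understand the linked guide to describe: an image path followed by space-separated x1,y1,x2,y2,id groups. Please verify that against the guide. The function names and the sample lines are hypothetical, and image paths containing spaces are not handled.

```python
def parse_annotation_line(line):
    """Split one line into (image_path, [(x1, y1, x2, y2, class_id), ...])."""
    parts = line.split()
    if not parts:
        raise ValueError("empty annotation line")
    boxes = []
    for group in parts[1:]:
        values = group.split(",")
        if len(values) != 5:
            raise ValueError(f"expected 5 comma-separated values, got {group!r}")
        x1, y1, x2, y2, class_id = (int(float(v)) for v in values)
        boxes.append((x1, y1, x2, y2, class_id))
    return parts[0], boxes


def find_problems(line, num_classes):
    """Return human-readable problems found in one annotation line."""
    image_path, boxes = parse_annotation_line(line)
    problems = []
    if not boxes:
        problems.append(f"{image_path}: no boxes")
    for x1, y1, x2, y2, class_id in boxes:
        if x2 <= x1 or y2 <= y1:
            problems.append(f"{image_path}: degenerate box {(x1, y1, x2, y2)}")
        if not 0 <= class_id < num_classes:
            problems.append(f"{image_path}: class id {class_id} out of range")
    return problems


# Made-up lines: the first is fine, the second has a flipped box and a bad class id.
good_line = "img002.jpg 10,20,110,80,3"
bad_line = "img001.jpg 10,20,110,80,0 50,60,40,90,5"
for sample in (good_line, bad_line):
    for problem in find_problems(sample, num_classes=4):
        print(problem)
```

Running find_problems over each line of train.txt and val.txt with num_classes equal to the configured class count would flag class ids that exceed it, as well as boxes whose corners are swapped.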
Preparation
To start the actual training process I first changed some values in cfg.py:
I copied the val.txt file to pytorch-YOLOv4\data\val.txt.
Training
After this I ran the following command:
After 50 epochs, though, the results and AP were still really low (there also seems to be a logging error, which I haven't looked into):
When I tried to use the model on an image the results also came out strange:
Also if we look at the TensorBoard we see the same issues:
I'm not sure what I'm missing, but for some reason it seems the model is not training.
More investigation
I also tried with Cfg.use_darknet_cfg = True and Cfg.classes = 4; however, when I do this I keep getting the following error: