unable to autoTrain #9

GallonDeng · 2019-10-24T03:30:46Z

Hi, I set the "numImages_autotrain" to a small number(i.e., 5) to test the autoTrain function. My system is Ubuntu 16.04 and all the aide modules run on a single machine with one AIworker for detection task. But the autoTrain only ran once and never restart even new annotations were completed. It showed the trainning completed and task completed.
Then I mannually started trainning process and it worked a few times but would get stuck if I restart the process (annotaion and then training) again. The status would be kept "PENDING" not "SUCCESS"

bkellenb · 2019-11-27T17:19:31Z

Hi,

Apologies for the delayed response. The "PENDING" message is usually down to the server engine (Gunicorn) spawning multiple threads. Essentially, the job gets scheduled and sent to an AIWorker by one thread, but potentially not broadcast to other, new threads. This only affects the status messages for the GUI, not the actual training, and we are working on a fix for it for the next release.

In the meantime, the default implementations of e.g. RetinaNet should print to the command line while training, so if you have the terminal window of the AIWorker open, you should see the training messages accordingly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

unable to autoTrain #9

unable to autoTrain #9

GallonDeng commented Oct 24, 2019

bkellenb commented Nov 27, 2019

unable to autoTrain #9

unable to autoTrain #9

Comments

GallonDeng commented Oct 24, 2019

bkellenb commented Nov 27, 2019