Channel: TAO Toolkit - NVIDIA Developer Forums

Low accuracy for MS COCO dataset in tao maskrcnn model training


• Hardware: A5000
• Network Type: Mask_rcnn
• TLT Version: nvidia/tao/tao-toolkit-tf:v3.22.05-tf1.15.5-py3
• Training spec file
tao_maskrcnn_02_09_24_train_v6.txt (2.4 KB)
tao_maskrcnn_02_09_24_train_v7.txt (2.4 KB)

This is a trend I have observed while training with the MS COCO dataset. The dataset was filtered to include only the "truck", "bus" and "person" classes, and TFRecords were then generated for training and validation using the TAO command-line tool.

Since this was my first time training this model, I initially planned to run training with the default config file from the documentation (MaskRCNN - NVIDIA Docs), but this failed with an error saying the training loss had gone to NaN in the very first iteration. The attached training configs therefore use much lower learning-rate values than those in the documentation. With the config ending in v6, the training loss was still jumping around a lot, so I reduced the values again by a factor of 10; you will find this update in the config ending in v7, after which the loss stopped jumping everywhere.

In both of these trainings I have noticed that the loss does not come down and stays around 3. When I run inference on the validation dataset with the final model file, there are no detections or segmentations in the output for any of the validation images, and the computed AP values are near 0. Since the loss never decreases, I am also unable to select the right model to test, which tells me the training is not happening properly. What could be the reason for this behaviour?
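For reference, the class-filtering step described above can be sketched as a small Python helper that prunes a COCO `instances_*.json` file down to a chosen set of categories before TFRecord generation. This is a minimal sketch, not the exact script used for this training; the function name `filter_coco` and the in-memory dict interface are assumptions.

```python
# Hypothetical sketch: restrict a COCO annotations dict to a subset of
# category names before generating TFRecords. Paths/names are assumptions.
import json

KEEP = {"truck", "bus", "person"}

def filter_coco(coco):
    """Return a copy of a COCO annotations dict containing only the
    categories in KEEP, plus the annotations and images that use them."""
    cats = [c for c in coco["categories"] if c["name"] in KEEP]
    cat_ids = {c["id"] for c in cats}
    anns = [a for a in coco["annotations"] if a["category_id"] in cat_ids]
    img_ids = {a["image_id"] for a in anns}
    imgs = [i for i in coco["images"] if i["id"] in img_ids]
    return {**coco, "categories": cats, "annotations": anns, "images": imgs}

# Typical usage (paths are placeholders):
#   with open("instances_train2017.json") as f:
#       filtered = filter_coco(json.load(f))
#   with open("instances_train2017_filtered.json", "w") as f:
#       json.dump(filtered, f)
```

One detail worth checking with a filter like this: if the downstream converter remaps category IDs, the class count in the training spec (`num_classes`, including background) must match the filtered set, since a mismatch there is a common cause of NaN or non-decreasing loss.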
Thanks

6 posts - 2 participants


