Click on GIF for YouTube video
An attempt to train DETR on Custom Dataset of Construction Classes (about 50), for Object Detection and Panoptic Segmentation.
Achieved following results:
IoU metric: bbox
Metric | IoU Range | maxDetections | Value |
---|---|---|---|
Average Precision (AP) | 0.50:0.95 | 100 | 0.609 |
Average Recall (AR) | 0.50:0.95 | 100 | 0.86 |
Segmentation Metric: Panoptic, Segmentation, Recognition Quality
PQ | SQ | RQ | N | |
---|---|---|---|---|
All | 54.6 | 79.3 | 60.8 | 63 |
Things | 62.3 | 83.5 | 68.6 | 48 |
Stuff | 29.9 | 65.8 | 35.8 | 15 |
Let's go deep into the process now:
First let's wonder and understand how DETR works and ponder on few questions.
Now, we know how DETR works and what are the components, let's see how we create our Custom Dataset/
Now, we can train first for the Object Detection.
Training(maybe, finetuning) for Object Detection
Yayyy, now we have well trained, object detector for Things
and Stuff
classes, now we can freeze the detection weights and train our Panoptic model.
And, that's it for now. We shall try to implement few more things in the coming future, like applying RICAP for better detection results, use Mixed Precision, Change Number of Queries and do other Ablation study. Also, maybe now move to Deformable DETR which claims to be faster training than DETR.
Sayo Nara!
All Drawings done on draw.io
DETR Paper: arxiv
Excellent Demo from Author: Video
Great Explanation on Paper: Yanic AI Epiphany