Description
-
The first step was identifying the official street signs in the United States.
-
The broad categories of signage were warning signs, temporary road signs, regulatory signs, mandatory movement signs, guide signs, services signs, pedestrian signs, and bicycle signs.
-
Images were downloaded from Google Images with a Selenium script that automatically downloads a given number of images for a given search query. Some of the downloaded images were duplicates, which Custom Vision deleted automatically on upload.
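A minimal sketch of what such a download script could look like, assuming Selenium 4 and a working Chrome driver. The function names, the `tbm=isch` search URL, and the choice to keep only `http` image sources are assumptions for illustration, not the original script. Google's page structure changes often, so the image selection may need adjusting.

```python
import os
import urllib.parse
import urllib.request


def build_search_url(query):
    """Build a Google Images search URL (the tbm=isch parameter is an assumption)."""
    return "https://www.google.com/search?" + urllib.parse.urlencode(
        {"q": query, "tbm": "isch"}
    )


def download_images(query, max_images, out_dir="images"):
    """Save up to max_images result thumbnails for query; return the number saved."""
    # Imported here so the URL helper above works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    os.makedirs(out_dir, exist_ok=True)
    driver = webdriver.Chrome()
    saved = 0
    try:
        driver.get(build_search_url(query))
        for img in driver.find_elements(By.TAG_NAME, "img"):
            if saved >= max_images:
                break
            src = img.get_attribute("src")
            # Skip missing sources and inline data: URIs.
            if not src or not src.startswith("http"):
                continue
            # The .jpg extension is a simplification; the real format may differ.
            target = os.path.join(out_dir, f"{saved:04d}.jpg")
            urllib.request.urlretrieve(src, target)
            saved += 1
    finally:
        driver.quit()
    return saved
```

A call such as `download_images("stop sign", 100)` would then correspond to one input search and one input image count.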
-
The number of images was slightly imbalanced across categories, so warning signs and temporary road signs were merged into a single category.
-
At the end of the last iteration, each category label appeared in 50 to 150 images. Custom Vision recommends a minimum of 30 images per category label.
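The per-label image counts can be checked before uploading. The sketch below assumes the annotations are available locally as a mapping from image name to the labels in that image. That data layout, the function names, and the sample data are hypothetical.

```python
from collections import Counter

MIN_IMAGES_PER_LABEL = 30  # the minimum quoted above


def count_images_per_label(annotations, merge_map=None):
    """Count how many images contain each label, after optional label merging.

    annotations: dict mapping image name -> list of labels found in that image.
    merge_map: dict mapping an old label -> the label it is merged into.
    """
    merge_map = merge_map or {}
    counts = Counter()
    for labels in annotations.values():
        # A set makes an image count once per label, even with several boxes.
        counts.update({merge_map.get(label, label) for label in labels})
    return counts


def labels_below_minimum(counts, minimum=MIN_IMAGES_PER_LABEL):
    """Return the labels with fewer images than the minimum, sorted by name."""
    return sorted(label for label, n in counts.items() if n < minimum)


# Hypothetical sample data, not taken from the project.
annotations = {
    "a.jpg": ["warning", "guide"],
    "b.jpg": ["temporary"],
    "c.jpg": ["warning", "warning"],
}
counts = count_images_per_label(annotations, merge_map={"temporary": "warning"})
```

Here the `merge_map` argument mirrors the merging of temporary road signs into warning signs described above.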
-
The domain chosen on Custom Vision was "Logos". This made sense because street signs have distinct shapes, symbols, and colors that differ by category.
-
The model was trained for 12 iterations in total to reach satisfactory performance. At the end of each iteration, Custom Vision reports accuracy, recall, and precision for the overall model as well as per category label.
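For reference, precision and recall follow the standard definitions sketched below. The function name and the counts in the example are hypothetical. How Custom Vision decides whether a detection counts as correct (for example its probability and overlap thresholds) is not covered here.

```python
def precision_recall(true_positives, false_positives, false_negatives):
    """Return (precision, recall) from detection counts.

    precision = TP / (TP + FP): the share of predicted signs that were correct.
    recall    = TP / (TP + FN): the share of actual signs that were found.
    """
    predicted = true_positives + false_positives
    actual = true_positives + false_negatives
    precision = true_positives / predicted if predicted else 0.0
    recall = true_positives / actual if actual else 0.0
    return precision, recall


# Hypothetical counts: 8 correct detections, 2 wrong detections, 4 missed signs.
precision, recall = precision_recall(8, 2, 4)
```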
-
After each iteration, we targeted the category labels with the worst performance. Possible solutions included downloading more images for that label, combining the label with a similar label, or assigning multiple labels to a single bounding box.
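Picking the labels to target can be sketched as a simple ranking over the per-label metrics. The sketch assumes the metrics have been copied by hand into a dictionary. The function name and the sample values are hypothetical, not the project's results.

```python
def worst_labels(per_label_metrics, metric="recall", n=3):
    """Return the n labels with the lowest value of the chosen metric."""
    ranked = sorted(per_label_metrics.items(), key=lambda item: item[1][metric])
    return [label for label, _ in ranked[:n]]


# Hypothetical per-label metrics for one iteration.
metrics = {
    "regulatory": {"precision": 0.95, "recall": 0.90},
    "guide": {"precision": 0.88, "recall": 0.60},
    "bicycle": {"precision": 0.70, "recall": 0.75},
}
targets = worst_labels(metrics, metric="recall", n=2)
```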
-
The final performance metrics were 94.7% precision and 83.3% recall. Recall was the most difficult metric to optimize because a number of images had very small bounding boxes for street signs, which were difficult for the model to identify.
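One way to find such images is to flag bounding boxes that cover only a small fraction of the image. The sketch below assumes each box is stored with a width and height normalized to the range 0 to 1. The function name, the data layout, and the 1% area threshold are assumptions for illustration.

```python
def small_boxes(regions, min_area=0.01):
    """Return the regions whose normalized area (width * height) is below min_area.

    regions: list of dicts with 'label', 'width', and 'height', where width and
    height are fractions of the image size between 0 and 1.
    """
    return [r for r in regions if r["width"] * r["height"] < min_area]


# Hypothetical regions, not taken from the project.
regions = [
    {"label": "regulatory", "width": 0.05, "height": 0.05},  # 0.25% of the image
    {"label": "guide", "width": 0.30, "height": 0.40},       # 12% of the image
]
flagged = small_boxes(regions)
```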