Skip to content

Latest commit

 

History

History
111 lines (65 loc) · 3.58 KB

mlassitdatalabel.md

File metadata and controls

111 lines (65 loc) · 3.58 KB

Machine learning assisted data labelling

Image multi-class and bounding box labelling

Architecture

alt text

Steps

Step 1

First lets copy the data to blob storage for further processing

Step 2

Apply resize of images using the above URL https://github.com/balakreshnan/customvisionai2020/blob/master/ReSize/resizeimages.md

Step 3

Now create a new cluster for training. Select GPU since ML Assit works with GPU only as of writing to this document.

Step 4

Now time to creaet a dataset First click on Dataset and then click Create new data set

alt text

Now create data store where the file are saved:

alt text

Select the Path

alt text

Confirm the details

alt text

Step 5

Create a Data label project

alt text

Select the existing data set created above

alt text

Enable Incremental refresh

alt text

Add labels

alt text

Provide label instruction if available for labellers

alt text

Select ML Assit labelling

alt text

Click Create project

Now go to project and you will land in home page

alt text

Experiments section should show the details

  • First step go to Label data
  • Go through pictures and create bounding boxes
  • Validate the bounding boxes.
  • The Tool will show how many pictures to do bounding boxes
  • Once bounding box is done then we back to home screen and wait.
  • ML will automatically start to run based on what was labelled. Usualy takes time to do this task.

The tool will guide and lets us know how much more it needed to label. Usually it needs per label 25 images for 100 images.

Once the experiment is completed then we can back and validate what ML as done and then draw boxes where is it missing.

Also check the training cluster details

Training: alt text

Validations: alt text

inference: alt text

Conclusion

More to come.