Skip to content

Latest commit

 

History

History
22 lines (17 loc) · 2.7 KB

vision_kit_demo_notes.md

File metadata and controls

22 lines (17 loc) · 2.7 KB

Google Vision Kit

Vision demo

version: April 2018

Name Description Considerations Improvements
Image Classification Camera object classification using camera Low confidence. See results: https://gist.github.com/giacomobartoli/a29b066dfefb4af8ac474329c7d2b52b Adding a bounding box framing objects (ex: YOLO)
Image Classification Given an input image this demo classifies objects in it. There are two models available for image classification task: MobileNet based (image classification.MOBILENET), which has 59.9% top-1 accuracy on ImageNet; SqueezeNet based (image_classification.SQUEEZENET), which has 45.3% top-1 accuracy on ImageNet. Still to week. I tried using /img/room.jpg and it only found a bookcase and a window shade. However, the picture contains also plants, books, a sofa, a carpet, woods. Improve accuracy
Dish Classifier Given an input image this demo can classify food. Ex: piadina, napolean pizza with prosciutto, frozen yogurt It's incredibly accurate. It's even able to distinguish composed food. Make it in real time
Dish Detection Same as dish classifier but it provides detection (bbox) It's incredibly accurate. Make it in real time
Joy Detector Led button lights up depending on how much a detected face is happyor not. Ex: happy face => yellow led. It works very well. What about multiple faces? It would be nice even to label detected emotions.
Face Camera Trigger It takes a picture as soon as a face is detected Pretty clever. However, it does not recognize a face if the person is a little bit turned from behind. ---
Leds example
Face Detection Given an input image, this script face detection confidence and joy score. Ex: ./face_detection.py --input faces.jpg face_score=0.987305, joy_score=0.003976, bbox=(722.0, 152.0, 672.0, 672.0) Multiple faces are allowed. For each detected face the algorithm returns: 1.face_score, 2.joy_score, 3.bbox
Face Detection Camera Framing faces through camera Multple faces are allowed. However, the frame is lagging a little bit. Improve the FPS rate or even test tinyYOLO
MobileNet base classifier
Object detection The object detection demo takes an image and checks whether it’s a cat, dog, or person. It does not work on very simple tasks. Given an image with a god and a cat I got only one dog with 70% accuracy. See /img folder for the images with whom I tried. Extended this demo to other objects. This can be good for fining tuning and specific scenarios, eg: person detection on unmanned vechicles.
Nature explorer Based on a subset of Visipedia. It can recognize insects, plants and bird. This seems to be the only model based on MobileNet v1, input=192