In my experience stakeholders are usually willing to appoint someone to do it but they want to know how much footage they need to label and whether their team will need special training to do the labelling and after it's all done is this even going to work? At this point I figure I should train it with my own dataset, and so I guess I need to arrange to have this stuff labelled. ![]() At some point I try a bunch of models and all of them are at best 75% good. Where do I go from here? Keep trying different models? There are all sorts of models like YOLO and SSD and RetinaNet and YOLO2 and YOLO3. I get about 60% accuracy - this is no good. After about 30 minutes of fiddling and googling errors, I run it against the sample footage I find some pre-trained model that is able to do people detection or face detection and return bounding rectangles and download it in whatever form I ask for some sample footage to build a prototype and get a few very poor quality videos, at a very different standard from what I see in most of these tutorials. ![]() Some non-technical stakeholder comes to me and says "can we solve this problem with Machine Learning?" usually it's something like "there need to be two supervisors on the factory floor at all times, and I want an email alert everytime there are less than 2 supervisors for more than 20 minutes" As an engineer I find myself in this type of situation quite often - if anyone can point me to some good resources or has any advice, I'd be quite grateful:
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |