How to detect multiple objects in a video file and label these objects with bounding boxes.

Hi, I want to know how I can build my own object detector on top of an existing convolutional neural network such as alexnet, and apply it to a video file so that the output shows bounding boxes around the detected objects.
What are good resources for learning how to do this?

Accepted Answer

Image Analyst on 1 Jan 2019
Watch Joe Hicklin's demo video, which does exactly that: MathWorks Deep Learning in 11 lines of code
  3 Comments
Image Analyst on 2 Jan 2019
Well, if all you need to do is detect an object, rather than figure out what it is, then it might be simpler. It might not even need a CNN. You might be able to get by with simple thresholding. Who knows, because you didn't upload an image.
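To make the thresholding idea concrete, here is a minimal sketch of detection without a CNN: threshold one grayscale frame, group the bright pixels into connected regions, and report one bounding box per region. It is written in plain Python purely for illustration and is not code from this thread; the function name `detect_bounding_boxes`, the threshold of 128, and the tiny test frame are all assumptions. In MATLAB, the corresponding step would typically rely on `regionprops` with the `'BoundingBox'` property.

```python
from collections import deque


def detect_bounding_boxes(frame, threshold):
    """Return one (x, y, width, height) box per bright connected region.

    `frame` is a 2-D list of grayscale values (hypothetical input format).
    Pixels strictly above `threshold` count as foreground.
    """
    rows, cols = len(frame), len(frame[0])
    visited = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if visited[r][c] or frame[r][c] <= threshold:
                continue
            # Breadth-first flood fill over 4-connected neighbours,
            # tracking the extent of the region as it grows.
            queue = deque([(r, c)])
            visited[r][c] = True
            top, bottom, left, right = r, r, c, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not visited[ny][nx]
                            and frame[ny][nx] > threshold):
                        visited[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((left, top, right - left + 1, bottom - top + 1))
    return boxes


# Made-up 6x6 frame containing two bright objects on a dark background.
frame = [
    [0,   0,   0, 0,   0,   0],
    [0, 200, 200, 0,   0,   0],
    [0, 200, 200, 0,   0,   0],
    [0,   0,   0, 0,   0,   0],
    [0,   0,   0, 0, 220, 220],
    [0,   0,   0, 0,   0, 220],
]
boxes = detect_bounding_boxes(frame, threshold=128)
print(boxes)  # [(1, 1, 2, 2), (4, 4, 2, 2)]
```

For a video, the same function would be called once per frame. This only works when the objects separate cleanly from the background by brightness, and it says nothing about what each object is.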
Jack Zimmerman on 2 Jan 2019
Edited: Jack Zimmerman on 2 Jan 2019
Like I said, any object would be fine. I simply want to understand how to get to the point where I can pass in a video file and detect multiple objects and their bounding boxes using pre-trained CNNs like alexnet.
Simple classification isn't enough. I want something like the documentation seen here:
However, it should not require an ROI of the image as input; it should create its own bounding boxes.
