Birju Patel

MathWorks

Last seen: 3 days ago | Active since 2014

Followers: 0 Following: 0

Computer Vision System Toolbox developer working on deep learning and computer vision.

Feeds

Answered
How to join or merge two Image Data Stores?
The best way to do this is to use the datastore combine method: https://www.mathworks.com/help/matlab/ref/matlab.io.datastore.c...

9 months ago | 1

Answered
Mask RCNN custom data training. Problem with JSON to mat conversion
You do not need to convert ground truth exported from Image Labeler into the COCO format in order to train Mask R-CNN. The way the ...

9 months ago | 0

Answered
Can we fine-tune Segment Anything Model (SAM) in MATLAB?
As of R2025b, fine-tuning SAM is not supported. This will be considered for future releases. You mentioned you were interested ...

9 months ago | 1

Answered
How to generate an ellipse using the insertShape Function?
Support for drawing an ellipse is available in R2024b: https://www.mathworks.com/help/vision/ref/insertshape.html

1 year ago | 0

Answered
OCR (optical character recognition) misreading simple numbers even when image is pre-processed. What's the issue?
When giving an ROI around a word, setting LayoutAnalysis to "word" can help. Here are the results I get in R2024b: >> txt = o...

2 years ago | 1 | accepted

Answered
augmentedImageDatastore for image segmentation
I recommend combining imageDatastore and pixelLabelDatastore and then using a transform to implement data augmentation for seman...

2 years ago | 0

Answered
how to plot features of resnet-50 when input given is image
Network features are usually high-dimensional vectors so one way to visualize them is to use t-SNE: https://www.mathworks.com/...

3 years ago | 0 | accepted

Answered
I want to insert rectangle shape to the real time image
You can also use the Draw Shapes block in Simulink: https://www.mathworks.com/help/vision/ref/drawshapes.html

3 years ago | 0

Answered
train cascade detector to detect more than one shape at the same time
It looks like you are trying to train a multi-class object detector. The cascade object detector is a single class detector. It ...

3 years ago | 0

Answered
Why that number of anchor boxes?
There isn't any rhyme or reason for these values. The examples need to be updated to provide more details on how to choose ancho...

3 years ago | 0 | accepted

Answered
How to show labels names?
Any object detector that supports detecting multiple classes will return the labels as a third output argument: [bboxes,scores,l...

3 years ago | 0

Answered
3-D Scene Reconstruction from Uncalibrated Stereo
These two examples walk through the process of doing 3-D reconstruction from uncalibrated stereo images: https://www.mathworks....

3 years ago | 0

Answered
How do I use polygon labeling for an instance segmentation neural network?
For instance segmentation, you should first try Mask R-CNN via trainMaskRCNN: https://www.mathworks.com/help/vision/ref/trainma...

3 years ago | 0 | accepted

Answered
How do I directly convert a depth image to 3-D point cloud?
pcfromdepth has been added to Computer Vision Toolbox in R2022b: https://www.mathworks.com/help/vision/ref/pcfromdepth.html

3 years ago | 0

Answered
how to calculate IoU for semantic segmentation
You can start here: https://www.mathworks.com/help/vision/ref/evaluatesemanticsegmentation.html You can also use jaccard when ...

3 years ago | 0 | accepted

Answered
Error using semanticSegmentationMetrics The categorical data returned by dsResults and dsTruth must have the same categories.
Check the categories of the data coming out of pxdsResults and pxdsTruth: A = read(pxdsResults); categories(A{1}) B = read(px...

4 years ago | 0

Answered
Combining Multiple Ground Truths
You don't need to combine the groundTruth objects. Use objectDetectorTrainingData to extract training data from multiple groundT...

4 years ago | 0

Answered
FCN code giving odd results
The fcnLayers function returns a network with ImageNet-trained weights. When you generate code from Deep Network Designer, mak...

4 years ago | 0 | accepted

Answered
Computer block when use YOLO4 ?
My guess is your GPU is driving your display too and somehow it has stalled. Or it could be that your GPU drew too much power an...

4 years ago | 0

Answered
Segmentation algorithm not giving correct output
Generally, FCN, U-Net, and SegNet are different architectures that require their own set of training options to produce optimal ...

4 years ago | 0

Answered
Why does training a YOLO v2 detector on the same images give two different results when trained in one go versus via checkpoint?
When you train from a checkpoint, you are resuming or continuing the training. If you continue to train the detector for more it...

4 years ago | 0

Answered
Image Labeler automation algorithm
You can create an automation algorithm for the Image Labeler app: https://www.mathworks.com/help/vision/ug/create-automation-al...

4 years ago | 0

Answered
U-net for image segmentation
The network you pointed to was trained in Caffe. You can use importCaffeNetwork to import this pretrained U-Net network: https:...

4 years ago | 0

Answered
Apply semanticseg to multiple images
To apply semanticseg to images in a folder, you can pass in an imageDatastore to the semanticseg function: imds = imageData...

4 years ago | 0

Answered
Export groundTruth as single png image
As you noticed, you will have to use pixel labels instead of polygons to get to a label matrix directly from the image or video ...

4 years ago | 1 | accepted

Answered
How to extract feature vector using CNN and how to extract one particular image feature values from the extracted feature ?
This example should help: https://www.mathworks.com/help/deeplearning/ug/extract-image-features-using-pretrained-network.html ...

4 years ago | 1 | accepted

Answered
How to fuse the HOG and LBP features for a given set of images ?
The easiest method to fuse HOG and LBP is to simply concatenate them into one long feature vector: hog = extractHOGFeatures(......

4 years ago | 0 | accepted

Answered
fixing intrinsics during stereoCalibration (during R,T refinement)
You can use the estimateStereoBaseline function to estimate the translation and rotation between two cameras given fixed intrins...

4 years ago | 0

Answered
Reconstructing 3D from two stereo images
Hi, You will need to calibrate the stereo camera used to capture your images. In the code you posted, you're using calibration...

4 years ago | 0 | accepted

Answered
SFM 3D model
Please see this example for structure from motion (SfM) from multiple images: https://www.mathworks.com/help/vision/ug/structure...

4 years ago | 0