Version 7.2, part of Release 2016b, includes the following enhancements:

  • Deep Learning for Object Detection: Detect objects using region-based convolutional neural networks (R-CNN)
  • Structure from Motion: Estimate the essential matrix and compute camera pose from 3-D to 2-D point correspondences
  • Point Cloud File I/O: Read and write PCD files using Point Cloud File I/O Functions
  • Code Generation for ARM Example: Detect and track faces on a Raspberry Pi 2 target
  • Visual Odometry Example: Estimate camera locations and trajectory from an ordered sequence of images

See the Release Notes for details.

Version 7.1, part of Release 2016a, includes the following enhancements:

  • OCR Trainer App: Train an optical character recognition (OCR) model to recognize a specific set of characters
  • Structure from Motion: Estimate the camera poses and 3-D structure of a scene from multiple images
  • Pedestrian Detection: Locate pedestrians in images and video using aggregate channel features (ACF)
  • Bundle Adjustment: Refine estimated locations of 3-D points and camera poses for the structure from motion (SFM) framework
  • Multiview Triangulation: Triangulate 3-D locations of points matched across multiple images

See the Release Notes for details.

Version 7.0, part of Release 2015b, includes the following enhancements:

  • 3-D Shape Fitting: Fit spheres, cylinders, and planes to 3-D point clouds using RANSAC
  • Streaming Point Cloud Viewer: Visualize streaming 3-D point cloud data from sensors such as the Microsoft Kinect
  • Point Cloud Normal Estimation: Estimate normal vectors of a 3-D point cloud
  • Farneback Optical Flow: Estimate optical flow vectors using the Farneback method
  • LBP Feature Extraction: Extract local binary pattern features from a grayscale image
  • Multilanguage Text Insertion: Insert text into image data, with support for multiple languages

See the Release Notes for details.

Version 6.2, part of Release 2015a, includes the following enhancements:

  • 3-D point cloud functions for registration, denoising, downsampling, geometric transformation, and PLY file reading and writing
  • Image search and retrieval using bag of visual words
  • User-defined feature extractor for bag-of-visual-words framework
  • C code generation for eight functions, including rectifyStereoImages and vision.DeployableVideoPlayer on Mac

See the Release Notes for details.

Version 6.1, part of Release 2014b, includes the following enhancements:

  • Stereo camera calibration app
  • imageSet class for handling large collections of image files
  • Bag-of-visual-words suite of functions for image category classification
  • Approximate nearest neighbor search method for fast feature matching
  • 3-D point cloud visualization function

See the Release Notes for details.