Visual SLAM

Real-time visual localization and mapping (vSLAM) with monocular, RGB-D, or stereo cameras and inertial sensor fusion with deployment support

Visual simultaneous localization and mapping (vSLAM) is the process of estimating the position and orientation of a camera while simultaneously building a map of its environment using visual inputs. Computer Vision Toolbox™ supports vSLAM workflows for monocular, RGB-D, and stereo cameras, with optional inertial sensor fusion for improved accuracy. These capabilities are essential for applications in robotics, augmented reality, and autonomous navigation. For guidance on choosing a vSLAM workflow, see Choose SLAM Workflow Based on Sensor Data.

Each visual SLAM object—monovslam, rgbdvslam, and stereovslam—provides ready-to-use tools to add frames, track keyframes, compute 3-D map points, estimate camera poses, close loops, and visualize data throughout the camera trajectory. You can also evaluate the performance of the vSLAM algorithm by comparing the estimated camera trajectory to the ground truth using the compareTrajectories function. The toolbox also provides functionality for building your own visual SLAM pipeline.

You can use the toolbox to perform code generation and deployment of vSLAM algorithms. For more information, see Build and Deploy Visual SLAM Algorithm with ROS in MATLAB and Performant and Deployable Monocular Visual SLAM.

Functions

expand all

Monocular Visual SLAM

`monovslam`	Visual simultaneous localization and mapping (vSLAM) and visual-inertial sensor fusion with monocular camera (Since R2023b)
`addFrame`	Add image frame to visual SLAM object (Since R2023b)
`hasNewKeyFrame`	Check if new key frame added in visual SLAM object (Since R2023b)
`checkStatus`	Check status of visual SLAM object (Since R2023b)
`isDone`	End-of-file status (logical)
`mapPoints`	Build 3-D map of world points (Since R2023b)
`poses`	Absolute camera poses of key frames (Since R2023b)
`plot`	Plot 3-D map points and estimated camera trajectory in visual SLAM (Since R2023b)
`reset`	Reset visual SLAM object (Since R2023b)

RGB-D Visual SLAM

`rgbdvslam`	Feature-based visual simultaneous localization and mapping (vSLAM) and visual-inertial sensor fusion with RGB-D camera (Since R2024a)
`addFrame`	Add pair of color and depth images to RGB-D visual SLAM object (Since R2024a)
`hasNewKeyFrame`	Check if new key frame added in RGB-D visual SLAM object (Since R2024a)
`checkStatus`	Check status of visual RGB-D SLAM object (Since R2024a)
`isDone`	End-of-processing status for RGB-D visual SLAM object (Since R2024a)
`mapPoints`	Build 3-D map of world points from RGB-D vSLAM object (Since R2024a)
`poses`	Absolute camera poses of RGB-D vSLAM key frames (Since R2024a)
`plot`	Plot 3-D map points and estimated camera trajectory in RGB-D visual SLAM (Since R2024a)
`reset`	Reset RGB-D visual SLAM object (Since R2024a)

Stereo Visual SLAM

`stereovslam`	Feature-based visual simultaneous localization and mapping (vSLAM) and visual-inertial sensor fusion with stereo camera (Since R2024a)
`addFrame`	Add pair of color and depth images to stereo visual SLAM object (Since R2024a)
`hasNewKeyFrame`	Check if new key frame added in stereo visual SLAM object (Since R2024a)
`checkStatus`	Check status of stereo visual SLAM object (Since R2024a)
`isDone`	End-of-processing status for stereo visual SLAM object (Since R2024a)
`mapPoints`	Build 3-D map of world points from stereo vSLAM object (Since R2024a)
`poses`	Absolute camera poses of stereo key frames (Since R2024a)
`plot`	Plot 3-D map points and estimated camera trajectory in stereo visual SLAM (Since R2024a)
`reset`	Reset stereo visual SLAM object (Since R2024a)

Evaluate Results

`compareTrajectories`	Compare estimated trajectory against ground truth (Since R2024b)
`trajectoryErrorMetrics`	Store accuracy metrics for trajectories (Since R2024b)

Visualize Results

`imshow`	Display image
`showMatchedFeatures`	Display corresponding feature points
`plot`	Plot image view set views and connections
`plotCamera`	Plot camera in 3-D coordinates
`pcshow`	Plot 3-D point cloud
`pcplayer`	Visualize streaming 3-D point cloud data

Build Your Own Visual SLAM Pipeline

Detect, Extract, and Match Features

`detectSURFFeatures`	Detect SURF features
`detectORBFeatures`	Detect ORB keypoints
`extractFeatures`	Extract interest point descriptors
`matchFeatures`	Find matching features
`matchFeaturesInRadius`	Find matching features within specified radius

Reconstruct 3-D Structure

`triangulate`	3-D locations of undistorted matching points in stereo images
`img2world2d`	Determine world coordinates of image points (Since R2022b)
`world2img`	Project world points into image (Since R2022b)

Estimate Motion

`estgeotform2d`	Estimate 2-D geometric transformation from matching point pairs (Since R2022b)
`estgeotform3d`	Estimate 3-D geometric transformation from matching point pairs (Since R2022b)
`estimateFundamentalMatrix`	Estimate fundamental matrix from corresponding points in stereo images
`estworldpose`	Estimate camera pose from 3-D to 2-D point correspondences (Since R2022b)
`findWorldPointsInView`	Find world points observed in view
`findWorldPointsInTracks`	Find world points that correspond to point tracks
`estrelpose`	Calculate relative rotation and translation between camera poses (Since R2022b)

Optimize Motion and 3-D Structure

`optimizePoses`	Optimize absolute poses using relative pose constraints
`createPoseGraph`	Create pose graph
`bundleAdjustment`	Adjust collection of 3-D points and camera poses
`bundleAdjustmentMotion`	Adjust collection of 3-D points and camera poses using motion-only bundle adjustment
`bundleAdjustmentStructure`	Refine 3-D points using structure-only bundle adjustment

Loop Closure

`bagOfFeatures`	Bag of visual words object
`bagOfFeaturesDBoW`	Bag of visual words using DBoW2 library (Since R2024b)
`dbowLoopDetector`	Detect loop closure using visual features (Since R2024b)
`indexImages`	Create image search index
`invertedImageIndex`	Search index that maps visual words to images

Manage Data

`imageviewset`	Manage data for structure-from-motion, visual odometry, and visual SLAM
`worldpointset`	Manage 3-D to 2-D point correspondences

Transformations

`se3`	SE(3) homogeneous transformation (Since R2026a)
`so3`	SO(3) rotation (Since R2026a)

Topics

Ready-To-Use Visual SLAM Functions

Performant and Deployable Monocular Visual SLAM
Use visual inputs from a camera to perform vSLAM and generate multi-threaded C/C++ code.
Performant Monocular Visual-Inertial SLAM
Use visual inputs from a camera and positional data from an IMU to perform viSLAM in real time. (Since R2025a)
Choose SLAM Workflow Based on Sensor Data
Choose the right simultaneous localization and mapping (SLAM) workflow and find topics, examples, and supported features.
How to Improve Accuracy in Visual SLAM
Tips to improve the accuracy, robustness, and efficiency of your visual SLAM system.

Build Your Own Visual SLAM Pipeline

Monocular Visual Simultaneous Localization and Mapping
Visual simultaneous localization and mapping (vSLAM).
Monocular Visual-Inertial SLAM
Perform SLAM by combining images captured by a monocular camera with measurements from an IMU sensor.
Stereo Visual Simultaneous Localization and Mapping
Process image data from a stereo camera to build a map of an outdoor environment and estimate the trajectory of the camera.

Featured Examples

Performant and Deployable Monocular Visual SLAM

Use visual inputs from a camera to perform vSLAM and generate multi-threaded C/C++ code.

Open Live Script

Performant Monocular Visual-Inertial SLAM

Use visual inputs from a camera and positional data from an IMU to perform viSLAM in real time.

Since R2025a
Open Live Script

Performant and Deployable Stereo Visual SLAM with Fisheye Images

Use fisheye image data from a stereo camera to perform VSLAM and generate multi-threaded C/C++ code.

Open Live Script

Build and Deploy Visual SLAM Algorithm with ROS in MATLAB

Implement and generate C ++ code for a vSLAM algorithm that estimates poses for the TUM RGB-D Benchmark and deploy as an ROS node to a remote device.

Open Live Script

Simulate RGB-D Visual SLAM System with Cosimulation in Gazebo and Simulink

Simulates an RGB-D visual simultaneous localization and mapping (SLAM) system to estimate the camera poses using data from a mobile robot in Gazebo.

(ROS Toolbox)

Since R2024b

Stereo Visual SLAM for UAV Navigation in 3D Simulation

Develop a visual SLAM algorithm for a UAV equipped with a stereo camera.

Open Live Script

Monocular Visual Odometry

Determine location and orientation of a camera by analyzing a sequence of images.

Open Live Script

Develop Visual SLAM Algorithm Using Unreal Engine Simulation

Develop a visual simultaneous localization and mapping (SLAM) algorithm using image data from the Unreal Engine^® simulation environment.

(Automated Driving Toolbox)

New

Estimate Camera-to-IMU Transformation Using Extrinsic Calibration

Estimate SE(3) transformation to define spatial relationship between camera and IMU.

Since R2026a
Open Live Script

Visual Localization in a Parking Lot

Develop a visual localization system using synthetic image data from the Unreal Engine® simulation environment.

Open Live Script