bagOfFeatures
Bag of visual words object
Description
You can construct a bag of visual words for use in image category classification, and image retrieval.
Creation
Syntax
Description
bag = bagOfFeatures(
returns a
bag of features object. The imds
)bag
output object is generated
using samples from the imds
input. By default, the visual
vocabulary is created from SURF features extracted from images in
imds
.
bag = bagOfFeatures(
returns a bag of features that uses a custom feature extractor function to
extract features from images in imds
,'CustomExtractor
',extractorFcn
)imds
.
extractorFcn
is a function handle to a custom feature
extraction function.
bag = bagOfFeatures(
specifies options using one or more name-value arguments in addition to any
combination of arguments from previous syntaxes. For example, imds
,Name=Value
)bag =
bagOfFeatures(imds,Verbose=true)
additionally sets
Verbose
to true
.
This object supports parallel computing using multiple MATLAB® workers. Enable parallel computing from the Computer Vision Toolbox Preferences dialog box. To open Computer Vision Toolbox™ preferences, on the Home tab, in the Environment section, click Preferences. Then select Computer Vision Toolbox.
Input Arguments
imds
— Images
ImageDatastore
object
Images, specified as an ImageDatastore
object.
The bagOfFeatures
extracts an equal number of
strongest features from the images contained in the
imds
object. The number of strongest features
is defined as:
number of strongest features = min
(number of
features found in each set) x
StrongestFraction
The object obtains the StrongestFraction
value from
the StrongestFeatures
property.
extractorFcn
— Custom feature extractor function
function handle
Custom feature extractor function, specified as a function handle. This custom function extracts features to learn the visual vocabulary of the object.
The function, extractorFcn
, must be specified as
a function handle for a
file:
extractorFcn = @exampleBagOfFeaturesExtractor; bag = bagOfFeatures(imds,CustomExtractor=extractorFcn)
exampleBagOfFeaturesExtractor
is a MATLAB function. For
example:function [features,featureMetrics,location] = exampleBagOfFeaturesExtractor(img) ...
Argument | Input/Output | Description |
---|---|---|
img | Input |
|
features | Output |
|
featureMetrics | Output |
|
location | Output |
|
For more details on the custom extractor function and its input and output requirements, see Create a Custom Feature Extractor.
You can open an example function file, and use it as a template by typing the following command at the MATLAB command-line:
edit('exampleBagOfFeaturesExtractor.m')
Properties
CustomExtractor
— Custom extraction function
function handle
Custom feature extractor function, specified as a handle to a function. The custom feature
extractor function extracts features used to learn the visual vocabulary for
bagOfFeatures
. You must specify
'CustomExtractor
' and the function handle,
extractorFcn
, to a custom feature extraction
function.
The function, extractorFcn
, must be specified as a function handle for a
file:
extractorFcn = @exampleBagOfFeaturesExtractor; bag = bagOfFeatures(imds,CustomExtractor=extractorFcn)
exampleBagOfFeaturesExtractor
is a MATLAB function such
as:function [features,featureMetrics] = exampleBagOfFeaturesExtractor(img) ...
For more details on the custom extractor function and its input and output requirements, see Create a Custom Feature Extractor. You can open an example function file, and use it as a template by typing the following command at the MATLAB command-line:
edit('exampleBagOfFeaturesExtractor.m')
TreeProperties
— Vocabulary tree properties
[1 500]
(default) | two-element vector
Vocabulary tree properties, specified as a two-element vector in the form
[numLevels
branchingFactor]. numLevels is an
integer that specifies the number of levels in the vocabulary tree.
branchingFactor is an integer that specifies a factor
to control the amount the vocabulary can grow at successive levels in the
tree. The maximum number of visual words represented by the vocabulary tree
is
branchingFactor^
numLevels.
Typical values for numLevels is between
1
and 6
. Typical values for
branchingFactor is between 10
and
500
. Use an empirical analysis to select optimal
values.
Increase the branching factor to generate a larger vocabulary. Increasing the vocabulary improves classification and image retrieval accuracy, but will also increase the time to encode images. You can use a vocabulary tree with multiple levels to create vocabularies on the order of 10,000 visual words or more. A multilevel tree reduces the time required to encode images with large vocabularies, but will take longer to create. You can use a tree with one level for vocabularies that contain only 100 - 1000 visual words.
StrongestFeatures
— Fraction of strongest features
0.8
(default) | [0,1
]
Fraction of strongest features, specified as the comma-separated pair
consisting of 'StrongestFeatures
' and a value in the
range [0,1]. The value represents the fraction of strongest features to use
from each label in the imds
input.
Verbose
— Enable progress display to screen
true
(default) | false
Enable progress display to screen, specified as the comma-separated pair
consisting of 'Verbose
' and the logical
true
or false
.
PointSelection
— Selection method for picking point locations
"Grid"
(default) | "Detector"
Selection method for picking point locations for SURF feature extraction,
specified as the comma-separated pair consisting of
'PointSelection
' and either
"Grid"
or "Detector"
. There are
two stages for feature extraction. First, you select a method for picking
the point locations, (SURF "Detector"
or
"Grid"
), with the PointSelection
property. The second stage extracts the features. The feature extraction
uses a SURF extractor for both point selection methods.
When you set PointSelection
to
"Detector"
, the feature points are selected using a
speeded up robust feature (SURF) detector. Otherwise, the points are picked
on a predefined grid with spacing defined by
'GridStep
'. This property applies only when you are not
specifying a custom extractor with the CustomExtractor
property.
GridStep
— Grid step size
[8 8]
(default) | 1-by-2 [x y] vector
Grid step size in pixels, specified as the comma-separated pair consisting
of "Grid"
and an 1-by-2 [x
y] vector. This property applies only when you set
PointSelection
to "Grid"
and you
are not specifying a custom extractor with the
CustomExtractor
property. The steps in the
x and y directions define the
spacing of a uniform grid. Intersections of the grid lines define locations
for feature extraction.
BlockWidth
— Patch size to extract upright SURF descriptor
[32 64 96 128]
(default) | 1-by-N vector
Patch size to extract upright SURF descriptor, specified as the
comma-separated pair consisting of 'BlockWidth
' and a
1-by-N vector of N block widths.
This property applies only when you are not specifying a custom extractor
with the CustomExtractor
property. Each element of the
vector corresponds to the size of a square block from which the function
extracts upright SURF descriptors. Use multiple square sizes to extract
multiscale features. All the square specified are used for each extraction
points on the grid. This property only applies when you set
PointSelection
to "Grid"
. The
block width corresponds to the scale of the feature. The minimum
BlockWidth
is 32 pixels.
Upright
— Orientation of SURF feature vector
true
(default) | false
Orientation of SURF feature vector, specified as a logical scalar. This
property applies only when you are not specifying a custom extractor with
the CustomExtractor
property. Set this property to
true
when you do not need to estimate the orientation
of the SURF feature vectors. Set it to false
when you
need the image descriptors to capture rotation information.
Object Functions
encode | Create histogram of visual word occurrences |
Examples
Create a Bag of Visual Words
Create an image datastore.
setDir = fullfile(toolboxdir('vision'),'visiondata','imageSets'); imds = imageDatastore(setDir,'IncludeSubfolders',true,'LabelSource',... 'foldernames');
Create the bag of features. This process can take a few minutes.
bag = bagOfFeatures(imds);
Creating Bag-Of-Features. ------------------------- * Image category 1: books * Image category 2: cups * Selecting feature point locations using the Grid method. * Extracting SURF features from the selected feature point locations. ** The GridStep is [8 8] and the BlockWidth is [32 64 96 128]. * Extracting features from 12 images...done. Extracted 230400 features. * Keeping 80 percent of the strongest features from each category. * Creating a 500 word visual vocabulary. * Number of levels: 1 * Branching factor: 500 * Number of clustering steps: 1 * [Step 1/1] Clustering vocabulary level 1. * Number of features : 184320 * Number of clusters : 500 * Initializing cluster centers...100.00%. * Clustering...completed 54/100 iterations (~1.39 seconds/iteration)...converged in 54 iterations. * Finished creating Bag-Of-Features
Compute histogram of visual word occurrences for one of the images. Store the histogram as feature vector.
img = readimage(imds, 1); featureVector = encode(bag,img);
Encoding images using Bag-Of-Features. -------------------------------------- * Encoding an image...done.
Create a Bag of Features with a Custom Feature Extractor
Create an image datastore.
setDir = fullfile(toolboxdir('vision'),'visiondata','imageSets'); imds = imageDatastore(setDir,'IncludeSubfolders',true,'LabelSource',... 'foldernames');
Specify a custom feature extractor.
extractor = @exampleBagOfFeaturesExtractor;
bag = bagOfFeatures(imds,'CustomExtractor',extractor)
Creating Bag-Of-Features. ------------------------- * Image category 1: books * Image category 2: cups * Extracting features using a custom feature extraction function: exampleBagOfFeaturesExtractor. * Extracting features from 12 images...done. Extracted 230400 features. * Keeping 80 percent of the strongest features from each category. * Creating a 500 word visual vocabulary. * Number of levels: 1 * Branching factor: 500 * Number of clustering steps: 1 * [Step 1/1] Clustering vocabulary level 1. * Number of features : 184320 * Number of clusters : 500 * Initializing cluster centers...100.00%. * Clustering...completed 50/100 iterations (~1.41 seconds/iteration)...converged in 50 iterations. * Finished creating Bag-Of-Features
bag = bagOfFeatures with properties: CustomExtractor: @exampleBagOfFeaturesExtractor NumVisualWords: 500 TreeProperties: [1 500] StrongestFeatures: 0.8000
References
[1] Csurka, G., D. Christopher, F. Lixin, W. Jutta, and B. Cédric. "Visual categorization with bags of keypoints." Workshop on statistical learning in computer vision, ECCV, 2004, pp. 1-2.
[2] Nister, D., and H. Stewenius. "Scalable Recognition with a Vocabulary Tree." Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2006, vol. 2, pp. 2161–2168.
Extended Capabilities
C/C++ Code Generation
Generate C and C++ code using MATLAB® Coder™.
Usage notes and limitations:
For code generation, the bagOfFeatures
function does not
support the ImageDatastore
object as input. Instead, specify the
input as a structure with fields Images
and
Labels
. The Images
field must contain a
cell array of grayscale or RGB color images. The Labels
field
must contain image labels stored as categorical arrays.
Automatic Parallel Support
Accelerate code by automatically running computation in parallel using Parallel Computing Toolbox™.
To run in parallel, set 'UseParallel'
to true
or enable
this by default using the Computer Vision Toolbox preferences.
For more information, see Parallel Computing Toolbox Support.
Version History
Introduced in R2014b
See Also
Objects
Functions
MATLAB Command
You clicked a link that corresponds to this MATLAB command:
Run the command by entering it in the MATLAB Command Window. Web browsers do not support MATLAB commands.
Select a Web Site
Choose a web site to get translated content where available and see local events and offers. Based on your location, we recommend that you select: United States.
You can also select a web site from the following list
How to Get Best Site Performance
Select the China site (in Chinese or English) for best site performance. Other MathWorks country sites are not optimized for visits from your location.
Americas
- América Latina (Español)
- Canada (English)
- United States (English)
Europe
- Belgium (English)
- Denmark (English)
- Deutschland (Deutsch)
- España (Español)
- Finland (English)
- France (Français)
- Ireland (English)
- Italia (Italiano)
- Luxembourg (English)
- Netherlands (English)
- Norway (English)
- Österreich (Deutsch)
- Portugal (English)
- Sweden (English)
- Switzerland
- United Kingdom (English)
Asia Pacific
- Australia (English)
- India (English)
- New Zealand (English)
- 中国
- 日本Japanese (日本語)
- 한국Korean (한국어)