Differences in memory pre-allocation approaches.
5 views (last 30 days)
Show older comments
Why should we prefer one pre-allocation approach over the other?
I have compiled three different approaches that apparently pre-allocates memory in MATLAB.
clear all
clc
N = 2^28;
tStart = tic;
% Approach #1:
%X = zeros(N,1);
%Approach #2:
% X = [];
% X(N) = 0;
%Approach #3:
%X = [];
%X(N, 1) = 0;
%TEST CODE
%Code that uses the previous pre-allocation approaches.
%for i = 1:N
%
% X(i) = i;
%
% end
tStop = toc(tStart)
Observations (All approaches equally speed up the test code.):
Approach #1: When I use this pre-allocation approach, MATLAB effectively reserves the memory, and this memory usage shows up on the Task Manager. The task manager shows MATLAB is using 2.50 GB before and after executing the test code.
Approach #2: This approach has the same behavior as approach #1.
Approach #3: When I use this pre-allocation approach, before executing the test code, the Task Manager shows MATLAB is using only about 500MB of memory, but after executing the test code it uses 2.50 GB, as seen on approach #1 and #2.
Does it mean that Approach #3 is more memory efficient than the other two approaches? It seems as if Approach #3 is pre-allocating memory, but it is not using it until required. Thanks!
3 Comments
James Tursa
on 27 May 2013
I'm curious ... what behavior do you get if you use this mex routine for the pre-allocation?
Cedric
on 27 May 2013
I'll have to make a few tests about #3, but Yair discusses #1 and #2 in http://undocumentedmatlab.com/blog/preallocation-performance/ (and associated threads).
Answers (2)
per isakson
on 27 May 2013
Some comments
- The behavior of Matlab may change from one release to the next. Thus, always state which release.
- With R2012a #1 allocates memory and assigns the value 0 to all elements. #2 and #3 takes a "lazy" approach. Matlab "reserves" memory internally and only the last element is assigned a value. The memory is not used until values are assign by your for-loop.
- Try tic, ..., toc the three approaches and I think you will see that #2 and #3 are much faster the #1.
1 Comment
Greg
on 18 Jan 2017
I know this is a really old thread, but I'm not comfortable with the implication of this answer. Methods #2 and #3 probably are faster at the pre-allocation, but the subsequent for loop will likely be much slower because MATLAB hasn't fully allocated that memory. This holds with my experience (sorry, I don't have any concrete examples to offer). I always suggest using zeros/ones/nan/inf/true/false functions to preallocate arrays if for no other reason than it's clear and obvious.
Jan
on 18 Jan 2017
For real world applications the actual work is the same: Either Matlab can reserve the memory lazily at first or allocate zeroed memory directly. At least since the data are written to the array, the explicit allocation must have happend. Therefore measuring the run time should include the process of writing.
0 Comments
See Also
Categories
Find more on Performance and Memory in Help Center and File Exchange
Products
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!