How to access data of regexp output (without temporary variable)?
9 views (last 30 days)
Show older comments
I know how to convert the output from regexp( ) to a table by intermediating through a temporary variable as this solution:
strT = ["apple 001"; "banana 102"; "orange 344 001"];
C = regexp(strT, '\s', 'split', 'once'); % temporary variable
array2table(vertcat(C{:,:})) % desired output, except I hope there's a way to avoid braces.
I wonder if there is a way to cascade/pipeline function calls in a single line. It seems that similar questions had been asked zillion times (this post said so.) But I want to learn more about Matlab, so forgive me for asking it again.
% I tried this, but it doesn't give the desired result.
cell2table(regexp(strT, '\s', 'split', 'once'))
0 Comments
Accepted Answer
Stephen23
on 26 Apr 2023
Edited: Stephen23
on 26 Apr 2023
The general answer to the question has not changed much since 2012.
Using a temporary variable is not "less efficient" as many beginners might imagine (most likely MATLAB will generate the intermediate cell array internally anyway).
str = ["apple 001"; "banana 102"; "orange 344 001"];
tbl = splitvars(cell2table(regexp(str, '\s', 'split', 'once')))
Note that this creates and discards an intermediate table, so uses more memory.
Personally I would recommend using the intermediate variable: while code golf is very entertaining, it does not serve the purpose of clarity and efficiency. In one year you will come back to this code any wonder what it does, how it works, and what side-effects it has. Code is read more times than it is written, so clarity when reading is valuable.
On top of that, hiding intermediate arrays within nested function call makes debugging harder.
More Answers (3)
albara
on 26 Apr 2023
Edited: per isakson
on 26 Apr 2023
You can access the data of the regexp output directly without using a temporary variable by using the syntax [~,~,~,match] = regexp(str, expr), where str is the input string and expr is the regular expression you want to match.
The regexp function returns multiple outputs, including the start and end indices of the match, any tokens captured by the regular expression, and the matched substring. By using the ~ symbol as a placeholder for the outputs you don't need, you can ignore them and only keep the matched substring.
Here is an example:
% Define input string and regular expression
str = 'The quick brown fox jumps over the lazy dog';
expr = 'brown';
% Use regexp to find the matching substring and access it directly
[~, ~, ~, match] = regexp(str, expr);
% Display the matching substring
disp(match)
In this example, regexp is used to find the first occurrence of the word "brown" in the input string str. The ~ symbol is used as a placeholder for the outputs we don't need, and the match variable is used to store the matching substring directly.
The output of this code will be the string 'brown'. Note that you can access any of the other outputs of regexp by using the corresponding placeholder variables, such as [start, end, tokens, match] = regexp(str, expr) to store the start and end indices and any tokens in separate variables.
3 Comments
Stephen23
on 13 May 2023
"Why we have different results from the same codes?"
Within a regular expression the meta-character \S only matches non-whitespace characters:
"orange" "344 001"
% ^ so there is no way it could return this
Rik
on 26 Apr 2023
If you insist on a single line, you can even use subsref to index the result of regexp. That will make your code hard to read and hard to modify.
Why don't you write a wrapper function you can call regexp2table like this:
strT = ["apple 001"; "banana 102"; "orange 344 001"];
regexp2table(strT, '\s', 'split', 'once') % Look, ma, one line
function tab=regexp2table(varargin)
% All inputs are piped to regexp().
C = regexp(varargin{:});
tab = array2table(vertcat(C{:}));
end
Dyuman Joshi
on 26 Apr 2023
Experimenting lead to this -
str = ["apple 001"; "banana 102 3579"; "orange 344 001"]
C = array2table([extractBefore(str,' ') extractAfter(str,' ')])
See Also
Categories
Find more on Data Type Conversion in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!