Optimization of matrices with random initialization

Hello, everyone
I want to optimize a function where the pi are the optimization variables, matrices of different sizes (each pi_bar has the same size as the corresponding pi).
I reshape all the pi into a single column vector so that I can use fminunc to solve the problem.
The problem is unconstrained, but yi is updated using the pi. The algorithm also involves some random initialization (using randn) at the beginning for some variables.
pi_bar and yi_bar are already known.
Case 1: I run the optimization and it returns different values each time, which is understandable since there is random initialization in the algorithm.
Case 2: I fix the random initialization, using rng for example. The solver then returns the error "maximum number of function evaluations has been exceeded", even if I set the limit very high (500000).
It seems the algorithm only finds a better point when there happens to be a better random initialization. Is there a better way to cope with random initialization in an optimization problem? And what could be the reason for Case 2?
Thanks a lot for any suggestion in advance!

 Accepted Answer

Make sure your objective function code does not contain any randomization steps. Your initial guess can be random, but the objective function itself needs to be deterministic. Aside from that, nothing can be diagnosed without seeing your code.
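As a sketch of what that separation might look like (the names myObjective, xp0, and the sizes here are illustrative, not from the question):

```matlab
% Do all randomization ONCE, before invoking the solver
rng(0);                          % fix the seed for reproducibility
xp0 = 0.1*randn(1,20);           % random but fixed initial state guess
x0  = randn(724,1);              % random but fixed initial decision vector

% The objective must be deterministic: same x in, same value out
fun = @(x) myObjective(x, xp0);  % xp0 is captured once, never re-drawn inside
x = fminunc(fun, x0, optimoptions('fminunc','Display','iter'));
```

With this arrangement, re-running the script reproduces the same result, and fminunc never sees a "moving target" objective.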

8 Comments

I used rng to initialize xp, thus it is random but deterministic
That's really not a good place for it, and definitely explains the first issue (case 1) that you mentioned in your post. All randomization should take place outside your objective function and prior to invoking the optimization.
LSTM_layer and FC_layer are just two functions that update y and xp
The meat of your objective is hidden from us, so there's not much we can say. You also haven't shown how you invoke the solver and with what initial guesses for x1...x8.
Basically, though, you have an objective function with arbitrarily chosen fixed parameters xp which could make the optimization very ill-posed. If your initial guess is similarly arbitrary, you could also be very far away from the solution. Both are reasons to think convergence could take a very long time and lots of iterations.
Thanks for the answer, I will move the initialization of xp to before the optimization process is invoked
This is the LSTM_layer
function [y,xp] = LSTM_layer(u,x0,p_Wfio,p_Ufio,p_bfio,p_Wc,p_Uc,p_bc)
% One LSTM step: x0 = [h0, xi0] holds the previous cell and hidden
% states; u is the input. Returns output y and updated state xp.
h0  = x0(:,1:10);    % previous cell state
xi0 = x0(:,11:20);   % previous hidden state
% forget/input/output gates from one concatenated affine map
a = sigmoid_(u*p_Wfio + xi0*p_Ufio + p_bfio);
f = a(:,1:10);       % forget gate
i = a(:,11:20);      % input gate (note: shadows the imaginary unit)
o = a(:,21:30);      % output gate
c = tanh(u*p_Wc + xi0*p_Uc + p_bc);   % candidate cell update
hp  = f.*h0 + i.*c;  % new cell state
xip = o.*tanh(hp);   % new hidden state
xp  = [hp,xip];
y   = xi0;           % output is the previous hidden state
end
and this is FC_layer
function y = FC_layer(y,p_weight,p_bias)
% Fully connected (dense) layer: affine map y -> y*W' + b
y = y*p_weight' + p_bias;
end
Both work the same way as the LSTM layer and fully connected layer in an RNN.
I use fminunc with initial guess x0, which contains the trained parameters from an LSTM network (already trained on a large dataset):
x = fminunc(fun,x0,options)
If I understand you correctly, the major problem here would be the initial guess of xp.
Well, at the very least I don't understand why it's random. If you need to "guess" xp, why isn't it being treated as an unknown parameter?
xp are the (hidden and cell) states of the LSTM network. Normally they are initialized to zero or randomly with zero mean when training an LSTM. They are supposed to store the information of the past data points (from the same sequence), and for a new sequence there is no "past information". That's why I follow the practice from LSTM training and initialize it randomly.
Well, it is confusing to me that you would conduct training iterations inside your objective function. The whole purpose of using fminunc here (or so I thought) is so that fminunc would do the iterative parameter estimation for you. It's as if you have an optimization inside an optimization.
Sorry, I did not explain it clearly. The loop inside the objective function is for formulating the second part of the objective function, which involves a sequence of 1000 data points.
That doesn't clarify why you are not optimizing xp. Surely the accuracy of the prediction depends jointly on xp and the other parameters. If you change x1...x8, your prediction can become worse if you don't change xp as well.
Also, you say that your initial guess of x1...x8 came from a previously trained LSTM. Why not use the xp from that network as well (regardless of whether xp is treated as an unknown or not)?
Thanks for the comment!
I think treating xp as an optimization variable is a nice idea to eliminate the randomness in the process, although it still returns the same error "exceeds max number of function evaluations". I think the initial guess of the optimization variables is still too far away from the optimal solution. If I choose fewer variables to optimize (e.g., only x1 and x2), the algorithm works fine.
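One way to treat xp as an unknown is to append it to the decision vector and unpack it inside the objective. A rough sketch, where xvec0 (the reshaped trained parameters x1...x8), xp0, and myObjective are illustrative names, not from the thread:

```matlab
% z stacks the reshaped trained parameters (xvec0) and the state
% guess (xp0) into a single decision vector for fminunc
z0 = [xvec0; xp0(:)];
nP = numel(xvec0);

% unpack inside the objective so both parts are optimized jointly
fun = @(z) myObjective(z(1:nP), reshape(z(nP+1:end), size(xp0)));
z = fminunc(fun, z0, options);
xp_opt = reshape(z(nP+1:end), size(xp0));   % optimized state estimate
```

The reshape on the way out recovers xp in its original matrix shape, so downstream code does not need to change.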


More Answers (1)

You would probably do well to use the Problem-Based Optimization Workflow. But you can just as easily change your current solution method to use a more efficient algorithm. The point is that lsqnonlin is the solver of choice for sum-of-squares problems: your objective function should return the vector of residuals, and lsqnonlin implicitly sums their squares and minimizes.
That said, I might be misunderstanding your problem. You said that your yi are functions of the pi, and I do not see that connection in your problem formulation. So I might have it wrong somehow.
In any case, see whether the problem-based formulation makes sense for you and whether it chooses a more efficient solver.
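For illustration, the lsqnonlin pattern looks roughly like this, where model, ydata, and p0 are placeholder names rather than anything from the question:

```matlab
% lsqnonlin expects the residual VECTOR, not its sum of squares;
% the solver forms and minimizes sum(res.^2) internally.
resfun = @(p) model(p) - ydata;   % one residual per data point
p = lsqnonlin(resfun, p0);
```

Because the solver sees the individual residuals, it can exploit the sum-of-squares structure (Gauss-Newton / Levenberg-Marquardt steps) instead of treating the objective as a black box, which is usually much more efficient than fminunc on such problems.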
Alan Weiss
MATLAB mathematical toolbox documentation

1 Comment

Hi Alan
thanks a lot for the answer!
Sorry, I did not state it clearly. The optimization variables in my algorithm are only x1...x8. I am actually using fminunc to solve my optimization. I have also tried problem-based optimization; it automatically chooses fminunc as the solver and returns the exact same error.
In my problem formulation, yi is actually a sequence and is calculated through the optimization variables and some additional variables (let's say xp).
If xp is randomly initialized, the algorithm only finds a better point when there happens to be a good random initialization.
I initialize the optimization variables with the previous best result (according to my algorithm, it is the best initialization available).
For xp, I used a good guess (at least in my opinion) to initialize it.
But it still throws the error "fminunc stopped because it exceeded the function evaluation limit"
I am new to optimization algorithms. Is it because there are no (sub)optimal points available near the initial point? Or is the algorithm too conservative to take large steps, so that it gets trapped?
I would also like to add that it is a relatively big optimization problem, with 8 different optimization variables (all matrices of different sizes) and 724 scalar optimization variables in total. Is fminunc suitable for problems with a large number of optimization variables?
I would appreciate any suggestions!
Jing


Release: R2021a
Asked: 19 Oct 2021
Commented: 21 Oct 2021
