Advanced Optimization

Posted by 晓语听风


Note: [7:35 - '100' should be 100 instead. The value provided should be an integer and not a character string.]

"Conjugate gradient", "BFGS", and "L-BFGS" are more sophisticated, faster ways to optimize θ that can be used instead of gradient descent. We suggest that you should not write these more sophisticated algorithms yourself (unless you are an expert in numerical computing) but use the libraries instead, as they\'re already tested and highly optimized. Octave provides them.

We first need to provide a function that evaluates the following two values for a given input θ: the cost J(θ) and its partial derivatives ∂/∂θ_j J(θ).

We can write a single function that returns both of these:

function [jVal, gradient] = costFunction(theta)
  jVal = [...code to compute J(theta)...];
  gradient = [...code to compute derivative of J(theta)...];
end
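
To make this concrete, here is a minimal sketch of such a cost function for a toy objective, J(θ) = (θ₁ - 5)² + (θ₂ - 5)², whose minimum is at θ = (5, 5). This particular objective is only an illustration and is not part of the skeleton above.

% Toy objective (illustrative assumption): J(theta) = (theta(1)-5)^2 + (theta(2)-5)^2
function [jVal, gradient] = costFunction(theta)
  jVal = (theta(1) - 5)^2 + (theta(2) - 5)^2;   % value of J(theta)
  gradient = zeros(2, 1);                       % column vector of partial derivatives
  gradient(1) = 2 * (theta(1) - 5);             % dJ/dtheta_1
  gradient(2) = 2 * (theta(2) - 5);             % dJ/dtheta_2
end

Minimizing this with fminunc should drive optTheta toward [5; 5] and functionVal toward 0.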

Then we can use Octave's "fminunc()" optimization function along with the "optimset()" function, which creates an object containing the options we want to send to "fminunc()". (Note: the value for MaxIter should be an integer, not a character string - errata in the video at 7:30)

options = optimset('GradObj', 'on', 'MaxIter', 100);
initialTheta = zeros(2,1);
[optTheta, functionVal, exitFlag] = fminunc(@costFunction, initialTheta, options);

We give the function "fminunc()" our cost function, our initial vector of theta values, and the "options" object that we created beforehand.
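
In the Week 3 logistic regression setting, the cost function also needs the training data, so a common pattern is to wrap it in an anonymous function of θ alone. The sketch below is illustrative and not part of the original notes: it assumes X is the design matrix with a leading column of ones, y is the vector of 0/1 labels, and that sigmoid and costFunction would each be saved in their own .m files.

% Assumed helper: the logistic (sigmoid) function.
function g = sigmoid(z)
  g = 1 ./ (1 + exp(-z));
end

% Cost and gradient for unregularized logistic regression.
% X: m x (n+1) design matrix with a leading column of ones (assumed).
% y: m x 1 vector of 0/1 labels (assumed).
function [jVal, gradient] = costFunction(theta, X, y)
  m = length(y);
  h = sigmoid(X * theta);                                 % hypothesis for all m examples
  jVal = (1 / m) * (-y' * log(h) - (1 - y)' * log(1 - h)); % J(theta)
  gradient = (1 / m) * (X' * (h - y));                     % partial derivatives of J(theta)
end

% Wrap costFunction so fminunc sees a function of theta only.
options = optimset('GradObj', 'on', 'MaxIter', 100);
initialTheta = zeros(size(X, 2), 1);
[optTheta, functionVal, exitFlag] = fminunc(@(t) costFunction(t, X, y), initialTheta, options);

fminunc then returns the optimized θ in optTheta, the final cost in functionVal, and an exitFlag indicating whether it converged.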
