Subgradient Methods for the Lagrangian Dual
Maximize the Lagrangian dual function using subgradient methods. Covers computing subgradients of the projected subgradient update, and step size rules including the Polyak rule.
Tutorial
Subgradients of the Lagrangian Dual
Subgradients of the Lagrangian Dual
For a primal problem
dualizing the complicating constraints with multipliers produces the Lagrangian dual function
The function is concave -- it is the pointwise minimum of affine functions of -- but typically not differentiable: the inner minimizer jumps discretely as varies, so has kinks.
At such points we replace the gradient with a subgradient: a vector such that
Key fact. If is any optimal solution of the inner Lagrangian subproblem at then
is a subgradient of at
In words, the subgradient is the residual of the dualized constraint evaluated at A positive residual means the constraint is currently violated, so increasing locally raises
Illustration. If the dualized constraint is and the inner minimizer is then