Subgradient Methods for the Lagrangian Dual

Maximize the Lagrangian dual function L(λ)=minxXcTx+λT(bAx)L(\lambda)=\min_{x\in X} c^T x + \lambda^T(b-Ax) using subgradient methods. Covers computing subgradients of L,L, the projected subgradient update, and step size rules including the Polyak rule.

Step 1 of 157%

Tutorial

Subgradients of the Lagrangian Dual

Subgradients of the Lagrangian Dual

For a primal problem

minxX  cTxs.t.Axb,\min_{x\in X}\; c^T x\quad\text{s.t.}\quad Ax \geq b,

dualizing the complicating constraints with multipliers λ0\lambda \geq 0 produces the Lagrangian dual function

L(λ)=minxX{cTx+λT(bAx)}.L(\lambda) = \min_{x \in X}\Big\{ c^T x + \lambda^T(b - Ax) \Big\}.

The function LL is concave -- it is the pointwise minimum of affine functions of λ\lambda -- but typically not differentiable: the inner minimizer x(λ)x(\lambda) jumps discretely as λ\lambda varies, so LL has kinks.

At such points we replace the gradient with a subgradient: a vector gg such that

L(μ)L(λ)+gT(μλ)for all μ.L(\mu) \leq L(\lambda) + g^T(\mu - \lambda)\quad\text{for all } \mu.

Key fact. If x(λ)x(\lambda) is any optimal solution of the inner Lagrangian subproblem at λ,\lambda, then

g(λ)  =  bAx(λ)g(\lambda) \;=\; b - A\,x(\lambda)

is a subgradient of LL at λ.\lambda.

In words, the subgradient is the residual of the dualized constraint evaluated at x(λ).x(\lambda). A positive residual means the constraint is currently violated, so increasing λ\lambda locally raises L.L.

Illustration. If the dualized constraint is x1+x2+x32x_1 + x_2 + x_3 \geq 2 and the inner minimizer is x(λ)=(1,0,0),x(\lambda) = (1, 0, 0), then

g(λ)=2(1+0+0)=1.g(\lambda) = 2 - (1 + 0 + 0) = 1.
navigate · Enter open · Esc close · ⌘K/Ctrl K toggle