My question was: nestrov accelerated gradient descent is represented as having iteration complexity of o() and convergence rate of . Are these the same?
A) YES, depends on what is fixed.
- error after k iterationn (convergence rate): o(1/k^2), o(1/k),o(log(1/k))
- iteration needed for error k (iteration complexity): o(log(1/k)), , o(1/k), o(1/k^2)
above is the order of fast algorithm. I’m not sure whether the expression o(log(1/k)) which is an linear algorithm for convergence rate exists in iteration complexity context as well. I guess not.
ref: convergence rate and its comparison with iteration complexity (below)
Comment is the energy for a writer, thanks!