Core idea The learning rate $η$ trades off stability vs speed; bad $η$ looks like “the model is broken.”