Why hasn't longer-horizon training slowed AI progress?
Longer training horizons have not slowed AI progress because modern architectures (like transformers) and techniques such as reward shaping distribute gradients effectively, making credit assignment over long episodes tractable without exponential difficulty increases.