These two illustrations make the case.
Here's a word cloud of Nick's original speech on the video.
And now here's the same kind of cloud, but for the music video.
The first isn't dominated by any single message. There are a ton of words which share equal import, so it's hard to take a single message out.
The second is dominated by two words: I'm Sorry.
That's why, as a piece of messaging, it works. That's why it's clear. And that's why we should play it over and over again.
It's also worth noting that the word Sorry barely features in the first cloud.
The Poke have done us a huge favour.