Also, with videos like "what X said about situation Y in discourse Z": sometimes you're just curious, and you can't realistically extract that efficiently from a full one-hour speech on a geo-locked, untranscribed mass-media website, so it's easier to summarize the transcript of the 12-minute video directly.
As for why everything is 12 minutes long, it's most likely because content creation isn't optimized to teach you anything or be useful; it's optimized to maximize watch time so platforms can serve you more ads. The pattern is: I got you intrigued by something. You want the answer? Pay me with your time.