You underestimate the complexity here by orders of magnitudes. You also overestimate the usefulness to news companies. You underestimate the harm that bad actors can take.
The search algorithm tells you the order of search results for a particular set of terms. Except that as input you need to feed it a graph of the entire indexed internet, which is re-indexed periodically as the content on the index changes. How does knowing that benefit new companies? What, exactly would your hypothetical full-time guy/team, equipped with that index at huge cost, tell their company that would justify the time and expense? That they should write interesting content that lots of people consume?
Second, the general approach has been published and is well documented [1], as are its susceptibilities to attack [2]. So there's your algorithm, what does it tell you?
Third, general SEO isn't the problem, it's coordinated attacks that can poison all search results / ads markets if enough detail is known. Google invests [3] heavily to address these areas [4].
Finally, you underestimate how much of a firehose you'd have to drink from. It describes all of the internet.
[1] http://infolab.stanford.edu/~backrub/google.html
[2] https://en.wikipedia.org/wiki/PageRank#Manipulating_PageRank
[3] https://www.quora.com/What-does-the-Counter-Abuse-Technology...
[4] https://www.blog.google/around-the-globe/google-europe/meet-...