Human markup vs. SemanticJuice content extraction algorithm
https://www.semanticjuice.com/rd/data-robert/google-news/raw/1db34a39-2057-46dc-939f-10f491af9773.html
Examples provided by Tomaž Kovačič.
Semantic Juice.