Quantcast
Channel: how to check uniqueness (non duplication) of a post in an rss feed - Stack Overflow
Viewing all articles
Browse latest Browse all 4

Answer by Jagira for how to check uniqueness (non duplication) of a post in an rss feed

$
0
0

Take a look at the clustering algorithms used Google news. Though your requirements are not that high, but they are vaguely related to what Google news does - They cluster stories about same event from different sources into one group. They use high level algorithms combined with NLP. But you can start with mapping the keywords in title and url.


Viewing all articles
Browse latest Browse all 4

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>