Take a look at the clustering algorithms used by Google News. Your requirements are not as demanding, but they are loosely related to what Google News does: it clusters stories about the same event from different sources into one group. It uses advanced algorithms combined with NLP, but you can start by matching the keywords in the title and URL.
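As a rough starting point, the keyword-matching idea could look something like the sketch below. This is not how Google News does it; it is a minimal greedy approach that I am assuming for illustration. The names `keywords`, `jaccard`, `cluster`, the `STOPWORDS` list, and the `0.3` threshold are all made up here and would need tuning on real data.

```python
import re
from urllib.parse import urlparse

# Hypothetical stop-word list (an assumption); extend it for real data.
STOPWORDS = {"the", "and", "for", "with", "from", "www", "com", "html"}


def keywords(title, url):
    """Return lowercased word tokens from the title and the URL path."""
    path = urlparse(url).path
    tokens = re.findall(r"[a-z0-9]+", (title + " " + path).lower())
    # Drop stop words and very short tokens such as "by" or "of".
    return {t for t in tokens if t not in STOPWORDS and len(t) > 2}


def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b|."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster(items, threshold=0.3):
    """Greedy single-pass clustering of (title, url) pairs.

    Each item joins the first cluster whose representative (the keyword
    set of that cluster's first item) is similar enough; otherwise it
    starts a new cluster.
    """
    clusters = []  # list of (representative_keywords, members)
    for title, url in items:
        kw = keywords(title, url)
        for rep, members in clusters:
            if jaccard(kw, rep) >= threshold:
                members.append((title, url))
                break
        else:
            clusters.append((kw, [(title, url)]))
    return [members for _, members in clusters]


if __name__ == "__main__":
    stories = [
        ("Apple unveils new iPhone", "http://a.example/tech/apple-unveils-new-iphone"),
        ("New iPhone unveiled by Apple", "http://b.example/news/apple-new-iphone-launch"),
        ("Stock markets fall sharply", "http://c.example/business/stock-markets-fall"),
    ]
    for group in cluster(stories):
        print([title for title, _ in group])
```

Comparing only against the first item of each cluster keeps the code short, but it makes the result depend on input order. If this works well enough to be worth improving, stemming the tokens or weighting rare words more heavily (for example with TF-IDF) would be the natural next steps.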