Description
As discussed by Nutch Newbie, Gal, and Chris on NUTCH-443, the current library (feedparser) has the following issues:
- OutOfMemory when parsing > 100k feeds, since it has to convert the feed to jdom first
- no support for Atom 1.0
- there has been no development in the last year
Alternatives are:
- Rome
- Informa
- custom implementation based on Stax
- ??