  1. Nutch
  2. NUTCH-583

FeedParser empty links for items


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Auto Closed
    • Affects Version/s: 1.0.0
    • Fix Version/s: 2.5
    • Component/s: None
    • Labels: None

    Description

      The FeedParser in the feed plugin simply discards an item if it has no <link> element. However, RSS 2.0 does not require a <link> element for each <item>.
      Moreover, the link is sometimes given in the <guid> element, which is a globally unique identifier for the item. I think we can first search the item itself for a URL; if none is found, we can fall back to the feed's URL, merging the parse texts of all such items into one Parse object.
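
      A minimal sketch of the lookup order proposed above, not the plugin's actual code. The Item class, the ItemUrlResolver name, and the "starts with http" check on <guid> are assumptions made for illustration; plain strings stand in for Nutch's Parse objects. Each item is keyed by its <link>, else by a URL-like <guid>, else by the feed's URL, and texts that end up under the same URL are merged.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class ItemUrlResolver {

    /** Hypothetical stand-in for a parsed feed item; not a Nutch or ROME type. */
    static final class Item {
        final String link; // content of <link>, or null if absent
        final String guid; // content of <guid>, or null if absent
        final String text; // parse text extracted from the item

        Item(String link, String guid, String text) {
            this.link = link;
            this.guid = guid;
            this.text = (text == null) ? "" : text;
        }
    }

    /** Assumption: a <guid> is usable as a link only if it looks like an HTTP(S) URL. */
    static boolean looksLikeUrl(String s) {
        return s.startsWith("http://") || s.startsWith("https://");
    }

    /** Returns the item's own URL, or null when neither <link> nor <guid> yields one. */
    static String itemUrl(Item item) {
        if (item.link != null && !item.link.trim().isEmpty()) {
            return item.link.trim();
        }
        if (item.guid != null && looksLikeUrl(item.guid.trim())) {
            return item.guid.trim();
        }
        return null;
    }

    /** Maps each URL to its text; items without a URL of their own are merged under feedUrl. */
    static Map<String, String> resolve(String feedUrl, List<Item> items) {
        Map<String, String> textByUrl = new LinkedHashMap<>();
        for (Item item : items) {
            String url = itemUrl(item);
            String key = (url != null) ? url : feedUrl;
            // Concatenate texts that share a key instead of discarding the item.
            textByUrl.merge(key, item.text, (a, b) -> a + "\n" + b);
        }
        return textByUrl;
    }
}
```

      With this shape no item is dropped: an item without <link> or a usable <guid> still contributes its text to the entry for the feed's own URL.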

          People

            Assignee: Enis Soztutar
            Reporter: Enis Soztutar
            Votes: 0
            Watchers: 1
