Nutch / NUTCH-2744

CrawlDbReader: improved reporting of syntactic errors in Jexl expression

Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 1.16
    • Fix Version/s: 1.21
    • Component/s: crawldb
    • Labels: None

    Description

      CrawlDbReader reports syntactic errors in Jexl expressions only in the task logs (hadoop.log in local mode) and continues as if no Jexl expression were set. It should report the error more verbosely and probably also fail the job, at least if the error can be detected at job start.
      In my case, a trivial error (score > .9 instead of score > 0.9) left the CrawlDb unfiltered.
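
The fail-fast behaviour requested above could look roughly like the sketch below. It is not the CrawlDbReader code: `ExpressionParser`, `parseOrFail`, and the toy parser in `main` are hypothetical names introduced for illustration, and the real parser would be the Jexl engine. The only point is that a parse error raised at job setup is rethrown instead of being logged and ignored.

```java
public class FailFastExpressionCheck {

    /** Hypothetical stand-in for the real expression parser (for example, a Jexl engine). */
    public interface ExpressionParser {
        Object parse(String expression) throws Exception;
    }

    /**
     * Parses the filter expression once, at job setup.
     * Returns null when no expression is configured; throws when the
     * expression is set but cannot be parsed, so the job stops early.
     */
    public static Object parseOrFail(ExpressionParser parser, String expression) {
        if (expression == null) {
            return null; // no filter configured, nothing to validate
        }
        try {
            return parser.parse(expression);
        } catch (Exception e) {
            // Rethrow with the offending expression in the message instead of
            // logging the error and silently continuing unfiltered.
            throw new IllegalArgumentException(
                "Invalid filter expression '" + expression + "': " + e.getMessage(), e);
        }
    }

    public static void main(String[] args) {
        // Toy parser for the demo only: rejects a number written without a leading digit.
        ExpressionParser toy = expr -> {
            if (expr.matches(".*(^|[^0-9])\\.[0-9].*")) {
                throw new Exception("unexpected '.'");
            }
            return expr;
        };

        System.out.println(parseOrFail(toy, "score > 0.9"));
        try {
            parseOrFail(toy, "score > .9");
        } catch (IllegalArgumentException e) {
            System.err.println("Job aborted: " + e.getMessage());
        }
    }
}
```

Throwing an unchecked exception from the setup path is one possible choice; a tool-style entry point might instead print the message and return a non-zero exit code.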

          People

            Assignee: Unassigned
            Reporter: Sebastian Nagel (snagel)
            Votes: 0
            Watchers: 1
