  1. Nutch
  2. NUTCH-624

Better parsed text by default parser

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Incomplete
    • Affects Version/s: 1.0.0
    • Fix Version/s: None
    • Component/s: None
    • Labels: None

    Description

      The text produced by the default parser (Neko, in the 1.0 nightly) is not easy to process: it just adds a space at the end of each tag.
      For easier analysis, Neko (or whichever parser is used) should instead:
      1. add a tab after an inline element
      2. add a tab + newline at the end of a block-level element

      That would make the parsed text easier for other applications to use.
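
The proposed delimiter rule can be sketched as follows. This is only an illustration in Python using the standard library's `html.parser`; it is not Nutch or Neko code, and the names `BLOCK_TAGS`, `DelimitedTextExtractor`, and `extract_text` are hypothetical, as is the short list of block-level tags.

```python
from html.parser import HTMLParser

# Hypothetical, deliberately short list of block-level tags (assumption);
# a real implementation would need a fuller list.
BLOCK_TAGS = {"p", "div", "li", "ul", "ol", "table", "tr",
              "h1", "h2", "h3", "blockquote"}


class DelimitedTextExtractor(HTMLParser):
    """Collects text and emits a delimiter whenever an element closes."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        # Text content is kept exactly as it appears in the document.
        self.parts.append(data)

    def handle_endtag(self, tag):
        # Block-level end: tab + newline. Any other (inline) end: tab only.
        self.parts.append("\t\n" if tag in BLOCK_TAGS else "\t")

    def text(self):
        return "".join(self.parts)


def extract_text(html):
    """Return the text of `html` with the proposed tab / tab+newline delimiters."""
    parser = DelimitedTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

With this rule, a downstream tool can split the output on newlines to recover block boundaries and on tabs to recover inline element boundaries, which a single trailing space does not allow.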

          People

            Assignee: Andrzej Bialecki (ab)
            Reporter: Vinci (vinci)