XMLWordPrintableJSON

    Details

      Description

      I’ll be crawling a website with the standard Web connecter. I want to extract just certain html tags like <h1>, <h2> and <p>. 
      I’ve set up an HTML extractor transformation connector and the internal Tika transformation connector. But I can’t find any place to do a mapping to the output for this.
       
      Do I have to write my own transformation connector to extract the content of these tags? Or is there a built in solution?

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              DonaldVdD Donald Van den Driessche
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: