  1. Spark
  2. SPARK-44265 Built-in XML data source support
  3. SPARK-47218

XML: Ignore commented Row Tags in XML tokenizer


Details

    Description

      The following read returns rows that are inside XML comments:

      // BUG: rowTag in comment -- incorrectly processed
      display(spark.read.xml(write("""<root> <ROW><a>1</a></ROW> <!--<ROW><a>2</a></ROW>--> </root>""")))

      This has been reported before: How to Ignore XML comments like this · Issue #208 · databricks/spark-xml
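
The expected behavior can be illustrated with a minimal sketch in plain Scala. This is not Spark's tokenizer and makes no claim about how the fix is implemented; `CommentAwareRowScan` and `rowsOutsideComments` are hypothetical names introduced here. It assumes the row tag carries no attributes, rows are not nested, and the input contains no CDATA sections. The point is only that a scanner which skips over `<!-- ... -->` before matching the row tag never emits the commented-out row.

```scala
object CommentAwareRowScan {
  // Hypothetical helper, NOT Spark's XML tokenizer: returns the text of each
  // <rowTag>...</rowTag> element that does not start inside an XML comment.
  // Assumes attribute-free, non-nested row tags and no CDATA sections.
  def rowsOutsideComments(xml: String, rowTag: String): List[String] = {
    val open = s"<$rowTag>"
    val close = s"</$rowTag>"
    val rows = List.newBuilder[String]
    var i = 0
    while (i < xml.length) {
      if (xml.startsWith("<!--", i)) {
        // Skip the whole comment; an unterminated comment swallows the rest.
        val end = xml.indexOf("-->", i + 4)
        i = if (end < 0) xml.length else end + 3
      } else if (xml.startsWith(open, i)) {
        val end = xml.indexOf(close, i + open.length)
        if (end < 0) {
          // Row never closes: stop scanning rather than emit a partial row.
          i = xml.length
        } else {
          rows += xml.substring(i, end + close.length)
          i = end + close.length
        }
      } else {
        i += 1
      }
    }
    rows.result()
  }
}
```

Applied to the document from the report with row tag `ROW`, the sketch yields only `<ROW><a>1</a></ROW>`; the row wrapped in the comment is skipped.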

            People

              yhosny Yousof Hosny
              Votes: 0
              Watchers: 2
