Details

    • Sub-task
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 2.3.1
    • 2.4.0
    • SQL
    • None

    Description

      Need to add new option for Avro datasource - ignoreExtension. It should control ignoring of the .avro extensions. If it is set to true (by default), files with and without .avro extensions should be loaded. Example of usage:

      spark
        .read
        .option("ignoreExtension", false)
        .avro("path to avro files")
      

      The option duplicates Hadoop's config avro.mapred.ignore.inputs.without.extension which is taken into account by Avro datasource now and can be set like:

      spark
        .sqlContext
        .sparkContext
        .hadoopConfiguration
        .set("avro.mapred.ignore.inputs.without.extension", "true")
      

      The ignoreExtension option must override avro.mapred.ignore.inputs.without.extension.

      Attachments

        Activity

          People

            maxgekk Max Gekk
            maxgekk Max Gekk
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: