[SPARK-27162] Add new method getOriginalMap in CaseInsensitiveStringMap - ASF JIRA

Details

Type: Task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.0.0
Fix Version/s: 3.0.0
Component/s: SQL
Labels: None

Description

Currently, DataFrameReader/DataFrameWriter support setting Hadoop configurations via the `.option()` method.
For example:
```
import org.apache.hadoop.fs.{Path, PathFilter}

// Skip every file that sits directly under the "p=2" partition directory.
class TestFileFilter extends PathFilter {
  override def accept(path: Path): Boolean = path.getParent.getName != "p=2"
}

withTempPath { dir =>
  val path = dir.getCanonicalPath
  val df = spark.range(2)
  df.write.orc(path + "/p=1")
  df.write.orc(path + "/p=2")
  // Without the filter, both partitions are read: 2 rows each.
  assert(spark.read.orc(path).count() === 4)

  val extraOptions = Map(
    "mapred.input.pathFilter.class" -> classOf[TestFileFilter].getName,
    "mapreduce.input.pathFilter.class" -> classOf[TestFileFilter].getName
  )
  // With the filter passed as Hadoop options, only "p=1" is read.
  assert(spark.read.options(extraOptions).orc(path).count() === 2)
}
```
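Hadoop configuration keys are case sensitive, but the current data source V2 APIs pass options to `TableProvider` as a `CaseInsensitiveStringMap`, which does not preserve the original case of keys such as `mapreduce.input.pathFilter.class`.
To create Hadoop configurations correctly, I suggest adding a method `getOriginalMap` to `CaseInsensitiveStringMap` that returns the options with their original keys.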
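
To illustrate the idea, here is a minimal, self-contained sketch. It is not Spark's actual implementation: `SketchCaseInsensitiveMap` and `SketchDemo` are hypothetical names, and a plain Scala `Map` stands in for a Hadoop `Configuration`. It only shows how a wrapper can answer lookups case-insensitively while still handing back the keys exactly as the user wrote them.

```scala
import java.util.Locale

// Hypothetical stand-in for CaseInsensitiveStringMap; not the real Spark class.
final class SketchCaseInsensitiveMap(original: Map[String, String]) {
  // Lookup table keyed by lower-cased names, so get() ignores key case.
  private val lowerCased: Map[String, String] =
    original.map { case (k, v) => k.toLowerCase(Locale.ROOT) -> v }

  def get(key: String): Option[String] =
    lowerCased.get(key.toLowerCase(Locale.ROOT))

  // Returns the options exactly as supplied, with key case intact.
  def getOriginalMap: Map[String, String] = original
}

object SketchDemo {
  val options = new SketchCaseInsensitiveMap(
    Map("mapreduce.input.pathFilter.class" -> "TestFileFilter"))

  // A case-sensitive consumer (assumed stand-in for a Hadoop Configuration)
  // should be populated from the original keys, not the lower-cased ones.
  val hadoopLike: Map[String, String] = options.getOriginalMap
}
```

In this sketch, a lookup through `get` succeeds regardless of case, while `hadoopLike` contains only the camel-cased key that a case-sensitive consumer would expect.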

Issue Links

links to GitHub Pull Request #24094

People

Assignee: Gengliang Wang

Reporter: Gengliang Wang

Votes: 0

Watchers: 2

Dates

Created: 14/Mar/19 16:07

Updated: 19/Mar/19 05:38

Resolved: 19/Mar/19 05:36