[SPARK-17986] SQLTransformer leaks temporary tables - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.0.2, 2.1.0
Component/s: ML
Labels:
None

Description

The SQLTransformer creates a temporary table when called, and does not delete this temporary table. When using a SQLTransformer in a long running Spark Streaming task, these temporary tables accumulate.

I believe that the fix would be as simple as calling `dataset.sparkSession.catalog.dropTempView(tableName)` in the last part of `transform`:
https://github.com/apache/spark/blob/v2.0.1/mllib/src/main/scala/org/apache/spark/ml/feature/SQLTransformer.scala#L65.

Attachments

Issue Links

links to

[Github] Pull Request #15526 (drewrobb)

Activity

People

Assignee:: Drew Robb

Reporter:: Drew Robb

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 18/Oct/16 03:17

Updated:: 22/Oct/16 09:06

Resolved:: 22/Oct/16 09:05