Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
According to the documentation, global indexing can be used with SQL joins and is enabled by default. But the code path here:
calls the JoinParams constructor here:
which always sets useIndex to false. This prevents indexing from being possible via SQL queries, and the non-indexed join doesn't work well with large datasets (separate issue, loads all of the non-window objects of each partition into memory at once and quickly runs out of memory)
Also, this python adapter uses the arguments incorrectly as well here:
Need to update the signature of JoinParams, probably just add one with all four parameters
Attachments
Issue Links
- links to