Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-49673

Increase maxBatchSize for Connect's sqlCommandResult

    XMLWordPrintableJSON

Details

    Description

      Increase the default maxBatchSize from 4MiB * 0.7 to 128MiB (=
      CONNECT_GRPC_MAX_MESSAGE_SIZE) * 0.7 when creating the single Arrow batch for the SqlCommandResult in the SparkConnectPlanner. This lets us return much larger LocalRelations in the SqlCommandResult (for example for the SHOW PARTITIONS command) while still staying within the GRPC message size limit.

      Attachments

        Issue Links

          Activity

            People

              dillitz Robert Dillitz
              dillitz Robert Dillitz
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - 1h
                  1h
                  Remaining:
                  Remaining Estimate - 1h
                  1h
                  Logged:
                  Time Spent - Not Specified
                  Not Specified