Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-44111

Prepare Apache Spark 4.0.0

    XMLWordPrintableJSON

Details

    • Umbrella
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 4.0.0
    • None
    • Build

    Description

      For now, this issue aims to collect ideas for planning Apache Spark 4.0.0.

      We will add more items which will be excluded from Apache Spark 3.5.0 (Feature Freeze: July 16th, 2023).

      Spark 1: 2014.05 (1.0.0) ~ 2016.11 (1.6.3)
      Spark 2: 2016.07 (2.0.0) ~ 2021.05 (2.4.8)
      Spark 3: 2020.06 (3.0.0) ~ 2026.xx (3.5.x)
      Spark 4: 2025.02 (4.0.0, NEW)
      - January 15th 2025	Code freeze. Release branch cut.
      - February 1st 2025	QA period. Focus on bug fixes, tests, stability and docs. Generally, no new features merged.
      - February 15th 2025	Release candidates (RC), voting, etc. until final release passes
      

      Attachments

        Issue Links

          1.
          Drop mesos support Sub-task Resolved Sean R. Owen
          2.
          Drop Python 3.8 support Sub-task Resolved Hyukjin Kwon
          3.
          Deprecate SaprkR Sub-task Resolved Hyukjin Kwon
          4.
          Remove deprecated `BinaryClassificationMetrics.scoreLabelsWeight` Sub-task Resolved Dongjoon Hyun
          5.
          Fix `GenerateMIMAIgnore.isPackagePrivateModule` to return false instead of failing Sub-task Resolved Dongjoon Hyun
          6.
          Remove shim classes for Hive prior 2.0.0 Sub-task Resolved Cheng Pan
          7.
          Support Hive 4.0 metastore Sub-task Resolved Kent Yao
          8.
          Cast between UDT and other types Sub-task Resolved L. C. Hsieh
          9.
          Eliminate unnecessary reflection invocation in Hive shim classes Sub-task Resolved Cheng Pan
          10.
          Deprecated no-op `spark.shuffle.spill` config Sub-task Resolved Dongjoon Hyun
          11.
          Deprecate spark.sql.parser.escapedStringLiterals Sub-task Resolved Max Gekk
          12.
          Deprecate `spark.sql.legacy.allowZeroIndexInFormatString` Sub-task Resolved Max Gekk
          13.
          Deprecate spark.network.remoteReadNioBufferConversion Sub-task Resolved Cheng Pan
          14.
          Upgrade Guava to 33.2.1-jre Sub-task Resolved Cheng Pan
          15.
          Upgrade Guava to 33.3.1-jre Sub-task Resolved Yang Jie
          16.
          Upgrade Scala to 2.13.12 Sub-task Resolved Yang Jie
          17.
          Upgrade Scala to 2.13.13 Sub-task Resolved Pan Bingkun
          18.
          Upgrade Scala to 2.13.14 Sub-task Resolved Pan Bingkun
          19.
          Upgrade Scala to 2.13.15 Sub-task Resolved Pan Bingkun
          20.
          Enable spark.shuffle.service.removeShuffle by default Sub-task Resolved Dongjoon Hyun
          21.
          Enable spark.eventLog.compress by default Sub-task Resolved Dongjoon Hyun
          22.
          Enable spark.eventLog.rolling.enabled by default Sub-task Resolved Dongjoon Hyun
          23.
          Enable `spark.metrics.appStatusSource.enabled` by default Sub-task Resolved Dongjoon Hyun
          24.
          Enable `spark.stage.ignoreDecommissionFetchFailure` by default Sub-task Resolved Dongjoon Hyun
          25.
          Enable `spark.sql.sources.v2.bucketing.pushPartValues.enabled` by default Sub-task Resolved Szehon Ho
          26.
          Update `spark.speculation.multiplier` to 3 and `spark.speculation.quantile` to 0.9 Sub-task Resolved Dongjoon Hyun
          27.
          Change default of spark.sql.legacy.timeParserPolicy from EXCEPTION to CORRECTED Sub-task Resolved Serge Rielau
          28.
          Make EventLoggingListenerSuite independent from spark.eventLog.compress conf Sub-task Resolved Dongjoon Hyun
          29.
          Fix EventLogFileWriters to handle `none` codec case Sub-task Resolved Dongjoon Hyun
          30.
          Migrate antlr4 from 4.9 to 4.10+ Sub-task Resolved Yang Jie
          31.
          Upgrade Python to 3.11 in Maven builds Sub-task Resolved Hyukjin Kwon
          32.
          Upgrade `protobuf` to 4.25.1 to support `Python 3.11` Sub-task Resolved Dongjoon Hyun
          33.
          Upgrade `protobuf-java` to 3.25.1 to match with protobuf 4.25.1 Sub-task Resolved Dongjoon Hyun
          34.
          Upgrade `protobuf-java` to 4.28.3 Sub-task Resolved Dongjoon Hyun
          35.
          Upgrade protobuf-java to 4.29.1 Sub-task Resolved Yang Jie
          36.
          Support Python 3.12 Sub-task Resolved Dongjoon Hyun
          37.
          Support Python 3.13 in Spark Classic Sub-task Resolved Hyukjin Kwon
          38.
          Remove pinned version of torch for Python 3.12 support Sub-task Resolved Hyukjin Kwon
          39.
          Handle PEP-632 by checking `ModuleNotFoundError` on `setuptools` in Python 3.12 Sub-task Resolved Dongjoon Hyun
          40.
          `mypy` should have `--python-executable` parameter Sub-task Resolved Dongjoon Hyun
          41.
          Upgrade Pandas to 2.2.0 Sub-task Resolved Haejoon Lee
          42.
          Update pandas to 2.2.1 Sub-task Resolved Bjørn Jørgensen
          43.
          Update pandas to 2.2.2 Sub-task Resolved Bjørn Jørgensen
          44.
          Update pandas to 2.2.3 Sub-task Resolved Bjørn Jørgensen
          45.
          Remove `distutils` usage Sub-task Resolved Dongjoon Hyun
          46.
          Remove deprecated Hadoop-2 `LocatedFileStatus` constructor Sub-task Resolved Dongjoon Hyun
          47.
          Improve InMemoryFileIndex to use FileSystem.listFiles API Sub-task Resolved Dongjoon Hyun
          48.
          Change RocksDB as default shuffle service db backend Sub-task Resolved Jia Fan
          49.
          Remove threeten-extra exclusion in enforceBytecodeVersion rule Sub-task Resolved Dongjoon Hyun
          50.
          Upgrade ORC to 1.9.1 Sub-task Resolved Dongjoon Hyun
          51.
          Upgrade ORC to 1.9.2 Sub-task Resolved Dongjoon Hyun
          52.
          Upgrade ORC to 2.0.0 Sub-task Resolved Dongjoon Hyun
          53.
          Upgrade ORC to 2.0.1 Sub-task Resolved William Hyun
          54.
          Upgrade ORC to 2.0.2 Sub-task Resolved Dongjoon Hyun
          55.
          Remove workaround for YARN-3350 Sub-task Resolved Cheng Pan
          56.
          Update docs for size function and sizeOfNull configuration Sub-task Resolved Kent Yao
          57.
          Support ORC Brotli codec Sub-task Resolved dzcxzl
          58.
          Fix ORC tests to be independent from default compression Sub-task Resolved Dongjoon Hyun
          59.
          Use `zstd` as the default ORC compression Sub-task Resolved Dongjoon Hyun
          60.
          Use the default ORC compression in OrcReadBenchmark Sub-task Resolved Dongjoon Hyun
          61.
          Improve `TPCDSQueryBenchmark` to support other file formats Sub-task Resolved Dongjoon Hyun
          62.
          Use default ORC compression in data source benchmarks Sub-task Resolved Dongjoon Hyun
          63.
          Upgrade `Parquet` to 1.14.1 Sub-task Resolved Fokko Driesprong
          64.
          Upgrade Parquet to 1.14.2 Sub-task Resolved Fokko Driesprong
          65.
          Upgrade `Parquet` to 1.14.3 Sub-task Resolved Pan Bingkun
          66.
          Bump Apache Parquet to 1.14.4 Sub-task Resolved Fokko Driesprong
          67.
          Bump Apache Parquet to 1.15.0 Sub-task Resolved Fokko Driesprong
          68.
          Upgrade Avro to 1.11.3 Sub-task Resolved Dongjoon Hyun
          69.
          Upgrade Avro to 1.12.0 Sub-task Resolved Fokko Driesprong
          70.
          Documented Prometheus endpoints Sub-task Resolved Jerry Zhou
          71.
          Document a few missed `spark.ui.*` configs to `Configuration` page Sub-task Resolved Dongjoon Hyun
          72.
          Upgrade Maven to 3.9.6 for MNG-7913 Sub-task Resolved Dongjoon Hyun
          73.
          Upgrade Maven to 3.9.7 Sub-task Resolved Pan Bingkun
          74.
          Upgrade Maven to 3.9.8 Sub-task Resolved Dongjoon Hyun
          75.
          Upgrade maven to 3.9.9 Sub-task Resolved Yang Jie
          76.
          Use Scala 2.13 Spark distribution in HiveExternalCatalogVersionsSuite Sub-task Resolved Dongjoon Hyun
          77.
          Add Apple Silicon Maven build test to GitHub Action CI Sub-task Resolved Dongjoon Hyun
          78.
          Add Daily Apple Silicon Github Action Job (Java/Scala) Sub-task Resolved Hyukjin Kwon
          79.
          Migrate from AppVeyor to GitHub Actions for SparkR tests on Windows Sub-task Resolved Hyukjin Kwon
          80.
          Attach codec extension to avro datasource files Sub-task Resolved Kent Yao
          81.
          Benchmarking Avro with Compression Codecs Sub-task Resolved Kent Yao
          82.
          Codec xz and zstandard support compression level for avro files Sub-task Resolved Kent Yao
          83.
          Disable unsupported `ExtendedLevelDBTest` on `MacOS/aarch64` Sub-task Resolved Yang Jie
          84.
          Change to use bcprov/bcpkix-jdk18on for test Sub-task Resolved Yang Jie
          85.
          Add `bouncy-castle` test dependencies to `sql/core` module for Hadoop 3.4.0 Sub-task Resolved Yang Jie
          86.
          Add `bcpkix-jdk18on` test dependencies to `hive` module for Hadoop 3.4.0 Sub-task Resolved Dongjoon Hyun
          87.
          Upgrade `bouncycastle` to 1.78 Sub-task Resolved Dongjoon Hyun
          88.
          Use Hadoop 3.3.5 winutils in AppVeyor build Sub-task Resolved Pan Bingkun
          89.
          Upgrade Hadoop to 3.3.6 Sub-task Resolved Dongjoon Hyun
          90.
          Upgrade Hadoop to 3.4.0 Sub-task Resolved Dongjoon Hyun
          91.
          Upgrade Hadoop to 3.4.1 Sub-task Resolved Dongjoon Hyun
          92.
          Use hadoop 3.4.0 in some docs Sub-task Resolved Pan Bingkun
          93.
          Fix `IsolatedClientLoader.supportsHadoopShadedClient` to handle Hadoop 3.4+ Sub-task Resolved Dongjoon Hyun
          94.
          Exclude `logback` dependency from SBT like Maven Sub-task Resolved Dongjoon Hyun
          95.
          Ignore `IntentionallyFaultyConnectionProvider` error in `CliSuite` Sub-task Resolved Dongjoon Hyun
          96.
          Set spark.hadoop.fs.s3a.connection.establish.timeout to 30s Sub-task Resolved Dongjoon Hyun
          97.
          Regenerate benchmark results Sub-task Resolved Dongjoon Hyun
          98.
          Upgrade R version from 4.3.1 to 4.3.2 in AppVeyor Sub-task Resolved Hyukjin Kwon
          99.
          Use R 4.3.3 in `windows` R GitHub Action job Sub-task Resolved Dongjoon Hyun
          100.
          Use `Ubuntu 22.04` in `dev/infra/Dockerfile` Sub-task Resolved Dongjoon Hyun
          101.
          Support MergeInto in DataFrameWriterV2 Sub-task Resolved Huaxin Gao
          102.
          Upgrade Arrow to 14.0.0 Sub-task Resolved Yang Jie
          103.
          Upgrade pyarrow to 14 Sub-task Resolved Ruifeng Zheng
          104.
          Upgrade Arrow to 14.0.1 Sub-task Resolved Dongjoon Hyun
          105.
          Upgrade Arrow to 14.0.2 Sub-task Resolved Dongjoon Hyun
          106.
          Upgrade Arrow to 15.0.0 Sub-task Resolved Yang Jie
          107.
          Upgrade Arrow to 15.0.2 Sub-task Resolved Pan Bingkun
          108.
          Upgrade pyarrow to 15.0.0 Sub-task Resolved Ruifeng Zheng
          109.
          Upgrade `Arrow` to 16.0.0 Sub-task Resolved dzcxzl
          110.
          Upgrade `Arrow` to 17.0.0 Sub-task Resolved Pan Bingkun
          111.
          fail by default in deprecated catalog plugin APIs Sub-task Resolved Wenchen Fan
          112.
          Refresh testing image for pyarrow 17 Sub-task Resolved Ruifeng Zheng
          113.
          Upgrade the minimum version of PyArrow to 10.0.0 Sub-task Resolved Haejoon Lee
          114.
          Upgrade the minimum version of `arrow` R package to 10.0.0 Sub-task Resolved Dongjoon Hyun
          115.
          Move `o.a.s.variant` to `o.a.s.types.variant` Sub-task Resolved Dongjoon Hyun
          116.
          Remove Spark 3.0~3.2 pyspark/version.py workaround from release scripts Sub-task Resolved Dongjoon Hyun
          117.
          Add `slf4j-api` jar to the class path first before the others of `jars` directory Sub-task Resolved Dongjoon Hyun
          118.
          Make Spark build with -release instead of -target Sub-task Resolved Yang Jie
          119.
          Use `HiveConf.getConfVars` or Hive conf names directly Sub-task Resolved Dongjoon Hyun
          120.
          Upgrade hive-service-rpc 4.0.0 Sub-task Resolved Cheng Pan
          121.
          Upgrade Kafka to 3.6.1 Sub-task Resolved Deng Ziming
          122.
          Upgrade Kafka to 3.7.0 Sub-task Resolved Pan Bingkun
          123.
          Upgrade Kafka to 3.7.1 Sub-task Resolved Pan Bingkun
          124.
          Upgrade Kafka to 3.8.0 Sub-task Resolved Pan Bingkun
          125.
          Upgrade Kafka to 3.8.1 Sub-task Resolved Dongjoon Hyun
          126.
          Upgrade Kafka to 3.9.0 Sub-task Resolved Pan Bingkun
          127.
          Fix `dev/scalastyle` to check `hadoop-cloud` and `jvm-profiler` modules Sub-task Resolved Dongjoon Hyun
          128.
          Remove redundant rules from `MimaExcludes` Sub-task Resolved Dongjoon Hyun
          129.
          Run `ANSI` SQL CI twice per day Sub-task Resolved Dongjoon Hyun
          130.
          Use ANSI SQL mode by default Sub-task Resolved Dongjoon Hyun
          131.
          Switch ANSI SQL CI job to NON-ANSI SQL CI job Sub-task Resolved Dongjoon Hyun
          132.
          Update `sql-ref-ansi-compliance.md` and `sql-ref-identifier.md` Sub-task Resolved Dongjoon Hyun
          133.
          Fix a bug in try_divide function when with decimals Sub-task Resolved Gengliang Wang
          134.
          Regenerate benchmark results after turning ANSI on Sub-task Resolved Kent Yao
          135.
          Remove install_scala from build/mvn Sub-task Resolved Cheng Pan
          136.
          Use Hive tables explicitly for Hive table capability tests Sub-task Resolved Dongjoon Hyun
          137.
          Parameterize max limits of `spark.sql.test.randomDataGenerator` Sub-task Resolved Dongjoon Hyun
          138.
          Set `spark.sql.legacy.createHiveTableByDefault` to `false` by default Sub-task Resolved Dongjoon Hyun
          139.
          Support SPARK_SQL_LEGACY_CREATE_HIVE_TABLE env variable Sub-task Resolved Dongjoon Hyun
          140.
          Fix StreamingQueryHashPartitionVerifySuite to update golden files correctly Sub-task Resolved Dongjoon Hyun
          141.
          Introduces a universal BinaryFormatter to make binary output consistent Sub-task Resolved Kent Yao
          142.
          Disable a flaky `SparkSessionE2ESuite.interrupt tag` test Sub-task Resolved Dongjoon Hyun
          143.
          Add toArrow() DataFrame method to PySpark Sub-task Resolved Ian Cook
          144.
          Mark deprecated functions with `@deprecated` in `SQLImplicits` Sub-task Resolved Yang Jie
          145.
          Allow passing PyArrow Table to createDataFrame() Sub-task Resolved Ian Cook
          146.
          Support additional PyArrow Table column types Sub-task Resolved Ian Cook
          147.
          Fix `Black` target version to Python 3.9 Sub-task Resolved Dongjoon Hyun
          148.
          Make `IndexShuffleBlockResolver` have explicit constructors Sub-task Resolved Dongjoon Hyun
          149.
          Preserve nulls in map columns in PyArrow Tables Sub-task Resolved Ian Cook
          150.
          Use R 4.4.0 in `windows` R GitHub Action Windows job Sub-task Resolved Pan Bingkun
          151.
          Use R 4.4.1 in `windows` R GitHub Action Windows job Sub-task Resolved Dongjoon Hyun
          152.
          Use R 4.4.2 in `windows` R GitHub Action Windows job Sub-task Resolved Dongjoon Hyun
          153.
          Replace SQLContext with SparkSession in STS Sub-task Resolved Cheng Pan
          154.
          Implement IndentingXMLStreamWriter Sub-task Resolved Alden Lau
          155.
          `Row#getSeq` exhibits a performance regression between master and Spark 3.5 with Scala 2.12 Sub-task Resolved Yang Jie
          156.
          Regenerate benchmark results Sub-task Resolved Dongjoon Hyun
          157.
          Investigate ExternalAppendOnlyUnsafeRowArrayBenchmark Sub-task Resolved Dongjoon Hyun
          158.
          Add `scalastyle` and `checkstyle` rules to avoid `URL` constructors Sub-task Resolved Dongjoon Hyun
          159.
          Enable ANSI for TPCDSQueryBenchmark Sub-task Resolved Kent Yao
          160.
          Support CRC32C for Shuffle Checksum Sub-task Resolved Kent Yao
          161.
          Free memory allocated by large vectors when vectors are reset Sub-task Resolved Wan Kun
          162.
          Remove `Python 3.6 and older`-related logic from `try_simplify_traceback` Sub-task Resolved Dongjoon Hyun
          163.
          Fix `assembly/pom.xml` to use `{project.version}` instead of `{version}` Sub-task Resolved Dongjoon Hyun
          164.
          Use more appropriate parameter type to construct `GenericArrayData` Sub-task Resolved Yang Jie
          165.
          Support user-defined labels for OnDemand PVCs Sub-task Resolved Prathit Malik
          166.
          Fix `spark-rm` and `infra` docker files to create `pypy3.9` links Sub-task Resolved Dongjoon Hyun
          167.
          Remove the flaky `EXEC IMMEDIATE STACK OVERFLOW` test case Sub-task Resolved Dongjoon Hyun
          168.
          Re-enable test_create_dataframe_from_pandas_with_day_time_interval Sub-task Resolved Dongjoon Hyun
          169.
          Pick up system property defaults in SharedSparkContext test suite mixin Sub-task Resolved Josh Rosen
          170.
          Make `docs/index.md` up-to-date for 4.0.0 Sub-task Resolved Dongjoon Hyun
          171.
          Link `python3` to `python3.9` in `spark-rm` Docker image Sub-task Resolved Dongjoon Hyun
          172.
          Support `spark.test.master` in `SparkSubmitArguments` Sub-task Resolved Dongjoon Hyun
          173.
          Parallelize Sphinx build of Python API docs Sub-task Resolved Nicholas Chammas
          174.
          Limit `Sphinx` build parallelism to 4 by default Sub-task Resolved Dongjoon Hyun
          175.
          Deflaky PythonForeachWriterSuite Sub-task Resolved Dongjoon Hyun
          176.
          Function `substring` should accept column names Sub-task Resolved Ruifeng Zheng
          177.
          Add official image Dockerfile for Apache Spark 4.0.0-preview1 Sub-task Resolved Wenchen Fan
          178.
          Add Apache Spark `4.0.0-preview2` Dockerfiles Sub-task Resolved Dongjoon Hyun
          179.
          Increase `spark.test.docker.connectionTimeout` to 10min Sub-task Resolved Dongjoon Hyun
          180.
          Change log level to debug in BlockManagerMasterEndpoint Sub-task Resolved Yuming Wang
          181.
          Set `OBJC_DISABLE_INITIALIZE_FORK_SAFETY=YES` in `MacOS` GitHub Action Job Sub-task Resolved Dongjoon Hyun
          182.
          Use `MacOS 15` in `build_maven_java21_macos14.yml` Sub-task Resolved Dongjoon Hyun
          183.
          Upgrade `Arrow` to 18.0.0 Sub-task Resolved Dongjoon Hyun
          184.
          Skip PySpark `test_cast_to_udt_with_udt` test during `SPARK_SKIP_CONNECT_COMPAT_TESTS` Sub-task Resolved Dongjoon Hyun
          185.
          Fix `SPARK-41388` test case to assert the result Sub-task Resolved Dongjoon Hyun
          186.
          Use SPDX short identifier in `licenses` name Sub-task Resolved Dongjoon Hyun
          187.
          Use `lower_camel_case` in `store_types.proto` Sub-task Resolved Dongjoon Hyun
          188.
          Enable `buf-lint-action` on `core` module Sub-task Resolved Dongjoon Hyun
          189.
          Reduce Cloud Store List API cost for state store maintenance task Sub-task Resolved Riya Verma
          190.
          RocksDB State Store Doesn't Need to Recheck checkpoint path existence Sub-task Resolved Siying Dong
          191.
          Increase test timeout of PythonForeachWriterSuite to 60s Sub-task Resolved Dongjoon Hyun
          192.
          Deflaky `(Ssl)CoarseGrainedExecutorBackendSuite` to reduce test resource and increase timeout Sub-task Resolved Dongjoon Hyun
          193.
          Fix `AsyncRDDActionsSuite.FutureAction result, timeout` test case to be robust Sub-task Resolved Dongjoon Hyun
          194.
          Remove redundant `blank space` after 'show' in `Scala` and `Connect` clients Sub-task Resolved Pan Bingkun
          195.
          Make HadoopMapReduceCommitProtocol respect 'mapreduce.output.basename' to generate file names Sub-task Resolved Kent Yao
          196.
          Handle or document `NumPy 2.1` difference in Python 3.13 Sub-task Resolved Unassigned
          197.
          The `sbt console` command is not available Sub-task Resolved Yang Jie
          198.
          Upgrade the minimum version of `googleapis-common-protos` to 1.65.0 Sub-task Resolved Dongjoon Hyun
          199.
          Upgrade `protobuf` Python package to 5.28.3 Sub-task Resolved Dongjoon Hyun
          200.
          Fix `protobuf` module Maven compilation Sub-task Resolved Dongjoon Hyun
          201.
          Upgrade ORC to 2.0.3 Sub-task Resolved Dongjoon Hyun
          202.
          Upgrade `grpcio*` to 1.59.3 for Python 3.12 Sub-task Resolved Dongjoon Hyun
          203.
          Update INFRA docker image to use `jammy-20240911.1` Sub-task Resolved Dongjoon Hyun
          204.
          Flaky test: `SPARK-46957: Migrated shuffle files should be able to cleanup from executor` with Java 21 Sub-task Resolved Attila Zsolt Piros
          205.
          Remove `run-tests-jenkins`-related stuff Sub-task Resolved Dongjoon Hyun
          206.
          Flaky test: `Cancellation APIs in SparkSession are isolated` Sub-task Resolved Pengfei Xu
          207.
          Install Jinja2 for Python 3.13 build Sub-task Resolved Hyukjin Kwon
          208.
          Disable `SparkSessionJobTaggingAndCancellationSuite.Cancellation APIs in SparkSession are isolated` Sub-task Resolved Unassigned
          209.
          Upgrade `bouncycastle` to 1.79 Sub-task Resolved Dongjoon Hyun
          210.
          Add `getSystemProperty` to PySpark `SparkContext` Sub-task Resolved Dongjoon Hyun
          211.
          Make `curl` retry 3 times in `bin/mvn` Sub-task Resolved Pan Bingkun
          212.
          Make GA's `build_error_docs` run only once Sub-task Resolved Pan Bingkun
          213.
          Make `JdbcDialect.classifyException` throw out the original exception Sub-task Resolved Pan Bingkun
          214.
          Upgrade oracle jdbc driver to 23.6.0.24.10 Sub-task Resolved Pan Bingkun
          215.
          Support Table Options for Insert SQL Sub-task Resolved Szehon Ho
          216.
          Use mirror host instead of `archive.apache.org` Sub-task Resolved Dongjoon Hyun
          217.
          Remove `(any|empty).proto` from RAT exclusion Sub-task Resolved Dongjoon Hyun
          218.
          Support Python 3.13 in Spark Connect Sub-task Resolved Hyukjin Kwon
          219.
          Investigate UTF8_BINARY regression Sub-task Resolved Stevo Mitric
          220.
          Make `--remote` an official option by removing `experimental` warning Sub-task Resolved Dongjoon Hyun
          221.
          Enable ANSI in SQL *SQLQueryTestSuite by default Sub-task Resolved Kent Yao
          222.
          Rename `ojdbc11.version` to `ojdbc17.version` Sub-task Resolved Cheng Pan
          223.
          Make `Parameterized SQL queries` of `SparkSession.sql` API GA Sub-task Resolved Dongjoon Hyun
          224.
          Remove workaround for THRIFT-4805 Sub-task Resolved Cheng Pan
          225.
          Proper error handling for unsupported file system when reading files Sub-task Resolved Kent Yao
          226.
          Update cross-spawn JS module to surpress a warning in lint Sub-task Resolved Kousuke Saruta
          227.
          Fix `spark.storage.replication.proactive` default value documentation Sub-task Resolved Dongjoon Hyun
          228.
          migrate kafka consumer offset information in spark to new MSK cluster Sub-task Open Unassigned
          229.
          Re-enable `SparkSessionE2ESuite.interrupt tag` Sub-task Reopened Unassigned
          230.
          Reenble AmmoniteTest tests in Maven builds Sub-task Open Unassigned
          231.
          Re-enable `SparkSessionJobTaggingAndCancellationSuite.Cancellation APIs in SparkSession are isolated` Sub-task Open Unassigned
          232.
          Add addArtifact API to the Spark SQL Core Sub-task Resolved Pengfei Xu
          233.
          Add addArtifact API to PySpark Sub-task Open Unassigned
          234.
          Log downloaded archive file sizes in SparkContext and Executor Sub-task Resolved Dongjoon Hyun
          235.
          Use `rouge` `4.5.x` by remove the upper bound `3.x` Sub-task Resolved Dongjoon Hyun
          236.
          Add quote builtin function Sub-task Open Kousuke Saruta
          237.
          Use CommonJS format for ESLint configuration file Sub-task Open Kousuke Saruta

          Activity

            People

              Unassigned Unassigned
              dongjoon Dongjoon Hyun
              Votes:
              2 Vote for this issue
              Watchers:
              38 Start watching this issue

              Dates

                Created:
                Updated: