Apache Hudi / HUDI-3818

Hudi doesn't support a bytes (binary) column as the primary key

Details

    • 0.5

    Description

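       When a bytes (binary) column is used as the primary key, Hudi generates the same hoodie key for every record, so an upsert ends up inserting only one row. In the output below, _hoodie_record_key starts with id:java.nio.HeapB..., which suggests the key is built from the buffer object's string form rather than from the column's contents.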

      scala> sql("desc extended binary_test1").show()
      +--------------------+--------------------+-------+
      |            col_name|           data_type|comment|
      +--------------------+--------------------+-------+
      | _hoodie_commit_time|              string|   null|
      |_hoodie_commit_seqno|              string|   null|
      |  _hoodie_record_key|              string|   null|
      |_hoodie_partition...|              string|   null|
      |   _hoodie_file_name|              string|   null|
      |                  id|              binary|   null|
      |                name|              string|   null|
      |                  dt|              string|   null|
      |                    |                    |       |
      |# Detailed Table ...|                    |       |
      |            Database|             default|       |
      |               Table|        binary_test1|       |
      |               Owner|                root|       |
      |        Created Time|Sat Apr 02 13:28:...|       |
      |         Last Access|             UNKNOWN|       |
      |          Created By|         Spark 3.2.0|       |
      |                Type|             MANAGED|       |
      |            Provider|                hudi|       |
      |    Table Properties|[last_commit_time...|       |
      |          Statistics|        435194 bytes|       |
      +--------------------+--------------------+-------+
      
      scala> sql("select * from binary_test1").show()
      +-------------------+--------------------+--------------------+----------------------+--------------------+--------------------+---------+--------+
      |_hoodie_commit_time|_hoodie_commit_seqno|  _hoodie_record_key|_hoodie_partition_path|   _hoodie_file_name|                  id|     name|      dt|
      +-------------------+--------------------+--------------------+----------------------+--------------------+--------------------+---------+--------+
      |  20220402132927590|20220402132927590...|id:java.nio.HeapB...|                      |1a06106e-5e7a-4e6...|[03 45 6A 00 00 0...|Mary Jane|20220401|
      +-------------------+--------------------+--------------------+----------------------+--------------------+--------------------+---------+--------+
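
      The `id:java.nio.HeapB...` prefix in `_hoodie_record_key` is consistent with the key being derived from `java.nio.ByteBuffer.toString`, which reports only position, limit and capacity. The sketch below illustrates that behavior in plain Scala; it is not Hudi's key generator code, and `BinaryKeyDemo` and `toHex` are hypothetical names introduced only for illustration. It assumes the binary value reaches the key generator wrapped in a heap `ByteBuffer`.

```scala
import java.nio.ByteBuffer

object BinaryKeyDemo {
  // Hypothetical helper (not a Hudi API): render the bytes as an upper-case hex string.
  def toHex(bytes: Array[Byte]): String =
    bytes.map(b => f"${b & 0xff}%02X").mkString

  def main(args: Array[String]): Unit = {
    // Two different ids of the same length.
    val a = ByteBuffer.wrap(Array[Byte](0x03, 0x45, 0x6A))
    val b = ByteBuffer.wrap(Array[Byte](0x01, 0x02, 0x03))

    // toString describes the buffer's state, not its contents,
    // so both buffers yield the same string.
    println(a.toString)
    println(a.toString == b.toString)              // true

    // A content-based encoding keeps the two ids distinct.
    println(toHex(a.array()) == toHex(b.array()))  // false
  }
}
```

      Under that assumption, every id of the same length would map to one record key, which matches the single row returned above. A content-based string form of the id (for example a hex encoding, as in `toHex`) would keep keys distinct; whether that is the right fix for Hudi is not stated in this issue.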

            People

              rex_xiong rex xiong
              Shiyan Xu