Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-11983

Hive streaming API uses incorrect logic to assign buckets to incoming records

    XMLWordPrintableJSON

Details

    Description

      The Streaming API tries to distribute records evenly into buckets.
      All records in every Transaction that is part of TransactionBatch goes to the same bucket and a new bucket number is chose for each TransactionBatch.

      Fix: API needs to hash each record to determine which bucket it belongs to.

      Attachments

        1. HIVE-11983.3.patch
          60 kB
          Roshan Naik
        2. HIVE-11983.4.patch
          61 kB
          Roshan Naik
        3. HIVE-11983.5.patch
          58 kB
          Roshan Naik
        4. HIVE-11983.patch
          77 kB
          Roshan Naik

        Issue Links

          Activity

            People

              roshan_naik Roshan Naik
              roshan_naik Roshan Naik
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: