  1. Hive
  2. HIVE-1249

MapJoin to Check Number of Files to Match Metadata

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.10.0
    • Component/s: Query Processor
    • Labels: None

    Description

      Sorted Map Join can execute only when the two tables have the same number of buckets.
      Currently, this check is done by looking only at the table metadata. However, the physical storage is not guaranteed to match what the metadata indicates. In that case, Sorted Map Join may fail to detect the mismatch and return wrong results. An extra check at storage scan time is needed to handle this scenario.

      This issue may also apply to normal map join.
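
A minimal sketch of the kind of check described above, not Hive's actual implementation. It assumes each bucket is stored as exactly one data file directly under the table (or partition) directory, and that files whose names start with "_" or "." are markers rather than data. The class name BucketFileCheck and all method names are hypothetical, and java.nio.file on a local directory stands in for the real storage layer.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.stream.Stream;

// Hypothetical helper: compares the files physically present in a table
// directory with the bucket count declared in the table metadata.
final class BucketFileCheck {

    private BucketFileCheck() {}

    // Counts regular files directly under dir, skipping names that start
    // with "_" or "." (assumed here to be marker or hidden files).
    static long countDataFiles(Path dir) throws IOException {
        try (Stream<Path> entries = Files.list(dir)) {
            return entries
                .filter(Files::isRegularFile)
                .filter(p -> {
                    String name = p.getFileName().toString();
                    return !name.startsWith("_") && !name.startsWith(".");
                })
                .count();
        }
    }

    // True when the number of data files equals the declared bucket count.
    static boolean matchesMetadata(Path dir, int declaredBuckets) throws IOException {
        return countDataFiles(dir) == declaredBuckets;
    }

    // Sorted Map Join is allowed only if the declared bucket counts are equal
    // (the existing metadata check) and both tables' files agree with their
    // metadata (the extra check proposed in this issue).
    static boolean canUseSortedMapJoin(Path leftDir, int leftBuckets,
                                       Path rightDir, int rightBuckets) throws IOException {
        return leftBuckets == rightBuckets
            && matchesMetadata(leftDir, leftBuckets)
            && matchesMetadata(rightDir, rightBuckets);
    }
}
```

With this sketch, a table declared with 2 buckets whose directory holds only one data file fails matchesMetadata, so canUseSortedMapJoin returns false even though the metadata of the two tables agrees.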

          People

            Assignee: Namit Jain (namit)
            Reporter: Siying Dong (sdong)
