Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.18.0, 0.18.1, 0.18.2, 0.18.3, 0.19.0, 0.19.1, 0.19.2, 0.20.0, 0.20.1
    • Fix Version/s: 0.20.2
    • Component/s: fs
    • Labels:
      None
    • Hadoop Flags:
      Reviewed
    • Release Note:
      Bugs fixed for Hadoop archives: character escaping in paths, LineReader and file system caching.

      Description

      Found and fixed several bugs involving Hadoop archives:

      • In makeQualified(), the sloppy conversion from Path to URI and back mangles the path if it contains an escape-worthy character.
      • It's possible that fileStatusInIndex() may have to read more than one segment of the index. The LineReader and count of bytes read need to be reset for each block.
      • har:// connections cannot be indexed by (scheme, authority, username) – the path is significant as well. Caching them in this way limits a hadoop client to opening one archive per filesystem. It seems to be safe not to cache them, since they wrap another connection that does the actual networking.

        Attachments

        1. HADOOP-6097-v2.patch
          1 kB
          Tom White
        2. HADOOP-6097-0.20.patch
          1 kB
          Mahadev konar
        3. HADOOP-6097-0.20.patch
          4 kB
          Mahadev konar
        4. HADOOP-6097-0.20.patch
          4 kB
          Mahadev konar
        5. HADOOP-6097.patch
          2 kB
          Ben Slusky

          Issue Links

            Activity

              People

              • Assignee:
                sluskyb Ben Slusky
                Reporter:
                sluskyb Ben Slusky
              • Votes:
                0 Vote for this issue
                Watchers:
                9 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: