  1. Crunch (Retired)
  2. CRUNCH-450

Adding ORC file format support in Crunch

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.11.0
    • Component/s: Core, IO
    • Labels:
      None

      Description

      This issue adds support for the ORC (Optimized Row Columnar) file format to Crunch. Three modes of ORC serialization/deserialization are supported:

      1) Orcs.orcs(): deserializes records as OrcStruct objects, for high performance
      2) Orcs.reflects(): uses Java reflection so that POJOs can serve as the deserialized objects
      3) Orcs.tuples(): deserializes records as Crunch Tuples, balancing performance and user-friendliness
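
      To illustrate the idea behind the reflection-based mode, here is a minimal, self-contained sketch of mapping a POJO to a flat row of column values and back using only java.lang.reflect. It does not use Crunch or ORC, and it is not the implementation in the attached patches: the class ReflectRowSketch, the Person POJO, and the toRow/fromRow helpers are all hypothetical names introduced for illustration.

```java
import java.lang.reflect.Field;
import java.lang.reflect.Modifier;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class ReflectRowSketch {

    // Hypothetical POJO standing in for a user-defined record type.
    public static class Person {
        public String name;
        public int age;

        public Person() {}

        public Person(String name, int age) {
            this.name = name;
            this.age = age;
        }
    }

    // Instance fields of the class, sorted by name so that the column order is
    // deterministic (getDeclaredFields() does not guarantee any order).
    static List<Field> columns(Class<?> clazz) {
        List<Field> fields = new ArrayList<>();
        for (Field f : clazz.getDeclaredFields()) {
            if (Modifier.isStatic(f.getModifiers()) || f.isSynthetic()) {
                continue; // skip anything that is not per-record data
            }
            f.setAccessible(true);
            fields.add(f);
        }
        fields.sort(Comparator.comparing(Field::getName));
        return fields;
    }

    // "Serialize": read each field reflectively into a flat row of values.
    public static Object[] toRow(Object pojo) {
        List<Field> cols = columns(pojo.getClass());
        Object[] row = new Object[cols.size()];
        try {
            for (int i = 0; i < row.length; i++) {
                row[i] = cols.get(i).get(pojo);
            }
        } catch (IllegalAccessException e) {
            throw new IllegalStateException(e);
        }
        return row;
    }

    // "Deserialize": create the POJO via its no-arg constructor and fill each field.
    public static <T> T fromRow(Class<T> clazz, Object[] row) {
        List<Field> cols = columns(clazz);
        try {
            T pojo = clazz.getDeclaredConstructor().newInstance();
            for (int i = 0; i < row.length; i++) {
                cols.get(i).set(pojo, row[i]);
            }
            return pojo;
        } catch (ReflectiveOperationException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        Object[] row = toRow(new Person("Ada", 36));
        Person copy = fromRow(Person.class, row);
        System.out.println(copy.name + " " + copy.age);
    }
}
```

      The sketch also hints at the trade-off the three modes describe: reflective field access is convenient for user-defined classes but adds per-record overhead compared with working directly on a format-native struct.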

        Attachments

        1. CRUNCH-450.patch
          111 kB
          Zhong Wang
        2. CRUNCH-450-final.patch
          96 kB
          Zhong Wang
        3. CRUNCH-450-newapi.patch
          29 kB
          Zhong Wang
        4. CRUNCH-450-submodule.1.patch
          113 kB
          Zhong Wang
        5. CRUNCH-450-submodule.2.patch
          114 kB
          Zhong Wang
        6. CRUNCH-450-submodule.patch
          120 kB
          Zhong Wang

            People

            • Assignee:
              jwills Josh Wills
              Reporter:
              zwang Zhong Wang
            • Votes:
              0
              Watchers:
              3
