Uploaded image for project: 'Crunch (Retired)'
  1. Crunch (Retired)
  2. CRUNCH-450

Adding ORC file format support in Crunch

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.11.0
    • Core, IO
    • None

    Description

      This JIRA adds ORC (Optimized Row Columnar) file format support in Crunch. Three modes are supported for ORC serialization/deserialization:

      1) Orcs.orcs(): using OrcStructs as the deserialized objects to provide high performance
      2) Orcs.reflects(): using Java reflection to support POJOs as the deserialized objects
      3) Orcs.tuples(): using Crunch Tuples as the deserialized objects to leverage performance and user-friendliness

      Attachments

        1. CRUNCH-450-final.patch
          96 kB
          Zhong Wang
        2. CRUNCH-450-newapi.patch
          29 kB
          Zhong Wang
        3. CRUNCH-450-submodule.2.patch
          114 kB
          Zhong Wang
        4. CRUNCH-450-submodule.1.patch
          113 kB
          Zhong Wang
        5. CRUNCH-450-submodule.patch
          120 kB
          Zhong Wang
        6. CRUNCH-450.patch
          111 kB
          Zhong Wang

        Activity

          People

            jwills Josh Wills
            zwang Zhong Wang
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: