Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-3968

Standalone CSV to Arrow Conversion Tool

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Do
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: C++

      Description

      As part of exploring the Arrow C++ implementation, I wrote standalone tool for streaming Arrow to a file from CSV, using the new CSV parser implementation from [~pitrou]. I realize that Arrow's emphasis is on in-memory representation, as opposed to efficient storage to disk, so I'd love to know if this has any utility for the project. At the very least, it seems like a quick way to get into exploring the format from a something easily inspectable/familiar (i.e., CSV).

      In either case, I'm making this issue here as a placeholder for an accompanying PR on Github. Also, I think this is my first issue for this project, so please let me know if I should do anything differently.

       

      PR is now available at: https://github.com/apache/arrow/pull/3136

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                mookerji Bhaskar Mookerji
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:

                  Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 3h 40m
                  3h 40m