Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-3968

Standalone CSV to Arrow Conversion Tool

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Won't Do
    • None
    • None
    • C++

    Description

      As part of exploring the Arrow C++ implementation, I wrote standalone tool for streaming Arrow to a file from CSV, using the new CSV parser implementation from pitrou. I realize that Arrow's emphasis is on in-memory representation, as opposed to efficient storage to disk, so I'd love to know if this has any utility for the project. At the very least, it seems like a quick way to get into exploring the format from a something easily inspectable/familiar (i.e., CSV).

      In either case, I'm making this issue here as a placeholder for an accompanying PR on Github. Also, I think this is my first issue for this project, so please let me know if I should do anything differently.

       

      PR is now available at: https://github.com/apache/arrow/pull/3136

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              mookerji Bhaskar Mookerji
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 3h 40m
                  3h 40m