Uploaded image for project: 'CarbonData'
  1. CarbonData
  2. CARBONDATA-2916

Support CarbonCli tool for data summary

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.5.0
    • None
    • None

    Description

      When I am tuning carbon performance, very often that I want to check the metadata in carbon files without launching spark shell or sql. In order to do that, I am writing a tool to print metadata information of a given data folder.
      Currently, I am planning to do like this:

      usage: CarbonCli
      a,-all print all information
      b,-tblProperties print table properties
      c,-column <column name> column to print statistics
      -cmd <command name> command to execute, supported commands are:
      summary
      d,-detailSize print each blocklet size
      h,-help print this message
      m,-showSegment print segment information
      p,-path <path> the path which contains carbondata files,
      nested folder is supported
      s,-schema print the schema

      In first phase, I think “summary” command is high priority, and developers can add more command in the future.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              jackylk Jacky Li
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 10h 20m
                  10h 20m