Hive / HIVE-25660

File Format (ORC/AVRO/TextFile...) available in information schema for bulk query


Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: File Formats, Metastore
    • Labels: None

    Description

      Hello all,

      As of today, if you want to know the file format of every table, there are, as far as I know, only two solutions:
      - a loop in a shell script
      - a loop in the tool you use for HQL queries, followed by parsing the output of each query
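
To illustrate what such a loop involves, here is a minimal Python sketch of the parsing step. It assumes each table is described with `DESCRIBE FORMATTED` and that the output contains an `InputFormat:` line naming one of Hive's usual input format classes. The function names, the label strings, and the `describe_table` callable are hypothetical and chosen for this example only; they are not part of Hive.

```python
import re

# Class-name suffixes of common Hive input formats, mapped to a short label.
# The labels are this sketch's own choice, not an official Hive vocabulary.
INPUT_FORMAT_LABELS = {
    "OrcInputFormat": "ORC",
    "MapredParquetInputFormat": "PARQUET",
    "AvroContainerInputFormat": "AVRO",
    "SequenceFileInputFormat": "SEQUENCEFILE",
    "RCFileInputFormat": "RCFILE",
    "TextInputFormat": "TEXTFILE",
}

# Matches the "InputFormat:" row in both plain (tab-separated) output
# and pipe-bordered table output, then captures the class name.
INPUT_FORMAT_ROW = re.compile(r"\bInputFormat:[\s|]*([\w.$]+)")


def file_format_from_describe(describe_output: str) -> str:
    """Return a short file-format label parsed from DESCRIBE FORMATTED text."""
    match = INPUT_FORMAT_ROW.search(describe_output)
    if match is None:
        return "UNKNOWN"
    input_format_class = match.group(1)
    for class_suffix, label in INPUT_FORMAT_LABELS.items():
        if input_format_class.endswith(class_suffix):
            return label
    return "UNKNOWN"


def collect_file_formats(table_names, describe_table):
    """Describe every table in turn and map its name to a format label.

    describe_table is a caller-supplied (hypothetical) callable that runs
    DESCRIBE FORMATTED for one table and returns the raw text.
    """
    return {
        name: file_format_from_describe(describe_table(name))
        for name in table_names
    }
```

In practice `describe_table` would have to wrap whatever client is available (a command-line call or a database cursor), and the loop issues one query per table, which is the overhead this request aims to remove.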

      I think this is far too complicated for such a basic need. A table_file_format (or partition_file_format, I don't know which) in the information_schema would be a very valuable help for monitoring, since it could be read directly by a reporting tool (Superset, Tableau, Power BI, Qlik, or any other).

      Best regards,

      Simon

          People

            Assignee: Unassigned
            Reporter: Simon AUBERT (simon.aubert)