Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-8556

Trim the number of columns to generate col stats out of the box

    XMLWordPrintableJSON

Details

    Description

      As of now, out of the box, we generate col stats for all top level fields. This could be prohibitively expensive for wider tables having 1000 columns. So, we should trim it down to say first 32 to level columns for a good out of the box performance. 

      Users will anyway have an option to override if need be. 

       

      Lets add a config to drive this, and out of the box we can set it to 32. 

       

      Attachments

        Issue Links

          Activity

            People

              jonvex Jonathan Vexler
              shivnarayan sivabalan narayanan
              sivabalan narayanan
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: