Beam / BEAM-7513

[SQL] Row Estimation for BigQueryTable

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Implemented
    • Affects Version/s: None
    • Fix Version/s: 2.14.0
    • Component/s: dsl-sql, io-java-gcp
    • Labels:
      None

      Description

      Calcite tables (org.apache.calcite.schema.Table) should implement the method org.apache.calcite.schema.Statistic getStatistic(). The Statistic instance returned by this method is used by the Volcano optimizer in Calcite.

      Currently, org.apache.beam.sdk.extensions.sql.impl.BeamCalciteTable does not implement getStatistic(), so it inherits the implementation in org.apache.calcite.schema.impl.AbstractTable, which simply returns Statistics.UNKNOWN for all sources.

       

      What needs to be implemented:

      1- Implement getStatistic in BeamCalciteTable so that it calls a row count estimation method on BeamSqlTable, and add this method to BeamSqlTable.

      2- Implement the row count estimation method for BigQueryTable.
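
      The two steps above can be sketched roughly as follows. This is only an illustration of the delegation pattern, not the actual patch. The types are simplified local stand-ins for Statistic, BeamSqlTable, BeamCalciteTable and BigQueryTable, and the method name estimateRowCount and the class FixedSizeTable are hypothetical. A real BigQueryTable would presumably obtain its count from table metadata rather than a constant.

```java
import java.util.Optional;

class RowEstimationSketch {

    /** Simplified stand-in for org.apache.calcite.schema.Statistic (row count only). */
    interface Statistic {
        Double getRowCount(); // null means the row count is unknown
    }

    /** Stand-in for Statistics.UNKNOWN: no row count available. */
    static final Statistic UNKNOWN = () -> null;

    /** Stand-in for BeamSqlTable with a hypothetical estimation hook (step 1). */
    interface SqlTable {
        // Hypothetical name. By default a source cannot estimate its size.
        default Optional<Double> estimateRowCount() {
            return Optional.empty();
        }
    }

    /** Stand-in for BigQueryTable (step 2); the constant replaces a metadata lookup. */
    static final class FixedSizeTable implements SqlTable {
        private final double rows;

        FixedSizeTable(double rows) {
            this.rows = rows;
        }

        @Override
        public Optional<Double> estimateRowCount() {
            return Optional.of(rows);
        }
    }

    /** Stand-in for BeamCalciteTable: turns the table's estimate into a Statistic. */
    static final class CalciteTableAdapter {
        private final SqlTable table;

        CalciteTableAdapter(SqlTable table) {
            this.table = table;
        }

        Statistic getStatistic() {
            Optional<Double> estimate = table.estimateRowCount();
            if (!estimate.isPresent()) {
                return UNKNOWN; // same behaviour as before for sources without an estimate
            }
            Double rows = estimate.get();
            return () -> rows;
        }
    }

    public static void main(String[] args) {
        CalciteTableAdapter known = new CalciteTableAdapter(new FixedSizeTable(1000.0));
        CalciteTableAdapter unknown = new CalciteTableAdapter(new SqlTable() {});
        System.out.println(known.getStatistic().getRowCount());
        System.out.println(unknown.getStatistic().getRowCount());
    }
}
```

      Keeping the default on the table interface means sources without an estimator keep reporting an unknown row count, so only BigQueryTable needs to change at first.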

            People

            • Assignee:
              riazela Alireza Samadianzakaria
              Reporter:
              riazela Alireza Samadianzakaria
            • Votes:
              0
              Watchers:
              1

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Not Specified
                Remaining:
                0h
                Logged:
                10h 10m