Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-34812

[Bug] During the snapshot phase, projection is not being performed according to the user defined schema.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Flink CDC

    Description

          1. Search before asking
      • [X] I searched in the [issues|https://github.com/ververica/flink-cdc-connectors/issues) and found nothing similar.
          1. Flink version

      1.17.0

          1. Flink CDC version

      <=master

          1. Database and its version

      such as mysql,oracle,pg,sqlserver,take oracle 11g as an example.

          1. Minimal reproduce step

      Assuming the ABC table contains three fields, A, B, and C.

      *ddl:*
      CREATE TABLE ABC (
       A BIGINT NOT NULL,
       B STRING,
       PRIMARY KEY(A) NOT ENFORCED
      ) WITH (
       'connector' = 'oracle-cdc',
       'hostname' = '192.168.xxx.xxx',
       'port' = '1521',
       'username' = 'xxx',
       'password' = 'xxx',
       'database-name' = 'xxx',
       'schema-name' = 'xxx',
       'table-name' = 'ABC',
       'debezium.log.mining.strategy' = 'online_catalog',
       'debezium.log.mining.continuous.mine' = 'true',
       'debezium.database.tablename.case.insensitive' = 'false'];

      *dml:*
      select * from ABC;

          1. What did you expect to see?

      *should be:*
      !查部分
      Projection is performed according to the schema definition, and it takes 2.534 seconds.

          1. What did you see instead?

      *snapshot phase:*
      !查全部
      The screenshot indicates that an `*` was used as the query condition, and projection was not performed as defined by the schema. And it took 3.532 seconds to execute this query.

          1. Anything else?

      When dealing with a large amount of data and numerous partitions, this time can become quite significant.

          1. Are you willing to submit a PR?
      • [X] I'm willing to submit a PR!

      ---------------- Imported from GitHub ----------------
      Url: https://github.com/apache/flink-cdc/issues/2470
      Created by: hzjhjjyy
      Labels: bug,
      Created at: Fri Sep 08 09:43:07 CST 2023
      State: open

      Attachments

        Activity

          People

            Unassigned Unassigned
            flink-cdc-import Flink CDC Issue Import
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: