Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-15388

Handle topics that were having compaction as retention earlier are changed to delete only retention policy and onboarded to tiered storage.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • None
    • 3.9.0
    • None
    • None

    Description

      Context: https://github.com/apache/kafka/pull/13561#discussion_r1300055517

       
      There are 3 paths I looked at:

      • When data is moved to remote storage (1)
      • When data is read from remote storage (2)
      • When data is deleted from remote storage (3)

      (1) Does not have a problem with compacted topics. Compacted segments are uploaded and their metadata claims they contain offset from the baseOffset of the segment until the next segment's baseOffset. There are no gaps in offsets.
      (2) Does not have a problem if a customer is querying offsets which do not exist within a segment, but there are offset after the queried offset within the same segment. However, it does have a problem when the next available offset is in a subsequent segment.
      (3) For data deleted via DeleteRecords there is no problem. For data deleted via retention there is no problem.
       
      I believe the proper solution to (2) is to make tiered storage continue looking for the next greater offset in subsequent segments.

      Steps to reproduce the issue:

      // TODO (christo)
      

      Attachments

        1. tieredtopicloglist.png
          971 kB
          Arpit Goyal
        2. Screenshot 2023-11-15 at 3.47.54 PM.png
          97 kB
          Arpit Goyal

        Issue Links

          Activity

            People

              goyarpit Arpit Goyal
              satish.duggana Satish Duggana
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated: