Solr
  1. Solr
  2. SOLR-6089

When using the HDFS block cache, when a file is deleted, it's underlying data entries in the block cache are not removed, which is a problem with the global block cache option.

    Details

    • Type: Bug Bug
    • Status: Resolved
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.10, 6.0
    • Component/s: hdfs
    • Labels:
      None

      Description

      Patrick Hunt noticed this. Without the global block cache, the block cache was not reused after a directory was closed. Now that it is reused when using the global cache, leaving the underlying entries presents a problem if that directory is created again because blocks from the previous directory may be read. This could happen when you remove a solrcore and recreate it with the same data directory (or a collection with the same name). I could only reproduce it easily using index merges (core admin) with the sequence: merge index, delete collection, create collection, merge index. Reads on the final merged index can look corrupt or queries may just return no results.

      1. SOLR-6089.patch
        21 kB
        Mark Miller

        Activity

        Hide
        ASF subversion and git services added a comment -

        Commit 1619427 from Mark Miller in branch 'dev/trunk'
        [ https://svn.apache.org/r1619427 ]

        SOLR-6089: When using the HDFS block cache, when a file is deleted, it's underlying data entries in the block cache are not removed, which is a problem with the global block cache option.

        Show
        ASF subversion and git services added a comment - Commit 1619427 from Mark Miller in branch 'dev/trunk' [ https://svn.apache.org/r1619427 ] SOLR-6089 : When using the HDFS block cache, when a file is deleted, it's underlying data entries in the block cache are not removed, which is a problem with the global block cache option.
        Hide
        ASF subversion and git services added a comment -

        Commit 1619431 from Mark Miller in branch 'dev/branches/branch_4x'
        [ https://svn.apache.org/r1619431 ]

        SOLR-6089: When using the HDFS block cache, when a file is deleted, it's underlying data entries in the block cache are not removed, which is a problem with the global block cache option.

        Show
        ASF subversion and git services added a comment - Commit 1619431 from Mark Miller in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1619431 ] SOLR-6089 : When using the HDFS block cache, when a file is deleted, it's underlying data entries in the block cache are not removed, which is a problem with the global block cache option.

          People

          • Assignee:
            Mark Miller
            Reporter:
            Mark Miller
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development