Solr / SOLR-10115

Corruption in read-side of SOLR-HDFS stack


Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 4.4
    • Fix Version/s: None
    • Component/s: Hadoop Integration, hdfs
    • Labels: None

    Description

      I've been trying to track down some random ArrayIndexOutOfBoundsException (AIOOB) errors in Lucene for a customer, and I've managed to reproduce the issue with a sufficiently large unit test combined with highly concurrent read requests.

      A typical stack trace looks like:

      org.apache.solr.common.SolrException; java.lang.ArrayIndexOutOfBoundsException: 172033655
              at org.apache.lucene.codecs.lucene40.BitVector.get(BitVector.java:149)
              at org.apache.lucene.codecs.lucene41.Lucene41PostingsReader$BlockDocsEnum.nextDoc(Lucene41PostingsReader.java:455)
              at org.apache.lucene.search.MultiTermQueryWrapperFilter.getDocIdSet(MultiTermQueryWrapperFilter.java:111)
              at org.apache.lucene.search.ConstantScoreQuery$ConstantWeight.scorer(ConstantScoreQuery.java:157)
      

      The number of unique stack traces is relatively high; most are AIOOB exceptions, but some are EOF exceptions. Most exceptions occur in the term index; however, I believe this may just be an artifact of where highly concurrent access is most likely to occur. The queries that triggered this had many wildcards and other multi-term queries.
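
      The reproduction approach described above (many threads issuing reads at once against data whose correct contents are known) can be sketched roughly as follows. This is not the attached YCS_HdfsTest.java; it is a minimal illustration under assumptions. The names ConcurrentReadCheck, PositionalReader and countMismatches are hypothetical, and the in-memory reader in main is only a stand-in for the real HDFS-backed read path, which would be plugged in behind the same interface.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Random;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.ThreadLocalRandom;

/** Hypothetical harness: reads a shared source from many threads and verifies every byte. */
class ConcurrentReadCheck {

    /** Assumed stand-in for the read path under test (e.g. a cached, HDFS-backed input). */
    interface PositionalReader {
        void read(long pos, byte[] dst, int off, int len) throws Exception;
    }

    /** Returns how many reads returned bytes that differ from the reference data. */
    static long countMismatches(PositionalReader reader, byte[] expected,
                                int threads, int readsPerThread, int maxLen) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<Long>> results = new ArrayList<>();
            for (int t = 0; t < threads; t++) {
                Callable<Long> task = () -> {
                    ThreadLocalRandom rnd = ThreadLocalRandom.current();
                    long bad = 0;
                    for (int i = 0; i < readsPerThread; i++) {
                        // Pick a random slice that lies entirely inside the reference data.
                        int len = 1 + rnd.nextInt(Math.min(maxLen, expected.length));
                        int pos = rnd.nextInt(expected.length - len + 1);
                        byte[] buf = new byte[len];
                        reader.read(pos, buf, 0, len);
                        // Compare what was read with the known-good bytes.
                        if (!Arrays.equals(buf, Arrays.copyOfRange(expected, pos, pos + len))) {
                            bad++;
                        }
                    }
                    return bad;
                };
                results.add(pool.submit(task));
            }
            long total = 0;
            for (Future<Long> f : results) {
                total += f.get(); // rethrows any exception raised inside a reader thread
            }
            return total;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        byte[] data = new byte[1 << 20];
        new Random(42).nextBytes(data);
        // Trivially correct in-memory reader; a real test would wrap the HDFS read path here.
        PositionalReader inMemory =
            (pos, dst, off, len) -> System.arraycopy(data, (int) pos, dst, off, len);
        System.out.println("mismatched reads: "
            + countMismatches(inMemory, data, 16, 10_000, 4096));
    }
}
```

      Comparing raw bytes rather than running queries keeps the check independent of Lucene, so a nonzero mismatch count would point at the read path itself rather than at the term index where the exceptions happen to surface.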

      Attachments

        1. YCS_HdfsTest.java
          20 kB
          Yonik Seeley

            People

              Assignee: Yonik Seeley (yseeley@gmail.com)
              Reporter: Yonik Seeley (yseeley@gmail.com)
              Votes: 0
              Watchers: 4
