Solr / SOLR-10115

Corruption in read-side of SOLR-HDFS stack

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 4.4
    • Fix Version/s: None
    • Component/s: Hadoop Integration, hdfs
    • Labels:
      None

      Description

      I've been trying to track down intermittent ArrayIndexOutOfBoundsException (AIOOB) errors in Lucene for a customer, and I've managed to reproduce the issue with a sufficiently large unit test combined with highly concurrent read requests.

      A typical stack trace looks like:

      org.apache.solr.common.SolrException; java.lang.ArrayIndexOutOfBoundsException: 172033655
              at org.apache.lucene.codecs.lucene40.BitVector.get(BitVector.java:149)
              at org.apache.lucene.codecs.lucene41.Lucene41PostingsReader$BlockDocsEnum.nextDoc(Lucene41PostingsReader.java:455)
              at org.apache.lucene.search.MultiTermQueryWrapperFilter.getDocIdSet(MultiTermQueryWrapperFilter.java:111)
              at org.apache.lucene.search.ConstantScoreQuery$ConstantWeight.scorer(ConstantScoreQuery.java:157)
      

      The number of unique stack traces is relatively high: most are AIOOB exceptions, but some are EOF exceptions. Most exceptions occur in the term index; however, I believe this may just be an artifact of where highly concurrent access is most likely to occur. The queries that triggered this contained many wildcards and other multi-term queries.
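
      For illustration only, the kind of concurrent read check described above could be sketched as below. This is not the unit test mentioned in this report. The RandomAccessSource interface, the expectedByte pattern, and the in-memory source are hypothetical stand-ins; a real reproduction would presumably wrap the HDFS-backed read path and use a much larger data set.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.atomic.AtomicLong;

public class ConcurrentReadCheck {

    /** Hypothetical stand-in for the read path under test (not a Solr or Lucene API). */
    public interface RandomAccessSource {
        byte readByte(long pos) throws Exception;
        long length();
    }

    /** Deterministic byte expected at each position, so any read can be verified. */
    static byte expectedByte(long pos) {
        return (byte) (pos * 31 + (pos >>> 8));
    }

    /** Trivially correct in-memory source, used here only to exercise the harness. */
    public static RandomAccessSource inMemorySource(int size) {
        final byte[] data = new byte[size];
        for (int i = 0; i < size; i++) {
            data[i] = expectedByte(i);
        }
        return new RandomAccessSource() {
            public byte readByte(long pos) { return data[(int) pos]; }
            public long length() { return data.length; }
        };
    }

    /** Runs random reads from several threads and returns the number of bad reads. */
    public static long run(final RandomAccessSource src, int threads, final int readsPerThread)
            throws Exception {
        final AtomicLong mismatches = new AtomicLong();
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        List<Future<?>> futures = new ArrayList<>();
        for (int t = 0; t < threads; t++) {
            final long seed = t; // fixed per-thread seed keeps runs repeatable
            futures.add(pool.submit(() -> {
                Random rnd = new Random(seed);
                for (int i = 0; i < readsPerThread; i++) {
                    long pos = (long) (rnd.nextDouble() * src.length());
                    try {
                        if (src.readByte(pos) != expectedByte(pos)) {
                            mismatches.incrementAndGet(); // wrong data returned
                        }
                    } catch (Exception e) {
                        mismatches.incrementAndGet(); // e.g. AIOOB or EOF from the read path
                    }
                }
            }));
        }
        for (Future<?> f : futures) {
            f.get(); // wait for every reader to finish
        }
        pool.shutdown();
        return mismatches.get();
    }

    public static void main(String[] args) throws Exception {
        long bad = run(inMemorySource(1 << 20), 16, 100_000);
        System.out.println("mismatches: " + bad);
    }
}
```

      Verifying every read against a known pattern means that corrupted data is counted even when it does not happen to trigger an exception further up the stack.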

              People

              • Assignee: Yonik Seeley (yseeley@gmail.com)
              • Reporter: Yonik Seeley (yseeley@gmail.com)