Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-846

Out Of memory doing delta import with fetch size set to -1

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.3
    • 1.4
    • None
    • Linux 2.6.18-92.1.13.el5xen, mysql 5.0

    Description

      Database has about 3 million records. Doing full-import there is no problem. However, when a large number of changes occurred 2558057, delta-import throws OutOfMemory error after 1288338 documents processed. The stack trace is below

      Exception in thread "Thread-3" java.lang.OutOfMemoryError: Java heap space
      at org.tartarus.snowball.ext.EnglishStemmer.<init>(EnglishStemmer.java:4
      9)
      at org.apache.solr.analysis.EnglishPorterFilter.<init>(EnglishPorterFilt
      erFactory.java:83)
      at org.apache.solr.analysis.EnglishPorterFilterFactory.create(EnglishPor
      terFilterFactory.java:66)
      at org.apache.solr.analysis.EnglishPorterFilterFactory.create(EnglishPor
      terFilterFactory.java:35)
      at org.apache.solr.analysis.TokenizerChain.tokenStream(TokenizerChain.ja
      va:48)
      at org.apache.solr.schema.IndexSchema$SolrIndexAnalyzer.tokenStream(Inde
      xSchema.java:348)
      at org.apache.lucene.analysis.Analyzer.reusableTokenStream(Analyzer.java
      :44)
      at org.apache.lucene.index.DocInverterPerField.processFields(DocInverter
      PerField.java:117)
      at org.apache.lucene.index.DocFieldConsumersPerField.processFields(DocFi
      eldConsumersPerField.java:36)
      at org.apache.lucene.index.DocFieldProcessorPerThread.processDocument(Do
      cFieldProcessorPerThread.java:234)
      at org.apache.lucene.index.DocumentsWriter.updateDocument(DocumentsWrite
      r.java:765)
      at org.apache.lucene.index.DocumentsWriter.updateDocument(DocumentsWrite
      r.java:748)
      at org.apache.lucene.index.IndexWriter.updateDocument(IndexWriter.java:2
      118)
      at org.apache.lucene.index.IndexWriter.updateDocument(IndexWriter.java:2
      095)
      at org.apache.solr.update.DirectUpdateHandler2.addDoc(DirectUpdateHandle
      r2.java:232)
      at org.apache.solr.update.processor.RunUpdateProcessor.processAdd(RunUpd
      ateProcessorFactory.java:59)
      at org.apache.solr.handler.dataimport.SolrWriter.upload(SolrWriter.java:
      69)
      at org.apache.solr.handler.dataimport.DataImportHandler$1.upload(DataImp
      ortHandler.java:288)
      at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilde
      r.java:319)
      at org.apache.solr.handler.dataimport.DocBuilder.doDelta(DocBuilder.java
      :211)
      at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java
      :133)
      at org.apache.solr.handler.dataimport.DataImporter.doDeltaImport(DataImp
      orter.java:359)
      at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.j
      ava:388)
      at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.ja
      va:377)

      dataSource in data-config.xml has been with the batchSize of "-1".
      <dataSource driver="com.mysql.jdbc.Driver" url="jdbc:mysql://host/dbname"
      user="" password="" batchSize="-1"/>

      Attachments

        1. SOLR-846.patch
          2 kB
          Noble Paul

        Activity

          People

            Unassigned Unassigned
            kleung11 Ricky Leung
            Votes:
            1 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: