Mahout / MAHOUT-951

StackOverflow Error when using mahout lucene.vector

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.6
    • Fix Version/s: 0.7
    • Component/s: classic
    • Environment: presumably would occur on any platform given the same data.

    Description

      Running mahout lucene.vector can result in a Java StackOverflowError.

      I think the cause is that the current implementation of LuceneIterator.computeNext() is recursive, so with the right data the call stack grows too deep. The recursion only happens when the iterator hits a document that has no term vector in the specified field, so you need a Lucene index with lots of such documents in order to hit this problem.

      I've made minimal changes to replace the recursion with a loop, and I'll attach a patch to this ticket.
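
      To illustrate the kind of change described above, here is a minimal sketch of the general technique. It is not the attached patch and does not use Lucene or Mahout classes. The class SkipLoopSketch, the methods nextRecursive and nextIterative, and the use of a null array to stand for "document without a term vector" are all hypothetical names and assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-alone sketch; not the actual LuceneIterator code.
public class SkipLoopSketch {

    // Recursive form: each skipped document (null entry) adds one stack frame,
    // so a long run of documents without term vectors can exhaust the stack.
    static int nextRecursive(List<double[]> docs, int i) {
        if (i >= docs.size()) {
            return -1; // no more documents
        }
        if (docs.get(i) == null) {
            return nextRecursive(docs, i + 1); // skip by recursing
        }
        return i;
    }

    // Iterative form: skipping is done in a loop, so stack depth stays constant
    // no matter how many documents are skipped.
    static int nextIterative(List<double[]> docs, int i) {
        while (i < docs.size()) {
            if (docs.get(i) != null) {
                return i;
            }
            i++; // skip a document that has no term vector
        }
        return -1; // no more documents
    }

    public static void main(String[] args) {
        // Assumed test data: many documents without a term vector, then one with.
        int missing = 1000000;
        List<double[]> docs =
            new ArrayList<double[]>(Collections.<double[]>nCopies(missing, null));
        docs.add(new double[] {1.0, 2.0});

        System.out.println("iterative: " + nextIterative(docs, 0));
        try {
            System.out.println("recursive: " + nextRecursive(docs, 0));
        } catch (StackOverflowError e) {
            System.out.println("recursive: StackOverflowError");
        }
    }
}
```

      In this sketch the recursion depth equals the length of the run of skipped documents. That is why the recursive version only fails on data with many of them, while the loop version is unaffected.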

      Attachments

        1. 951.patch (3 kB, Paul Rudin)

          People

            Assignee: srowen Sean R. Owen
            Reporter: paulrudin Paul Rudin