Lucene - Core / LUCENE-1206

Ability to store Reader / InputStream fields

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: core/index
    • Labels: None
    • Lucene Fields: New

      Description

      In some situations we would like to store the whole text, but the whole text won't always fit in memory, so we can't create a String. Likewise, when storing binary data, it would sometimes be better if we didn't have to read it into a byte[] up front: even when that doesn't use much memory, it increases the number of copies made and adds to the burden on the GC.

      FieldsWriter currently writes the length at the start of the chunks though, so I don't know whether it would be possible to seek back and write the length after writing the data.
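
To make the "seek back" idea concrete, here is a minimal sketch using plain java.io rather than Lucene's own output classes. The class and method names (LengthBackpatchSketch, writeLengthPrefixed, readLengthPrefixed) are hypothetical, and the 8-byte length prefix is an assumption made for illustration, not FieldsWriter's actual format. Whether Lucene's output abstraction permits this kind of seek is exactly the open question above.

```java
import java.io.ByteArrayInputStream;
import java.io.File;
import java.io.IOException;
import java.io.InputStream;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;

// Hypothetical sketch: NOT Lucene's FieldsWriter, just the back-patching idea.
class LengthBackpatchSketch {

    /** Streams {@code in} to the file as [8-byte length][payload]; returns the payload length. */
    static long writeLengthPrefixed(RandomAccessFile out, InputStream in) throws IOException {
        long lengthPos = out.getFilePointer();
        out.writeLong(0L);                    // placeholder, patched once the length is known

        byte[] buf = new byte[8192];          // fixed-size buffer: the payload is never fully in memory
        long total = 0;
        int n;
        while ((n = in.read(buf)) != -1) {
            out.write(buf, 0, n);
            total += n;
        }

        long endPos = out.getFilePointer();
        out.seek(lengthPos);                  // seek back to the placeholder
        out.writeLong(total);                 // overwrite it with the real length
        out.seek(endPos);                     // restore the position for whatever is written next
        return total;
    }

    /** Reads one record back. Materializes the payload only to keep the sketch short. */
    static byte[] readLengthPrefixed(RandomAccessFile in) throws IOException {
        long length = in.readLong();
        byte[] data = new byte[(int) length];
        in.readFully(data);
        return data;
    }

    public static void main(String[] args) throws IOException {
        File f = File.createTempFile("field", ".bin");
        f.deleteOnExit();
        byte[] payload = "large field value".getBytes(StandardCharsets.UTF_8);
        try (RandomAccessFile raf = new RandomAccessFile(f, "rw")) {
            long written = writeLengthPrefixed(raf, new ByteArrayInputStream(payload));
            raf.seek(0);
            System.out.println(written + " bytes written, prefix says " + raf.readLong());
        }
    }
}
```

This only works when the output supports random access. For an append-only output, an alternative would be to write the data as a series of smaller length-prefixed chunks, so that no length has to be known before its chunk is buffered.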

      It would also be useful to use this in conjunction with compression, for both Reader and InputStream types. And when retrieving the field, it should be possible to create a Reader without reading the entire String into memory up front.
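
As a rough illustration of streaming compression on the way in and a lazy Reader on the way out, the sketch below uses only java.util.zip and java.io. The names StreamingCompressedTextSketch, store and open are hypothetical, and the choice of UTF-8 plus Deflate is an assumption for the example, not a statement about how Lucene encodes or compresses stored fields.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.Reader;
import java.io.StringReader;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.util.zip.DeflaterOutputStream;
import java.util.zip.InflaterInputStream;

// Hypothetical sketch: streams text through Deflate without building a String or byte[] of the whole value.
class StreamingCompressedTextSketch {

    /** Copies characters from {@code source} to {@code sink} as compressed UTF-8, one chunk at a time. */
    static void store(Reader source, OutputStream sink) throws IOException {
        DeflaterOutputStream deflater = new DeflaterOutputStream(sink);
        Writer writer = new OutputStreamWriter(deflater, StandardCharsets.UTF_8);
        char[] buf = new char[4096];
        int n;
        while ((n = source.read(buf)) != -1) {
            writer.write(buf, 0, n);
        }
        writer.flush();       // push any buffered characters into the deflater
        deflater.finish();    // complete the compressed stream without closing the sink
    }

    /** Returns a Reader that decompresses and decodes on demand as the caller reads. */
    static Reader open(InputStream stored) {
        return new InputStreamReader(new InflaterInputStream(stored), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        store(new StringReader("some large text, some large text, some large text"), sink);

        try (Reader reader = open(new ByteArrayInputStream(sink.toByteArray()))) {
            char[] buf = new char[16];
            int n;
            while ((n = reader.read(buf)) != -1) {
                System.out.print(new String(buf, 0, n));
            }
            System.out.println();
        }
    }
}
```

The in-memory streams in main stand in for index files. The point is that neither store nor open ever holds more than one buffer of the value at a time.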

        Activity

        Trejkaz created issue -
        Jan Høydahl added a comment -

        This issue has been inactive for more than 4 years. Please close if it's no longer relevant/needed, or bring it up to date if you intend to work on it. SPRING_CLEANING_2013

        Trejkaz added a comment -

        I think this would still be useful. The workaround of using a separate database to store larger text and binary data has never really sat well with me.


          People

          • Assignee: Unassigned
          • Reporter: Trejkaz
          • Votes: 0
          • Watchers: 2
