  Tika / TIKA-1438

PhoneExtractingContentHandler to not add individual MD entries for individual phone numbers

Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Not A Problem
    • Affects Version/s: None
    • Fix Version/s: 1.7
    • Component/s: None
    • Labels: None

    Description

      Right now the PhoneExtractingContentHandler adds each phone number as an individual metadata entry. I feel that this is cumbersome.

      For example, given a webpage containing several phone numbers, we end up with many fields of the same type, each holding a different value.
      I propose we reverse this and have one field with multiple values.

      I would fully understand the current behaviour if we wished to augment the phone numbers further by associating a dialing code, country, carrier, etc. However, we are not currently doing this.
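
      To make the two layouts concrete, here is a minimal, self-contained sketch using plain Java collections. It does not use or reproduce Tika's Metadata class or the handler's actual code. The class name, the method names, the sample numbers, and the field name "phonenumbers" are all assumptions made for illustration.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class PhoneMetadataSketch {

    // Hypothetical field name, chosen for illustration only.
    public static final String PHONE_FIELD = "phonenumbers";

    // Layout described as the current behaviour: one separate entry per
    // phone number, every entry carrying the same field name.
    public static List<Map.Entry<String, String>> repeatedEntries(List<String> numbers) {
        List<Map.Entry<String, String>> entries = new ArrayList<>();
        for (String number : numbers) {
            entries.add(new AbstractMap.SimpleEntry<>(PHONE_FIELD, number));
        }
        return entries;
    }

    // Proposed layout: a single field whose value is the list of all numbers.
    public static Map<String, List<String>> multiValuedField(List<String> numbers) {
        Map<String, List<String>> metadata = new LinkedHashMap<>();
        for (String number : numbers) {
            metadata.computeIfAbsent(PHONE_FIELD, key -> new ArrayList<>()).add(number);
        }
        return metadata;
    }

    public static void main(String[] args) {
        // Made-up sample numbers.
        List<String> numbers = List.of("555-0100", "555-0101", "555-0102");
        System.out.println(repeatedEntries(numbers));
        System.out.println(multiValuedField(numbers));
    }
}
```

      With the second layout a consumer looks up one key and iterates over its values, instead of scanning every entry for a matching field name.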

      Patch coming for trunk.

      Attachments

        1. TIKA-1438.patch
          0.8 kB
          Lewis John McGibbney

          People

            Assignee: Lewis John McGibbney (lewismc)
            Reporter: Lewis John McGibbney (lewismc)
            Votes: 0
            Watchers: 3

            Dates

              Created:
              Updated:
              Resolved: