Tika / TIKA-858

Add parsing support for ANPA-1312 news wire feeds

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.10
    • Fix Version/s: None
    • Component/s: mime, parser
    • Labels: new-parser

      Description

      This submission adds support for ANPA-1312 news wire feeds.

      ANPA-1312 is the format used by AP, AFP, NYT, and Reuters in their daily news wire broadcasts.

      This was a pretty significant development effort, so I am happy to share it back as a thank-you to the Tika community.

      1. 7901V5.pdf
        535 kB
        Craig Stires
      2. IptcAnpaParser.java
        34 kB
        Craig Stires
      3. org.apache.tika.parser.Parser_ANPA.patch
        0.5 kB
        Craig Stires
      4. tika-mimetypes_ANPA.patch
        0.7 kB
        Craig Stires

        Activity

        Tyler Palsulich made changes -
        Labels new-parser
        Craig Stires made changes -
        Attachment 7901V5.pdf [ 12514433 ]
        Craig Stires made changes -
        Attachment IptcAnpaParser.java [ 12512972 ]
        Craig Stires made changes -
        Craig Stires made changes -
        Attachment tika-mimetypes_ANPA.patch [ 12512970 ]
        Craig Stires created issue -

          People

          • Assignee:
            Unassigned
            Reporter:
            Craig Stires
          • Votes:
            0
            Watchers:
            1

            Dates

            • Created:
              Updated:
