Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-2250

Remove the x- prefix for some Microsoft image format mimetypes, eg BMP

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.14
    • Fix Version/s: 1.15
    • Component/s: mime
    • Labels:
      None

      Description

      Our main mime type for a number of Microsoft image formats, such as BMP or EMF, have an x- prefix because they weren't officially assigned

      In September of last year, Microsoft got round to doing the paperwork for getting several of these officially recognised, see https://tools.ietf.org/html/rfc7903 , so the main / canonical type can now be the one without the x-

        Activity

        Hide
        gagravarr Nick Burch added a comment -

        Mimetypes updated for WMF, EMF and BMP. The old ones are listed as aliases, since that's what many existing systems / uses will still call them as

        Show
        gagravarr Nick Burch added a comment - Mimetypes updated for WMF, EMF and BMP. The old ones are listed as aliases, since that's what many existing systems / uses will still call them as
        Hide
        hudson Hudson added a comment -

        UNSTABLE: Integrated in Jenkins build tika-2.x #205 (See https://builds.apache.org/job/tika-2.x/205/)
        TIKA-2250 As of RFC7903, the official mime type for BMP is now the one (nick: rev 58d56c33fc7d103e0c6875aa63f3377eaf8b7ae4)

        • (edit) tika-app/src/test/java/org/apache/tika/mime/TestMimeTypes.java
        • (edit) tika-core/src/test/java/org/apache/tika/mime/MimeTypesReaderTest.java
        • (edit) tika-app/src/test/java/org/apache/tika/parser/AutoDetectParserTest.java
        • (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
        • (edit) tika-core/src/test/java/org/apache/tika/parser/CompositeParserTest.java
        • (edit) tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java
        • (edit) tika-parser-modules/tika-parser-multimedia-module/src/main/java/org/apache/tika/parser/image/ImageParser.java
        • (edit) tika-server/src/test/java/org/apache/tika/server/TikaMimeTypesTest.java
        • (edit) tika-parser-modules/tika-parser-multimedia-module/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java
          TIKA-2250 As of RFC7903, the official mime type for WMF is now an image (nick: rev 6668d78fa73e050a2e36bf5bc57106c0640bff6b)
        • (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
        • (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/AbstractPOIContainerExtractionTest.java
        • (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java
        • (edit) tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java
        • (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/HSLFExtractor.java
        • (edit) tika-app/src/test/java/org/apache/tika/mime/TestMimeTypes.java
          TIKA-2250 As of RFC7903, the official mime type for EMF is now an image (nick: rev bd667acde6a48e118574129d79dfacb1c3c2db25)
        • (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java
        • (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
        • (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/AbstractPOIContainerExtractionTest.java
        • (edit) tika-parser-modules/tika-parser-multimedia-module/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java
        • (edit) CHANGES.txt
        • (edit) tika-app/src/test/java/org/apache/tika/mime/TestMimeTypes.java
        • (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/HSLFExtractor.java
        Show
        hudson Hudson added a comment - UNSTABLE: Integrated in Jenkins build tika-2.x #205 (See https://builds.apache.org/job/tika-2.x/205/ ) TIKA-2250 As of RFC7903, the official mime type for BMP is now the one (nick: rev 58d56c33fc7d103e0c6875aa63f3377eaf8b7ae4) (edit) tika-app/src/test/java/org/apache/tika/mime/TestMimeTypes.java (edit) tika-core/src/test/java/org/apache/tika/mime/MimeTypesReaderTest.java (edit) tika-app/src/test/java/org/apache/tika/parser/AutoDetectParserTest.java (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml (edit) tika-core/src/test/java/org/apache/tika/parser/CompositeParserTest.java (edit) tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java (edit) tika-parser-modules/tika-parser-multimedia-module/src/main/java/org/apache/tika/parser/image/ImageParser.java (edit) tika-server/src/test/java/org/apache/tika/server/TikaMimeTypesTest.java (edit) tika-parser-modules/tika-parser-multimedia-module/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java TIKA-2250 As of RFC7903, the official mime type for WMF is now an image (nick: rev 6668d78fa73e050a2e36bf5bc57106c0640bff6b) (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/AbstractPOIContainerExtractionTest.java (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java (edit) tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/HSLFExtractor.java (edit) tika-app/src/test/java/org/apache/tika/mime/TestMimeTypes.java TIKA-2250 As of RFC7903, the official mime type for EMF is now an image (nick: rev bd667acde6a48e118574129d79dfacb1c3c2db25) (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/AbstractPOIContainerExtractionTest.java (edit) tika-parser-modules/tika-parser-multimedia-module/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java (edit) CHANGES.txt (edit) tika-app/src/test/java/org/apache/tika/mime/TestMimeTypes.java (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/HSLFExtractor.java
        Hide
        hudson Hudson added a comment -

        SUCCESS: Integrated in Jenkins build Tika-trunk #1186 (See https://builds.apache.org/job/Tika-trunk/1186/)
        TIKA-2250 As of RFC7903, the official mime type for BMP is now the one (nick: rev 847156ac0f5fa7d4cc06964198359cf594b66d50)

        • (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
        • (edit) tika-core/src/test/java/org/apache/tika/parser/CompositeParserTest.java
        • (edit) tika-parsers/src/main/java/org/apache/tika/parser/image/ImageParser.java
        • (edit) tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java
        • (edit) tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java
        • (edit) tika-server/src/test/java/org/apache/tika/server/TikaMimeTypesTest.java
        • (edit) tika-core/src/test/java/org/apache/tika/mime/MimeTypesReaderTest.java
        • (edit) tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java
        • (edit) tika-parsers/src/test/java/org/apache/tika/parser/AutoDetectParserTest.java
          TIKA-2250 As of RFC7903, the official mime type for WMF is now an image (nick: rev e6c0082e41143a01f0bf646a8a8b6c06a85ca239)
        • (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
        • (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/AbstractPOIContainerExtractionTest.java
        • (edit) tika-parsers/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java
        • (edit) tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java
        • (edit) tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java
        • (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/HSLFExtractor.java
          TIKA-2250 As of RFC7903, the official mime type for EMF is now an image (nick: rev 90bf4f6e4c645240b36ded6973eb64961312fc0a)
        • (edit) tika-parsers/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java
        • (edit) tika-parsers/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java
        • (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/HSLFExtractor.java
        • (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
        • (edit) tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java
        • (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/AbstractPOIContainerExtractionTest.java
        • (edit) CHANGES.txt
        Show
        hudson Hudson added a comment - SUCCESS: Integrated in Jenkins build Tika-trunk #1186 (See https://builds.apache.org/job/Tika-trunk/1186/ ) TIKA-2250 As of RFC7903, the official mime type for BMP is now the one (nick: rev 847156ac0f5fa7d4cc06964198359cf594b66d50) (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml (edit) tika-core/src/test/java/org/apache/tika/parser/CompositeParserTest.java (edit) tika-parsers/src/main/java/org/apache/tika/parser/image/ImageParser.java (edit) tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java (edit) tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java (edit) tika-server/src/test/java/org/apache/tika/server/TikaMimeTypesTest.java (edit) tika-core/src/test/java/org/apache/tika/mime/MimeTypesReaderTest.java (edit) tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java (edit) tika-parsers/src/test/java/org/apache/tika/parser/AutoDetectParserTest.java TIKA-2250 As of RFC7903, the official mime type for WMF is now an image (nick: rev e6c0082e41143a01f0bf646a8a8b6c06a85ca239) (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/AbstractPOIContainerExtractionTest.java (edit) tika-parsers/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java (edit) tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java (edit) tika-core/src/test/java/org/apache/tika/TikaDetectionTest.java (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/HSLFExtractor.java TIKA-2250 As of RFC7903, the official mime type for EMF is now an image (nick: rev 90bf4f6e4c645240b36ded6973eb64961312fc0a) (edit) tika-parsers/src/test/java/org/apache/tika/parser/rtf/RTFParserTest.java (edit) tika-parsers/src/test/java/org/apache/tika/parser/pdf/PDFParserTest.java (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/HSLFExtractor.java (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml (edit) tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/AbstractPOIContainerExtractionTest.java (edit) CHANGES.txt

          People

          • Assignee:
            Unassigned
            Reporter:
            gagravarr Nick Burch
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development