Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Common Crawl data shows lots of text files which are being recognized as application/octet-stream. Some appear to be due to being in a language other than English.

       

      Various sample files attached.

      Attachments

        1. 1990-01.etc
          0.9 kB
          Gregory Lepore
        2. 2008-09.3
          36 kB
          Gregory Lepore
        3. 20220708 YouTube1-1.kif
          5 kB
          Gregory Lepore
        4. bub0336d.007
          9 kB
          Gregory Lepore
        5. pacman.nas
          26 kB
          Gregory Lepore
        6. shab3_36.qbp
          0.7 kB
          Gregory Lepore
        7. wots.diz
          0.2 kB
          Gregory Lepore

        Activity

          People

            Unassigned Unassigned
            greg@rhobard.com Gregory Lepore
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: