Details
-
Sub-task
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Common Crawl data shows lots of text files which are being recognized as application/octet-stream. Some appear to be due to being in a language other than English.
Various sample files attached.