Public signup for this instance is disabled. Go to our Self serve sign up page to request an account.
On TIKA-2440, Takahiro requested the ability to turn off extraction of phonetic runs. We should enable this for docx, too. We'll have to make fixes in POI for our DOM docx parser, but it should be fairly straighforward in our SAX docx parser.