-
Type:
New Feature
-
Status: Open
-
Priority:
Major
-
Resolution: Unresolved
-
Affects Version/s: None
-
Fix Version/s: None
-
Component/s: None
-
Labels:None
-
Patch Info:Patch Available
Right now, IDN's are indexed as ASCII. An IDNNormalizer is to be used with an indexer so it will encode ASCII URL's to their proper unicode equivalant.
- depends upon
-
NUTCH-1681 In URLUtil.java, toUNICODE method does not work correctly
-
- Resolved
-
- relates to
-
NUTCH-1320 IndexChecker and ParseChecker choke on IDN's
-
- Closed
-