JFlex 1.7.0, supporting Unicode 9.0, was released recently: http://jflex.de/changelog.html#jflex-1.7.0. We should upgrade.
StandardTokenizer doesn't separate hangul characters from other non-CJK chars
emoji sequence support in ICUTokenizer
Upgrade JFlex to 1.6.0
Update UAX29URLEmailTokenizer TLDs to latest list, and upgrade all JFlex-based tokenizers to support Unicode 8.0