The uax_url_email analyzer appears unable to recognize the ".local" TLD, among others. The bug can be reproduced with:
curl -XGET "ADDRESS/INDEX/_analyze?text=First%20Lastfirstname.lastname@example.org&pretty&analyzer=uax_url_email"
This parses "email@example.com" and "corp.local" as separate tokens, as opposed to:
curl -XGET "ADDRESS/INDEX/_analyze?text=First%20Lastfirstname.lastname@example.org&pretty&analyzer=uax_url_email"
which recognizes "email@example.com" as a single token.
Can this be fixed by updating to a newer version? I am running ElasticSearch 0.90.5 and whatever Lucene version sits underneath that. My suspicion is that the TLD list the analyzer relies on (http://www.internic.net/zones/root.zone, I think?) is incomplete and needs updating.
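To check the suspicion about the TLD list, here is a rough sketch that tests whether a given TLD has a delegation in a root zone file. It assumes the usual zone file layout (owner name, TTL, class, record type, data, separated by whitespace) and that the analyzer's list really is derived from that file, which I have not confirmed. The function names zone_tlds and tld_in_zone are made up for this example.

```shell
# Print the distinct TLDs that have NS records in a root zone file read from stdin.
# Assumes lines of the form: <owner>. <ttl> IN NS <nameserver>.
zone_tlds() {
  awk '$3 == "IN" && $4 == "NS" && $1 ~ /^[^.]+\.$/ {
         sub(/\.$/, "", $1)      # drop the trailing dot of the owner name
         print tolower($1)
       }' | sort -u
}

# Exit 0 if the TLD given as $1 appears in the zone file read from stdin, else non-zero.
tld_in_zone() {
  zone_tlds | grep -qxF -- "$(printf '%s' "$1" | tr 'A-Z' 'a-z')"
}
```

It could be run against the URL mentioned above like this (needs network access):

curl -s http://www.internic.net/zones/root.zone | tld_in_zone local && echo "listed" || echo "not listed"

If "local" turns out not to be listed there, the issue would be less an outdated list and more that the list only covers TLDs delegated in the root zone.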