A MERGE_BOTH option would be useful when training on some corpora. For example, in a Portuguese corpus we have:
... devolva - me o livro .... (give the book back to me)
We need to detokenize it as "devolva-me o livro". Configuring the "-" token as MERGE_BOTH in the detokenizer dictionary, so that it attaches to the tokens on both its left and its right, would be helpful.
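
To illustrate the requested behavior, here is a minimal sketch in Python. It is not the detokenizer's actual implementation: the function name `detokenize`, the `operations` mapping, and the way the `MERGE_BOTH` constant is represented are all assumptions made for this example.

```python
# Hypothetical sketch: these names are illustrative, not an existing API.
MERGE_BOTH = "MERGE_BOTH"


def detokenize(tokens, operations):
    """Join tokens with single spaces, except that a token mapped to
    MERGE_BOTH is attached to both its left and right neighbours."""
    out = []
    glue_next = False  # True when the previous token was a MERGE_BOTH token
    for token in tokens:
        merge_both = operations.get(token) == MERGE_BOTH
        if out and (merge_both or glue_next):
            # Attach to the previous piece without a space.
            out[-1] += token
        else:
            out.append(token)
        glue_next = merge_both
    return " ".join(out)


tokens = ["devolva", "-", "me", "o", "livro"]
print(detokenize(tokens, {"-": MERGE_BOTH}))  # devolva-me o livro
```

With an empty `operations` mapping the same tokens come back as "devolva - me o livro", which is the current, undesired result.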