Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3329

RTG Translator with many-to-eng translation

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 2.0.0
    • translation

    Description

      The existing translation services in tika-translate are either commercial/paid engines (e.g. Google, Microsoft  etc ) or not state of the art (such as Joshua, Moses etc).

      Reader Translator Generator () is a neural machine translation toolkit https://isi-nlp.github.io/rtg/
       and has the implementation of Transformer NMT model (current state of the art).
      It also has massively multilingual pretrained NMT model  ( many-to-English translation direction)  https://hub.docker.com/repository/docker/tgowda/rtg-model
      in which about 500 source languages are represented, with atleast ~300 source languages have good enough quality (For a comparison Google translate has ~106 languages, and Microsoft has about 80 languages). 

      This issue is for integrating RTG Translator into tika-translate
       

      Attachments

        Activity

          People

            mattmann Chris Mattmann
            thammegowda Thamme Gowda
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: