TAUS Launches Data-Enhanced Machine Translation
19 Jan 2022
TAUS Data-Enhanced Machine Translation provides a one-click solution to customized MT, powered by Amazon Translate and enhanced by TAUS Data.

Amsterdam, January 19, 2022 - TAUS, the one-stop language data shop established through deep knowledge of the language industry, globally sourced community talent, and in-house NLP expertise, launches a new service on its Data Marketplace: Data-Enhanced Machine Translation (DEMT).

MT customization essentially requires two elements: an MT engine and training data. By combining both in a single online service, DEMT offers an end-to-end solution for those who want customized MT output for their specific domains without the hassle of going through the actual MT training process. Users simply drop the file they would like to machine translate and select the datasets to be used for customization. In the background, our technology processes the file through an integration with Amazon Translate's Active Custom Translation feature, feeding the selected training dataset into the engine to produce highly customized output. The translated file is then sent directly to the user’s inbox.
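For readers curious about what such a customization request might look like, the following is a minimal sketch, not TAUS's actual implementation. It assumes the file is submitted as an Amazon Translate batch job and that the selected dataset has already been registered as a parallel data resource. The function name, S3 paths, role ARN, and dataset name are all hypothetical placeholders; only the request field names follow Amazon Translate's StartTextTranslationJob operation.

```python
# Hypothetical sketch: the helper name, bucket paths, role ARN, and
# parallel-data name below are placeholders, not values used by TAUS.

def build_act_job_request(job_name, input_s3_uri, output_s3_uri, role_arn,
                          source_lang, target_lang, parallel_data_name):
    """Assemble keyword arguments for an Amazon Translate batch job.

    Listing a parallel data resource in ParallelDataNames is what asks
    Amazon Translate to apply Active Custom Translation to the job.
    """
    return {
        "JobName": job_name,
        "InputDataConfig": {"S3Uri": input_s3_uri, "ContentType": "text/plain"},
        "OutputDataConfig": {"S3Uri": output_s3_uri},
        "DataAccessRoleArn": role_arn,
        "SourceLanguageCode": source_lang,
        "TargetLanguageCodes": [target_lang],
        "ParallelDataNames": [parallel_data_name],
    }


request = build_act_job_request(
    job_name="demo-medical-en-de",
    input_s3_uri="s3://example-bucket/input/",
    output_s3_uri="s3://example-bucket/output/",
    role_arn="arn:aws:iam::123456789012:role/ExampleTranslateRole",
    source_lang="en",
    target_lang="de",
    parallel_data_name="example-medical-en-de",
)

# With AWS credentials configured, the request could be submitted like this:
#   import boto3
#   boto3.client("translate").start_text_translation_job(**request)
```

Keeping the request assembly separate from the network call makes the sketch easy to inspect without an AWS account; the commented-out lines show where the job would actually be started.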

“Generic MT engines are widely available. But to ensure that MT can handle domain-specific content well, proper customization is key,” says Jaap van der Meer, Director at TAUS. “With the TAUS DEMT service, we have made customized, affordable and high-quality machine translation accessible to anyone, regardless of their expertise or access to relevant training data. Based on the independent analysis, BLEU score points are proven to increase by 15.3% on average in the Ecommerce, Medical and Financial domains with the TAUS DEMT.”

The impact of the training datasets available for the DEMT service has been independently evaluated by Polyglot Technology LLC. “In total, we evaluated 8 language pairs for the E-Commerce domain, 18 language pairs for the Medical/Pharma domain and 4 language pairs for the Financial domain,” says Achim Ruopp, Owner at Polyglot Technology. “The customization of Amazon Translate with TAUS Data always improved the BLEU score measured on the test sets by more than 6 BLEU points on average and 2 BLEU points at a minimum. These are significant improvements that demonstrate the superiority of this customized translation for the E-Commerce, Medical/Pharma and Financial domains over non-customized MT outputs.”

The detailed analysis can be downloaded here. The library of available datasets for the DEMT service is planned to grow. You can try the TAUS DEMT service here.

 

About TAUS

TAUS was founded in 2005 as a think tank with a mission to automate and innovate translation. Ideas transformed into actions. TAUS has become the one-stop language data shop, established through deep knowledge of the language industry, globally sourced community talent, and in-house NLP expertise. We create and enhance language data for the training of better, human-informed AI services.

Our mission today is to empower global enterprises and their service and technology providers with data solutions that help them to communicate in all languages, faster, better, and more efficiently.

For more information, visit https://www.taus.net/

 

Author
Şölen Aslan

Şölen is the Head of Digital Marketing at TAUS, where she leads digital growth strategies with a focus on generating compelling results via search engine optimization, effective inbound content, and social media. She has over seven years of experience in related fields. She holds BAs in Translation Studies and Brand Communication from Istanbul University, in addition to an MA in European Studies: Identity and Integration from the University of Amsterdam. After gaining experience as a transcreator for marketing content, she worked in business development for a mobile app and in content marketing before joining TAUS in 2017. She believes in keeping up with modern digital trends and in the power of engaging content. She also writes regularly for the TAUS Blog/Reports and manages several social media accounts, with over 100K followers, that she created on topics of personal interest.
