icons-social-media-facebook-circleicons-social-media-twitter-circleicons-social-media-linked-in-circle
Domain-Specific Training Data Generation for SYSTRAN
icons-action-calendar19 Feb 2021
10 minute read
After the training with the TAUS datasets in the pandemic domain, the SYSTRAN engines improved on average 18% across all twelve language pairs compared to the baseline engines.

When the global pandemic hit the world in 2020, TAUS created a starter kit in several languages to train high-quality translation models customized for the pandemic domain. SYSTRAN, a leading AI-based translation technology company, partnered with TAUS to use these datasets to produce twelve translation models for English to/from French, Spanish, German, Italian, Chinese and Russian and make them available on SYSTRAN Marketplace where NMT models are offered to a network of language experts to train models in any language pair and domain.

After the training with the TAUS Corona datasets, the SYSTRAN engines improved on average 18% across all twelve language pairs compared to the SYSTRAN baseline engines.

 

domain-specific-training-data-generation-for-systran
Author
şölen-aslan

Şölen is the Head of Digital Marketing at TAUS where she leads digital growth strategies with a focus on generating compelling results via search engine optimization, effective inbound content and social media with over seven years of experience in related fields. She holds BAs in Translation Studies and Brand Communication from Istanbul University in addition to an MA in European Studies: Identity and Integration from the University of Amsterdam. After gaining experience as a transcreator for marketing content, she worked in business development for a mobile app and content marketing before joining TAUS in 2017. She believes in keeping up with modern digital trends and the power of engaging content. She also writes regularly for the TAUS Blog/Reports and manages several social media accounts she created on topics of personal interest with over 100K followers.

Related Articles
icons-action-calendar19 Sep 2022

Working as a collaborative partner, our language data for MT training solutions helped facilitate an MT experiment to inform the efficiency of automated translation processes for ING Hubs Poland

, a leading multinational banking and financial services corporation. The TAUS datasets improved the number of translations rated perfect by human testers by 15% and it was observed that the output from the engine trained with TAUS datasets will be better than the untrained 95% of the time in Anti Money Laundering (AML) and Human Resources (HR) domains.

icons-action-calendar1 Feb 2022

TAUS provided 172.980 segments of training data in French-German language pair, in a very specific area of the broadly legal domain for Custom MT, one of the latest and leading MT services companies delivering affordable machine translation engine training, evaluation, and integration.

icons-action-calendar19 Jan 2022

Online machine translation engines provide easy access to high-quality machine translations. They are optimized for content like news articles and social media posts that users of online platforms frequently translate.