Şölen is the Head of Digital Marketing at TAUS where she leads digital growth strategies with a focus on generating compelling results via search engine optimization, effective inbound content and social media with over seven years of experience in related fields. She holds BAs in Translation Studies and Brand Communication from Istanbul University in addition to an MA in European Studies: Identity and Integration from the University of Amsterdam. After gaining experience as a transcreator for marketing content, she worked in business development for a mobile app and content marketing before joining TAUS in 2017. She believes in keeping up with modern digital trends and the power of engaging content. She also writes regularly for the TAUS Blog/Reports and manages several social media accounts she created on topics of personal interest with over 100K followers.
After three years of travel restrictions, TAUS Massively Multilingual Conference made its way back to San Jose, California on 11-13 October 2022. TAUS Founder and CEO Jaap van der Meer gave the opening keynote addressing the changes the world has gone through in the last few years,- from bigger geolinguistic shifts, fragmentation of globalization, and rising populist movements in the world, to more migration than ever. Referencing the debate between Nicholas Ostler and Lane Greene at the 2013 TAUS event about whether English or MT was the new lingua franca, he then asked, based on all these changes, where is English today as a language. Without a doubt, the world is massively multilingual today. New ways of supporting this new world should be the focus of technology. Jaap asked “how do we support a world with total diversity and inclusion?” and answered, “we need a lot of data and humans in the loop.”
Working as a collaborative partner, our language data for MT training solutions helped facilitate an MT experiment to inform the efficiency of automated translation processes for ING Hubs Poland
, a leading multinational banking and financial services corporation. The TAUS datasets improved the number of translations rated perfect by human testers by 15% and it was observed that the output from the engine trained with TAUS datasets will be better than the untrained 95% of the time in Anti Money Laundering (AML) and Human Resources (HR) domains.
Amsterdam, June 8, 2022 – TAUS, a leading provider of human-powered language data for AI solutions, has published the TAUS DeMT™ Evaluation Report 2022. The report compares TAUS DeMT™ performance against available major machine translation engines in 8 language pairs for the eCommerce domain, 18 language pairs for the Medical/Pharma domain, and 4 language pairs for the Financial domain. This report’s findings are based on Polyglot Technology LLC’s independent analysis to benchmark the machine translation quality of DeMT™ relative to other major online machine translation providers.
AMSTERDAM, June 8, 2022 – TAUS, a leading provider of human-powered language data for AI solutions, announced today that it has been listed as a Representative Vendor in the Gartner Market Guide for AI-Enabled Translation Services 2022 report.
The AI scene of the 2010s was shaped by breakthroughs in vision-enabled technologies, from advanced image searches to computer vision systems for medical image analysis or for detecting defective parts in manufacturing and assembly. The 2020s, however, are foreseen to be all about natural language technologies and language-based AI tasks. NLP, NLG, NLQ, NLU… The list of abbreviations starting with NL (Natural Language) seems to grow each day. Regardless of the technology domain, it’s observed that natural language technologies will be in a field-shaping position in a variety of areas from business intelligence and healthcare to fintech.
Amsterdam, February 16, 2022 - TAUS, the one-stop language data shop, is pleased to announce that TAUS data products and data services are now available to users of Amazon Translate on the AWS Marketplace. This first step marks the beginning of continuous collaboration between TAUS and Amazon Translate. As of today, AWS customers can review and buy 30 bilingual corpora in the Ecommerce, Medical/pharmaceutical and Finance domains. The languages available are as follows:
TAUS provided 172.980 segments of training data in French-German language pair, in a very specific area of the broadly legal domain for Custom MT, one of the latest and leading MT services companies delivering affordable machine translation engine training, evaluation, and integration.
Amsterdam, January 19, 2022 - TAUS, the one-stop language data shop established through deep knowledge of the language industry, globally sourced community talent, and in-house NLP expertise, launches a new service: Data-Enhanced Machine Translation (DEMT) on their Data Marketplace.
Technologies such as Natural Language Processing (NLP), deep learning and computer vision have been thriving since data science has become well-established as a field of study and expertise. These developments have paved the way for the rise of machine learning (ML) to achieve the concept of artificial intelligence (AI). The transformative effects of these new technologies continue to be observed in our daily lives at a gradually increasing pace as we move into 2022.
Amsterdam, 7 October 2021
TAUS Data Marketplace has brought new opportunities to everyone, from individual linguists and LSPs to data and publishing companies, to leverage and monetize their content. The key to being a part of the surging trend of language data for AI is the successful conversion of available multilingual content into language data that is directly usable for AI model training.
Finding high-quality data for MT training has always been a challenge on the path to generating high-performing MT output. The challenge increases when the language pairs are rare or when training data in a lesser-known domain is needed.
Globalization lies at the core of the contemporary age. Knowing this, it’d be ideal to think that the enterprise of academic research would capitalize on contributions from researchers globally and also wants these contributions to be accessible by all students and academics all around the world. Yet, language barriers still present a considerable stumbling block when it comes to the global circulation of academic findings. English is the dominant language in the academic world, which means that researchers around the world are under pressure to publish their findings in English and academic students are expected to understand and digest all of these significant findings in English. This overall contributes to the creation of an academic monoculture.
Amsterdam, Paris - May 3, 2021 - TAUS and SYSTRAN announce that they have formed a strategic alliance. Both companies operate in the market of automated translation, but each of them brings a unique perspective and solution. TAUS specializes in language data that is needed to train and customize the translation engines, whereas SYSTRAN specializes in machine translation technology for professional users. By combining the expertise of the two, translation users will access a unique solution on the market.
The total volume of data created worldwide is expected to reach 149 zettabytes by 2045. Therefore, capitalizing on data has become as important as human, financial, or any other capital. Data as capital has gained even more importance now that data-trained systems start to dominate all imaginable aspects of the world we live in.
Data annotation is the categorization and labeling of data to be used in the training of AI applications. Training datasets must be carefully categorized and annotated for each specific use case. High-quality, human-powered data annotation allows companies to build and improve AI implementations which results in enhanced customer experience solutions such as product recommendations, relevant search engine results, computer vision, speech recognition, chatbots, and more.
Bias as a general term defines the tendency to lean towards a certain direction either in favor of or against the given topic, person or thing. When it comes to data science, a biased dataset can be classified as one that doesn’t represent a model’s use case fully and therefore produces results that are skewed or inaccurate and includes analytical errors.
In 2020, when the global pandemic struck the world, humanity had a groundbreaking learning experience that has forever altered almost every aspect of human interaction. But even before then, artificial intelligence (AI) and the branch of AI known as machine learning (ML) were already on the path to fundamentally reshape almost every industry.
When the global pandemic hit the world in 2020, TAUS created a starter kit in several languages to train high-quality translation models customized for the pandemic domain. SYSTRAN, a leading AI-based translation technology company, partnered with TAUS to use these datasets to produce twelve translation models for English to/from French, Spanish, German, Italian, Chinese and Russian and make them available on SYSTRAN Marketplace where NMT models are offered to a network of language experts to train models in any language pair and domain.