Synthetic parallel data generation by back-translation as a solution for the problem of translating low-resource languages and texts from low-resource domains.
Junior Machine Learning Engineer at TAUS with a background in linguistics, anthropology and text mining. Passionate about implementing state-of-the-art NLP solutions and doing the data work, while also following engineering best practices.
Purchase TAUS's exclusive data collection, featuring close to 7.4 billion words, covering 483 language pairs, now available at discounts exceeding 95% of the original value.
Explore the crucial role of language data in training and fine-tuning LLMs and GenAI, ensuring high-quality, context-aware translations, fostering the symbiosis of human and machine in the localization sector.