TAUS AIQE presents a cutting-edge solution that uses artificial intelligence to evaluate the quality of translations, by minimizing the need for reference translations and reducing reliance on human reviewers
We have conducted extensive tests to explore its potential benefits and limitations for multilingual MTPE projects. This article delves into what TAUS AIQE offers, how it works, and whether it is feasible for real-world translation projects.
What Is Quality Estimation?
Quality Estimation (QE) is an automated process that evaluates the quality of translations without requiring reference texts or manual oversight. Using AI models trained on examples of both high- and low-quality translations, QE generates scores that predict the quality and fluency of machine-translated segments.
These scores can help streamline translation workflows by:
- Identifying segments that require human post-editing.
- Automating translation routing.
- Supporting quality assurance efforts with targeted sampling.
The TAUS Estimate API is a notable implementation of QE, leveraging quality-labelled datasets from TAUS' extensive repository of multilingual data. This API is designed for seamless integration into Translation Management Systems (TMS), Machine Translation (MT) engines, and enterprise applications. Through its REST interface, users can submit translation segments for evaluation and receive real-time scores.
How TAUS QE Works
TAUS QE operates on two main types of models:
- Generic Models: Trained on multilingual datasets from the TAUS Data Repository, these models support over 100 languages. While versatile, their performance may vary depending on language and domain.
- Custom Models: Built specifically for clients, these models incorporate specialized jargon, brand terminology, and domain-specific data to deliver tailored quality scores.
The API offers several QE metrics, including the proprietary TAUS QE score and custom scoring systems, with potential compatibility with external metrics like COMET, allowing flexibility for different project requirements. Scores range from high-quality predictions (above 0.9) to poor-quality warnings (below 0.7).
Unlike traditional Translation Memory (TM) systems that rely on reference matches, TAUS QE evaluates accuracy and fluency independently. This allows it to predict quality for entirely new translations, making it an excellent complement to TM-based workflows.
What Our Tests Revealed
In our tests, the TAUS QE scores were used to predict which segments would require post-editing and to assess whether the MT output was usable as it is. The results were promising but highlighted several challenges.
The API excelled in identifying high-quality segments and providing a numerical confidence score for each translation. For technical content and straightforward text, this approach helped reduce post-editing efforts. However, for creative or context-heavy texts, where nuances and cultural adaptations are critical, the scores were less reliable.
We also noted the following challenges:
- Integration Issues: While the TAUS QE API integrates with memoQ, the implementation lacks full automation, which poses several challenges. Key settings for using the API cannot be saved to project templates, requiring manual configuration for each new project. Additionally, QE scores are not incorporated into memoQ's CAT Analysis, meaning segments with quality scores are treated as "no match" by default. This limits the ability to accurately estimate project costs, manage resources, and streamline workflows. These integration gaps increase manual effort for project managers and reduce the overall efficiency of incorporating QE into MTPE workflows.
- Variability in Language Support: While the generic model supported multiple languages, poorly resourced or less commonly spoken languages produced less reliable scores.
- Lack of Detailed Error Feedback: The scores provided no insight into specific issues within low-quality segments, leaving post-editors to manually identify errors.
- Creative Content Limitations: Like most QE tools, the model struggled with creative texts, where translation quality is subjective and dependent on stylistic and cultural considerations.
Strengths and Weaknesses of TAUS QE
One of the most significant advantages of TAUS QE is its ability to automate quality checks, reducing reliance on human intervention. It allows Kobalt to quickly identify segments that need post-editing, making it particularly useful for large-scale multilingual projects. Additionally, integration with CAT tools like memoQ is possible via APIs, enabling translators to incorporate scores into their workflows with custom implementation, enhancing efficiency.
However, there are limitations to consider. Over-reliance on automated scores could lead to overlooked errors in high-quality segments, especially for complex or context-sensitive content. Furthermore, initial setup and integration efforts may pose a challenge for teams working with tightly automated TMS workflows.
Is TAUS QE Feasible for Us?
While TAUS QE shows immense potential, its feasibility depends on addressing several key challenges. Integration with TMS and CAT tools needs to be seamless to avoid disrupting workflows, and the model’s limitations with certain languages and content types must be carefully managed.
That said, its strengths are undeniable. For technical and straightforward texts, TAUS QE offers a fast, scalable solution to evaluate and prioritize post-editing tasks. With custom models and continued refinement, it could also support more nuanced projects in the future.
Ultimately, TAUS QE is best viewed as a complementary tool. It enhances productivity by guiding post-editors and project managers but cannot entirely replace the expertise of human translators.
In a few words…
TAUS AIQE represents a promising step forward in automating translation workflows. By integrating AI-driven quality predictions, companies can reduce costs, improve efficiency, and better allocate resources for post-editing. While challenges like integration and creative content handling remain, the tool’s potential for transforming MTPE workflows is clear.
With further exploration and collaboration with technology providers, this tool has the potential to help us manage large volumes of content in optimal times while preserving quality. By implementing automated quality estimation, we can ensure that clients receive faster turnaround times without compromising accuracy, enabling them to meet their own business goals and maintain their standards for multilingual communication.
Leave your comment!
More...
LLMs, SMLs, Quantum Computing and Data Centres in Space
Myriam Garcia
Operations Manager
The Future of Global Content Creation? Takeaways from the TAUS Massively Multilingual AI Conference
Ricard Sierra
CEO
Kobalt Languages Revolutionizes Video Localization with Cutting-Edge AI Technology
Kobalt
Content without borders
This Website uses third-party cookies for analytical purposes. Access to and use of the Website implies your acceptance. For more information, please visit our Cookie Policy.
more informationI agree