Friday, March 11, 2022

The Evolving Relationship of MT with the Translator

Machine translation is pervasive today. Even the most conservative estimates say that MT is “translating” trillions of words a month across multiple large public MT portals and is used daily by hundreds of millions of internet users at virtually no cost.

As more of the global population comes online, people need MT to access the content that interests them, even if only at a gist level. Today there is growing momentum in advancing the state of the art (SOTA) for “low-resource” languages (those with limited or scarce data), which will further accelerate global MT use.

MT technology has been around in some form for the last 70 years and unfortunately has a long history of over-promising and under-delivering. A history of eMpTy promises, as it were. However, the more recent history of data-driven MT has been especially troubling for translators, as SMT and NMT pioneers have repeatedly claimed to have reached human parity.

These over-exuberant claims about the accomplishments of MT technology have driven translator compensation down and have made many would-be translators reconsider their career choice.

It does not help that more careful examination of the human parity claims by experts shows that they are not true, or are true only for a tiny sample of test sentences.

Many say that the market perception of exaggerated MT capabilities has damaged translators' livelihoods. There is also great frustration among many who use MT in production environments, where high-quality, human-equivalent translation is expected but never delivered without significant additional effort and expense.

To add insult to injury, the overly optimistic MT performance claims have also resulted in many technology-incompetent LSPs attempting to use MT to reduce costs by forcing translators to post-edit low-quality MT output at low rates.

It does not seem to matter that most LSPs have yet to learn to use MT properly in localization production work, according to a survey of MT use by LSPs done by Common Sense Advisory last year.

It is also telling that a blog post the author wrote on MT post-editing compensation in March 2012 has had the widest readership of any post he has ever written, and continues to be actively read even in 2022!

Thus, "monolithic MT" is often considered a dark, useless, and unwelcome factor in the lives of translators. However, this state of affairs is usually the result of incompetent and unethical use of the technology rather than a core characteristic of the technology itself.


The Content and Demand Explosion

However, the news on MT is not all doom and gloom from the translator's perspective. There is huge demand for language translation, as evidenced by the volume of public MT use and by the digital transformation imperatives of global enterprises, which drive the need for better professional MT.

Both public MT and enterprise MT are building momentum. The demand for content from across the globe is growing exponentially, which means that translation volumes will also likely explode. While much of this volume can be handled by carefully optimized enterprise MT, meeting it will also require an ever-growing pool of tech-savvy translators to drive continuously improving MT technology.

The World Bank estimates that by 2022, yearly internet traffic will have increased by about 50 percent from 2020 levels, reaching 4.8 zettabytes, the equivalent of roughly 150,000 GB per second. The growth in global internet traffic is as dazzling as its volume. Personal data are expected to represent a significant share of the total volume of data transferred across borders.


It is estimated that the amount of digital data created over the next five years will be more than twice the amount created since the advent of digital storage. Global data creation and replication are expected to grow at a compound annual rate of 23 percent over the 2020–2025 forecast period (IDC, 2021a). Data traffic trends are related to economic development, value creation, and prosperity.

The explosion in content volume driven by these trends is already creating an increasing awareness of the shortage of translators. The furor around the poor quality of the subtitles for the Korean hit show “Squid Game” is a telling example of this changing scene.

LSPs and translators are critical to the distribution of that local content on a global scale. But because of a labor shortage and no viable automated solution, the translation industry is being pushed to its limits.

“I can tell you literally, this industry will be out of supply over demand for the upcoming two to three years,” David Lee, the CEO of Iyuno-SDI, one of the industry’s largest subtitling and dubbing providers, said recently. “Nobody to translate, nobody to dub, nobody to mix –– the industry just doesn’t have enough resources to do it.” Interviews with industry leaders reveal most streaming platforms are now at an inflection point, left to decide how much they are willing to sacrifice on quality to subtitle their streaming roster.

So while it is true that as we enter 2022 most LSPs have yet to learn how to use MT efficiently for production use, and that translator compensation at the word level has been decreasing over the last five years, there are also positive changes.

The Translated Srl experience with ModernMT shows that it is possible to use MT effectively for production localization work: Translated uses MT in 95% of its production workload, mainly because the technology is flexible, easy to set up, highly responsive, and agile enough to handle the variation typical of production work.

This is the result of superior architecture, better process integration, and sensitivity to human factors, refined over decades to ensure sustainable and increasing productivity improvements.


The Translated Srl experience is also direct proof that MT can be a valuable assistive technology for serious, i.e., professional, human translation work.

The ModernMT technology is perhaps the only MT technology optimized for production localization work and is already in the process of being extended to work with video content (MateDub & MateSub). Video adds time synchronization challenges to the basic translation tasks.


The Importance of the Human-In-The-Loop

Exploding content volumes and enterprise CX demands for more relevant customer content also suggest that rates could rise as more enterprises begin to understand that improving translation quality must be linked to an increased role for humans in the loop, who make MT perform better on the specific content that matters to the enterprise.

As we consider the possibility of MT achieving human parity on language translation at production scale, we need to remind ourselves of the following: language is the cornerstone of human intelligence.

The emergence of language was the most important intellectual development in our species’ history. It is what separates us from all other species on the planet. It is through language that we formulate thoughts and communicate them to one another. Language enables us to reason abstractly, to develop complex ideas about what the world is and could be, and to build on these ideas across generations and geographies. Almost nothing in modern civilization would be possible without language.

Building machines that can “understand” language has thus been a central goal of the field of artificial intelligence dating back to its earliest days, but this has proven to be maddeningly elusive. The current state of MT is the result of 70 years of effort, and having a machine master language may either be impossible or simply much farther out in the future than the ML-focused singularity-is-nigh fanboys can envision.

This is because mastering language is what is known as an “AI-complete” problem: that is, an AI that can understand language the way a human can would, by implication, be capable of any other human-level intellectual activity. Put simply, to solve the language challenge is to create human-equivalent machine intelligence.

Competent linguistic feedback is needed to improve the state of MT technology, and humans are needed to improve the quality of MT output for enterprise use.

We see today that machine translation is ubiquitous, and by many estimates is responsible for 99.5% or more of all language translation done on the planet on any given day. But we also see that MT is used mostly to translate material that is voluminous, short-lived, and transitory, and that would never get translated if the machine were not available to help.

Trillions of words are being translated by MT weekly, yet when it matters, there is always human oversight on translations that may have a high impact, or when there is great potential risk or liability from mistranslation.

While machine learning use-cases continue to expand dramatically, there is also an increasing awareness that a human-in-the-loop is necessary since the machine lacks comprehension, cognition, and common sense, all elements that constitute “understanding”.

As Rodney Brooks, the co-founder of iRobot, said in a post entitled “An Inconvenient Truth About AI”: "Just about every successful deployment of AI has either one of two expedients: It has a person somewhere in the loop, or the cost of failure, should the system blunder, is very low."

As the use of machine learning proliferates, there is an increasing awareness that humans working together with machines in an active learning contribution mode can often outperform the possibilities of machines or humans alone.

Many of the public generic MT engines already have billions of sentence pairs underlying and “training” their models. Yet we see an increasing acknowledgment from the AI community that language is indeed a hard problem, one that cannot necessarily be solved by more data and algorithms alone, and a growing awareness that other strategies will need to be employed.

This does not mean that these systems cannot be useful, but we are beginning to understand that while language AI tools are useful, they have to be used with care and human oversight, at least until machines have more robust comprehension and common sense.

Effective human-in-the-loop (HITL) implementations allow the machine to capture an increasing amount of highly relevant knowledge and enhance the core application, as ModernMT does with MT.

Another way to look at this is to see the Language AI or MT model as a prediction system, rather than as a representative model of a human translator.

Very simply put, we are using information that we do have to generate information that we don’t have.

MT models are built primarily from translation memory (a.k.a. training data) and are most successful with material that closely resembles that data. An MT model takes new source material and produces a prediction of it in the target language, based on what it has learned from the data it was explicitly trained on.
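To make this prediction framing concrete, here is a minimal, hypothetical sketch in Python. The toy translation memory, function names, and threshold are all invented for illustration and are not ModernMT's (or any vendor's) actual API; the only point is that a TM-trained model is most trustworthy on segments that resemble what it has already seen.

    from difflib import SequenceMatcher

    # Toy translation memory: (source, target) pairs standing in for training data.
    TM = [
        ("The pump must be switched off before maintenance.",
         "La pompe doit etre arretee avant toute maintenance."),
        ("Wear protective gloves when handling the unit.",
         "Portez des gants de protection pour manipuler l'appareil."),
    ]

    def similarity(a, b):
        """Rough fuzzy-match score between two source segments (0..1)."""
        return SequenceMatcher(None, a.lower(), b.lower()).ratio()

    def prediction_confidence(new_source, tm=TM, threshold=0.6):
        """Gauge how reliable a TM-trained model's prediction is likely to be.

        The closer the new source is to something seen in training,
        the more trustworthy the MT "prediction" tends to be.
        """
        best_score = max(similarity(new_source, src) for src, _ in tm)
        verdict = "likely reliable" if best_score >= threshold else "expect heavier post-editing"
        return best_score, verdict

    score, verdict = prediction_confidence(
        "The pump must be switched off before any repair work.")
    print(f"fuzzy match {score:.2f}: {verdict}")

This is only a sketch of the idea, not a description of how any production MT engine works, but it captures why prediction quality degrades as new material drifts away from the training data.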

With deep learning, pattern detection and prediction have become more sophisticated, but we are still quite some distance from actual understanding, comprehension, and cognition.

The cognitive flow in the brain of a competent human translator involves significantly more sophisticated capabilities across the many translation-related sub-tasks that require actual intelligence, gathered from multisensory life experience and common sense.

Human translators understand the relevant documentary, historical, and situational context even when it is not explicitly stated. They identify semantic intent and add cultural context to the translation, reading between the lines and drawing on common sense, life experience, insight, and deep comprehension to ensure overall accuracy, even on what is not stated but can be “understood”.

This is in stark contrast to the literal conversion of word strings and patterns from a source language to a target language that MT systems are limited to. Systems trained on billions of example "training" sentences have yet to capture what humans do. More data is not enough.

To restate, it is more accurate to see the MT model as a prediction system rather than an understanding system. Much of the recent success with AI and machine learning is a result of converting problems that were not historically prediction problems into prediction problems e.g. self-driving cars, fraud detection, and automated email replies.

MT systems are most useful when they produce a large number of useful predictions, even if these are not "perfect". When MT is responsive, continuously learning, and a true assistant, it is as useful to a translator as TM, and perhaps even more so.

The development and deployment of such a prediction model follow the generic overview shown below, which holds true for MT and many other ML use cases.


Once a model has been deployed, ongoing improvement in its predictive ability can be driven by more data, better learning algorithms, more computing power, and corrective feedback, which becomes increasingly important as an ML model matures in competence and performance.

After 70 years of MT research, it is increasingly clear that the efficient incorporation of human corrective feedback is one of the fastest and most useful ways available to improve an MT system's performance.

The following chart shows what happens at the monitoring stage, where human judgment and active corrective feedback on model outputs begin to drive improvements on the specific material in focus. The best systems process this feedback, learn, update, and incorporate the new learning quickly enough to improve the model's predictions in real time.


The speed and ease with which new learning can be incorporated into an MT system are critical determinants of the value of the MT system to an individual translator. There is great value for all stakeholders in improving the predictive capabilities of an MT system.
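As a rough illustration of this feedback loop, here is a minimal sketch with invented class and function names (not any vendor's actual implementation). The essential cycle it shows is: the system suggests a translation, the translator corrects it, and the correction immediately becomes new learning material.

    from dataclasses import dataclass, field

    @dataclass
    class AdaptiveMTSession:
        """Toy human-in-the-loop cycle: suggest, collect a correction, adapt."""
        base_translate: callable                         # stands in for any MT back end
        corrections: list = field(default_factory=list)  # (source, post-edited target) pairs

        def suggest(self, source):
            # Reuse a prior human correction if the same segment recurs;
            # a real adaptive system generalizes far beyond exact matches.
            for src, tgt in reversed(self.corrections):
                if src == source:
                    return tgt
            return self.base_translate(source)

        def accept_correction(self, source, post_edited):
            # Each post-edit is fed back immediately as new learning material.
            self.corrections.append((source, post_edited))

    # Example with a stand-in MT back end.
    session = AdaptiveMTSession(base_translate=lambda s: f"<raw MT of: {s}>")
    draft = session.suggest("Close the valve before inspection.")
    session.accept_correction("Close the valve before inspection.",
                              "Fermez la vanne avant l'inspection.")
    # The next time the segment appears, the human correction is what comes back.
    print(session.suggest("Close the valve before inspection."))

The faster this cycle runs, and the further a real system generalizes from each correction, the more value the translator gets back from every minute of feedback invested.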


ModernMT: An MT system designed for the translator

The modern-era translator's work experience often involves the use of translation memory (TM), since TM improves productivity whenever it is related and relevant to the new translation work a translator undertakes.

MT is used less often by professional translators, in general, for the following reasons:

  • Generic MT output is of limited value.
  • Most MT systems have a very limited ability to customize and adapt the generic system to the translator's area of focus and specialization.
  • The complex customization process often requires skills that are outside the scope of translator education.
  • A large volume of data (more than most translators can summon) is needed to have any impact on generic engine performance. This also makes it difficult for most LSPs to customize an MT engine, as most MT models on the market require tens of thousands of segments or more of training data to have any effect.
  • The very slow rate of improvement of most MT engines means that translators must correct the same errors over and over again, and the improvement process can itself be a significant engineering undertaking.
  • The open admission of MT use is often penalized with lower compensation and lower word rates.
  • The inability to control and improve MT output predictably means that translators have a higher level of uncertainty about the utility of MT given project deadlines, and thus fall back to traditional approaches.

For MT to be useful to a translator it needs the following attributes:

  • Tight integration with CAT tools that are the primary work environment for translators.
  • Easy to start using without geeky technical preparation and ML-customization-related work.
  • Rapid learning of new material and incorporation of any corrective feedback so that the MT system is continuously improving, by the day or even the hour.
  • The ability to handle project-related terminology with ease.
  • Keeps translator data private and secure.

ModernMT is an MT system designed to adapt to the unique needs and focus of an individual translator in essentially the same way that TM does. In many ways, it is a next-generation TM technology with predictive capabilities.

ModernMT is a translator-focused MT architecture that has been built and refined over a decade with active feedback and learning from a close collaboration between translators and MT researchers.

ModernMT has been used intensively in all the production translation work done by Translated Srl for over 15 years and was a functioning human-in-the-loop (HITL) machine learning system before the term was even coined.

ModernMT is perhaps the only MT system that was designed by translators for translators rather than by pure technologists working in isolation with data and algorithms.

This long-term engagement with translators and continuous feedback-driven improvement have also created a superior training data set over the years, one that gives users an efficiency and quality advantage that is not easily or rapidly replicated.

This is also why ModernMT does so consistently well in third-party MT system comparisons, even though evaluators do not always measure its performance optimally. ModernMT simply has more informed translator feedback built into the system.

The following is a summary of features in a well-designed Human-in-the-loop (HITL) system, such as the one underlying ModernMT:

  • An easy setup and startup process for every new adapted MT system, which allows even a single translator to build hundreds of domain-focused systems.
  • Responsive: Active and continuous corrective feedback is rapidly processed so that translators can see the impact of corrections in real-time and the system improves continuously without requiring the translator to set up a data collection and re-training workflow.
  • An MT system that is continuously training and improving with this feedback (by the minute, day, week, month). Small volumes of correction can improve the ongoing MT performance.
  • Tightly integrated into the foundational CAT tools used by translators who provide the most valuable system-enhancing feedback.
  • A different mode of engagement and interaction with MT than the typical PEMT experience.

I recently interviewed several translators who are active ModernMT users and have summarized their comments (positive and negative) below. Their comments contain pearls of wisdom and anecdotal experience that may be useful to other translators who are still considering MT.

The subject focus of those who shared their usage patterns with me included accounting/finance, legal contracts, complex engineering-equipment content, marketing content, product manuals, newsletters and press releases, medical information for patients, and even Buddhism- and meditation-related content. Many simply provided categories like Law, Medical, and Technical.

Extent of use: MT was used in the large majority of their work, except for DTP or very specialized domain content that they handled infrequently. Many said that the real benefits start to accrue after one builds up some TM, and that over time ModernMT learns to support one's primary workload.

How MT is engaged: CAT tools (Trados), the ModernMT GUI, and MateCat

Why: Work volumes and turnaround requirements, high-level data privacy, and the availability of TM to enable adaptation.

Competitive systems evaluated: Google, DeepL, Systran, Kantan

“I have used DeepL and Google, which can be very useful, although I still find ModernMT to have better overall accuracy compared to both of them. DeepL is a good alternative for comparing output, although it is much less consistent compared to ModernMT when working on large documents e.g. consistency of terminology etc.”

“I can tell you this with peace in my mind that nothing can replace ModernMT. ModernMT has magic that no one can describe. It really adapts to contexts and stores my previous translations and yields me 99% accurate translations.”

Improvements needed: word-case handling for acronyms and abbreviations, handling of short phrases and titles, persistence of terms across documents, format preservation, and the dashboard.

Desirable new features: glossary and terminology handling, a dashboard on data and usage, more robust punctuation handling, real-time predictive capabilities, and pre-translation quality assessment.

“I consider MT as a development tool, making our job easier, but not a tool that gives the final product. It is like an advanced medical tool used by a surgeon during surgery, which helps the surgeon to make fewer mistakes, to save time, and to save the life of the patient.”

A strong positive comment by a translator who also offered constructive suggestions for improvement: “I have noticed incredible improvement [in the MT quality] as if it is my roommate who was trying to get to know me and my translation style and way of constructing the sentences.”

Many were surprised to find that glossary and terminology entries are best introduced to ModernMT in full-sentence form rather than as short phrases, since the context and variants shown in a sentence ensure faster pick-up and learning.

Several expressed surprise that more translators have not realized the cost/benefit and productivity advantages to be gained by using a responsive MT system like ModernMT. They also mentioned that success with ModernMT requires an investment of one or all of the following: time, corrective feedback, and personal TM, but that it can yield surprisingly good results in as little as a few weeks.


To close this post, I include a podcast done with ProZ last year that received very positive feedback from many translators.

Conversation with Paul Urwin of Proz on MT

Paul talks with machine translation expert Kirti Vashee about interactive-adaptive MT, linguistic assets, freelance positioning, how to add value in explosive content situations, e-commerce translation, and the Starship Enterprise.


Paul continues the fascinating discussion with Kirti on machine translation. In this episode, they talk about how much better MT can get, which languages it works well for, data, content, pivot languages, and machine interpreting.

13 comments:

  1. MT definitely needs the human in the loop

  2. This has been the result of a lot of MT providers doing a poor job over the past 25 years of creating relationships with professional human translators. I warned in some LinkedIn posts in 2009 and 2010 that the result would be professional translators becoming enemies rather than allies. Professional translators have a very large influence on customers. Some LSPs even train their salespeople to show bad MT quality in order to sell more higher-quality translation services without MT.

    There are very few university training programs that work with MT software, and very little involvement in building up the next generation of MT post-editors, or whatever the participants should be called.

    And the marketing materials continue to promise very good to excellent MT without extra effort by humans.

    Some MT providers are trying to change this, but they are the minority.

    Replies
    1. I agree with Jeff Allen that the MT technology community has created much of the enmity, aided by incompetent LSPs who used "bad MT" simply to force compensation down. But highly responsive MT is now available, and freelancers can gain more control and leverage if they learn to use it more skillfully.

  3. I am not sure if it is just my personal perception, but it seems that the level of collaboration between programmers and translators is low when it comes to developing MT/AI. I would love the idea of having an AI translator capable of adapting to my style of working, one that I can train to deliver a faster and more efficient service to LSPs - but somehow translators and linguists got the feeling that we have been ignored and that MT providers have gone directly to LSPs.

    Replies
    1. I think you are correct; there is very little collaboration between MT engineers and translators. Also, very few engineers understand the translation work process, so what they develop may not be that useful to translators. ModernMT is one of the very few exceptions that tunes into each individual's work portfolio.

    2. Kirti Vashee, I entirely agree with you. But as I have always said, translation is not a verbatim act. Eight words in L1, 7-9 words in L2, and the algorithm then says it's a good translation. Verbatim translation is a failure and is unacceptable. This is where Google and DeepL score. Their billions of parameters ensure that the translation reads 'human' and not just verbatim, which is what low-end NMT systems provide.

  4. Thanks Kirti Vashee for sharing. A very good read indeed.

  5. Google alerts for “translation technology” only focus on MT, because what players in this industry know as translation technology is irrelevant to most customers. Alerts for “machine translation” indicate a steady growth in interest around MT, because the relationship most customers have been experiencing with LSPs is disappointing when not outright poor, despite all the BS. Or maybe due to it.
    There should be no surprise in how enduring the interest in MT has been over 70 years. The whole language community should have anticipated it, embraced it, and exploited it rather than fought it. How stupid!
    There is no human in the loop; there never has been: no technology can exist, work, and be useful without human beings devising it, implementing it, and using it. It is simply too late for linguists: machines are already doing most of the work, with humans being paid peanuts for a job that has remained invariably the same for decades. There are no contrarians or visionaries in this industry, only one or two Kassandras, and many Trojans. The Achaeans have been in for years and still too many pretend they're safe. Possibly waiting for someone to show them the way to El Dorado (the mythical premium market). Fools.

  6. In the final analysis, the human touch is always needed. No NMT engine, however good, can replace a human being. In my opinion, it is a question of trust. I will trust a reputable agency to translate a text, but I will not trust an NMT engine. The day we start trusting the output of these engines [like we do with Google Maps, albeit to a large extent], the days of translators, I am afraid, are numbered, and a translator will be called upon only for material that has legal or economic repercussions.

    Replies
    1. I have been researching contexts in which the human touch is not present, yet MT is satisfactorily fulfilling a need. My own idea is that some of the responsibility a translator would carry in other situations is carried by the consumer of the unedited, raw MT. In a good situation, they are fully aware they are consuming raw MT and they approach it in a different way than they would human translation - with caution and an awareness that any given passage could contain mistakes. I studied one context in which raw MT is used extensively (the research work done in patenting processes). One of my goals was to learn about the role of trust in their use of raw MT. What I eventually concluded was that it's not fully about trust - it's about risk management. Trust plays a role there too, but the more important factor is the calculation of the risk of relying on raw MT in any given situation. When the risk is too high, patent professionals turn to human translators.

    2. I agree with you. Blind trust may apply to languages like French where there is considerable data. And even then, both DeepL and Google goofed up on the word bâtiment: 'building', but also 'navire' (ship). The context allowed me to know it was a ship and not a building. In low-resource languages, the situation is worse. And the need for PEMT is always present. But for how long?

  7. I tried ModernMT years ago. I prefer DeepL. Why do you prefer ModernMT? Are you sure that it is consistent with the terminology inside the TM?

    Replies
    1. ModernMT is a significantly more adaptive MT system than DeepL, but it requires that the user teach the system with TM, corrective feedback, and glossary terms entered in full-sentence form. If a translator makes this investment, the quality improvement will generally be MUCH greater than with any generic system. Read the comments by the translators who have made this investment in the last part of the post.
