eMpTy Pages: Translator Strategies For Dealing With PEMT

Friday, November 8, 2013

Translator Strategies For Dealing With PEMT

I recently had the opportunity to speak to a group of translators and interpreters about machine translation and how it increasingly impacts their work lives. Because more and more agencies now use MT, a translator is much more likely to be approached to do post-editing work. My message to the translators at the event therefore focused on how to assess these opportunities (or hazards) and maximize the benefit of any interaction.

Translators have much more power than they realize and I predict that they will eventually learn to separate the wheat from the chaff. Translation agencies will hopefully figure out that while MT technology will proliferate, the shortage of “good” translators will only intensify in a future where global companies want to translate 10X or more the volume of information they do today.

We see many examples of MT use by agencies today but very few of these would qualify as skillful and appropriate and even fewer would be considered fair to the post-editors. It is my sense that MT technology will only offer long-term competitive advantage to those who use it with skill and real expertise and have skilled translators involved in the process. It is very easy to dump data into an instant MT portal and get some kind of an engine, but not so easy to get an engine that provides a long-term cost and efficiency production advantage.

If you are one of those translators who feel that you will NEVER do post-editing work, or you have decided that you simply don’t want to do it because you have plenty of “regular” work, then this post will probably not be of any interest. I am one of those people who believe that MT will continue to gather momentum and that it is useful for translators to understand why, and to determine when to get involved or not. (And this is not just because I am involved with the sales and marketing of this technology.) It simply makes sense at a common-sense level. The first thing to understand is that not all MT engines are equal, and that free online MT is not the best example of professional use of this technology even though it can be surprisingly good in some languages.

Since there is a great deal of variation in the specific MT output that translators are expected to post-edit, I think it makes sense for a translator to understand each unique opportunity as it comes along, and determine whether it is worth his/her time and engagement. Some MT opportunities can pay better than standard translation work, if the word rate to MT quality ratios are properly determined, and thus I think it makes sense for translators to understand when this is actually the case. Early experiences with incompetent or unscrupulous MT practitioners have helped PEMT work develop a reputation for being mind-numbing work that is poorly compensated. This IMO is more a reflection of the quality of these early efforts than of the real possibilities of the technology when used with expertise.

Thus I have come up with a simple checklist for a translator to evaluate a potential PEMT “opportunity” and decide whether to engage or not.

1) Compensation is linked to actual work effort

The MT output quality to word rate (financial compensation) relationship is a fundamental issue for translators. It is important to understand the “average” quality of the MT output, understand the effort required to fix it to target quality levels, and ensure that this effort is related to the compensation offered.

Since with PEMT we are generally talking about asking translators to accept a lower rate than they normally charge, it is important that there is a modicum of trust with the agency in question. This would allow a fair and reasonable rate to be established that matches the effort required to get the MT output to required target levels. This subject is dealt with in some detail here and here. The better you understand your own personal productivity with the specific MT output you are dealing with, the more informed your decision will be. The specific effort level can be assessed quickly by doing a small test with a “representative sample” of 100 or so sentences. The throughput measurements you make can then be used to extrapolate and calculate an acceptable rate.

So if your normal throughput is 2,500 words/day (313 words/hour) and you find that with the test MT output you can expect to do 5,000 words a day, it would be reasonable to accept a rate that is 60% of your normal rate, and even 50% might be fair if you feel the sample is very representative and you do not mind this type of work. At double the throughput, 50% is the break-even point where your daily income stays the same, while 60% leaves you earning about 20% more per day. (I would err on the higher side as the test is only as good and representative as your test sample.)
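The arithmetic behind this example can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the 8-hour working day is an assumption inferred from the 2,500 words/day and 313 words/hour figures above.

```python
def throughput_from_sample(words_edited, minutes_spent, hours_per_day=8.0):
    """Extrapolate words/day from a timed post-editing test sample.

    hours_per_day=8 is an assumption (2,500 words/day ~ 313 words/hour).
    """
    if words_edited <= 0 or minutes_spent <= 0:
        raise ValueError("words_edited and minutes_spent must be positive")
    words_per_hour = words_edited / minutes_spent * 60
    return words_per_hour * hours_per_day


def break_even_rate_fraction(normal_words_per_day, pemt_words_per_day):
    """Fraction of your normal word rate at which PEMT daily income
    equals your normal daily income."""
    if normal_words_per_day <= 0 or pemt_words_per_day <= 0:
        raise ValueError("throughput values must be positive")
    return normal_words_per_day / pemt_words_per_day


def daily_income_ratio(rate_fraction, normal_words_per_day, pemt_words_per_day):
    """PEMT daily income divided by normal daily income when paid
    rate_fraction of the normal word rate."""
    return rate_fraction * pemt_words_per_day / normal_words_per_day


if __name__ == "__main__":
    normal, pemt = 2500, 5000
    print(break_even_rate_fraction(normal, pemt))            # 0.5
    print(f"{daily_income_ratio(0.6, normal, pemt):.2f}")    # 1.20
```

Anything above the break-even fraction is the margin that covers the risk that the test sample was not representative.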

A critical skill to develop in these scenarios is the quick assessment of the MT output quality, so that you can determine your work throughput and thus your acceptable rate. Remember that small grammar and word order errors are much easier to correct than word salad and bad or inconsistent terminology, which require research. The rapid assessment of the quality of the MT output should be an important part of determining when a project is worth doing or not. Having a basic understanding of BLEU, Edit Distance and other methodologies is useful as this can expedite assessment of the PEMT opportunity. Asia Online offers free software to run BLEU and develop your own error classification based calculations.
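To make the edit distance idea concrete, here is a minimal sketch of a word-level Levenshtein distance between raw MT output and its post-edited version. It is a simplified illustration, not BLEU, not TER (it has no phrase shifts), and not the Asia Online tooling; the function names and the whitespace tokenization are assumptions made for the example.

```python
def word_edit_distance(mt_words, edited_words):
    """Minimum number of word insertions, deletions and substitutions
    needed to turn mt_words into edited_words (Levenshtein distance)."""
    prev = list(range(len(edited_words) + 1))
    for i, mt_word in enumerate(mt_words, start=1):
        curr = [i]
        for j, ed_word in enumerate(edited_words, start=1):
            cost = 0 if mt_word == ed_word else 1
            curr.append(min(prev[j] + 1,          # delete a word from the MT output
                            curr[j - 1] + 1,      # insert a word into the MT output
                            prev[j - 1] + cost))  # keep or substitute a word
        prev = curr
    return prev[-1]


def edit_rate(mt_sentence, edited_sentence):
    """Word edits per word of the post-edited sentence (lower means less work)."""
    mt_words = mt_sentence.split()          # naive whitespace tokenization
    edited_words = edited_sentence.split()
    if not edited_words:
        raise ValueError("edited_sentence must contain at least one word")
    return word_edit_distance(mt_words, edited_words) / len(edited_words)


if __name__ == "__main__":
    print(word_edit_distance("the cat sat on mat".split(),
                             "the cat sat on the mat".split()))  # 1
```

Averaging `edit_rate` over a representative sample gives a rough, repeatable number to set beside your measured throughput, though it says nothing about how much research each correction needed.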

Some things to be wary of include:

Agencies that establish an arbitrary lower word rate independent of language and MT output quality. This is a pretty good clue that they don’t know what they are doing and a sign that there will be dissatisfaction all around.
Agencies using DIY MT who don’t really understand what they are doing. Expect great inconsistency and variability in the output quality and usually lower overall quality which means a greater PEMT effort.
Agencies that have the same rate for PEMT work in tough languages like Japanese and easier languages like Spanish. I would generally expect that the effort would be greater for tough languages and so they should be paid at a higher rate.
Agencies that give you MT output that is lower in quality than you could get on your own from Google or Microsoft. This is a sign that they do not understand what they are doing.
There are many agencies out there that have very little understanding of the complexities of MT and are only using it as a way to reduce costs. They will give you crappy output to edit and expect you to fix it for a fraction of a reasonable rate. Identify these agencies and let fellow translators know who they are. Avoid working with them.
Hourly rates may actually be better for some kinds of MT projects where the translator is expected to only do a partial correction. Research suggests that it is very hard to define how far a partial correction goes.

Examples of agencies that do it right and use objective and trusted measures to establish fair compensation include Advanced Language Translation and Omnilingua. It is worth understanding their process.

2) Trust and communication around technological uncertainty

I think that one of the main reasons MT has taken so long to gain momentum is the low levels of trust within the supply chain and unfortunate early experiences with MT where rates were lowered unfairly and translators were expected to bear the brunt of incompetent use of MT technology. The stakeholders all need to understand that the nature of MT requires a higher tolerance for “outcome uncertainty” than most are accustomed to. Though it is increasingly clear that domain focused systems in Romance languages are more likely to succeed with MT, it is often not clear a priori how good an MT engine will be, and investments need to be made to measure this before anyone can know.

The stakeholders all need to understand this and work together and each make concessions and contributions to make this happen in a mutually beneficial way. This is of course easier said than done as somebody has to usually put some money down to begin this process. The reward is long-term production efficiency so hopefully enterprise buyers are willing to fund this, rather than go the fast and dirty MT route as some have been doing. Agencies that are new to MT and post-editing are those most likely to get it wrong and translators should seek out agencies that are sensitive to resolving the uncertainty in a fair way.

Some specific things that translators can watch for include:

The quality of the dialogue and rapport with the project managers at the agency.
Some agencies provide very clear examples of what they expect you to do with different kinds of errors. This is a good sign and helps focus the work in the most efficient way. Some like Hunnect develop an online training course for post-editors to help clarify this.
The agencies that are willing to work with translators to deal with this technological uncertainty are the ones to focus on. Again Scott Bass from ALT provides wise words on PEMT Best Practices and provides an example of what a win-win scenario looks like.

3) Ability to interact with and control MT technology

One of the common complaints about PEMT is about the drudgery of error correction work. This does suggest that not all translators want to do this kind of work or are well suited to it. Many translators are also seeking to provide feedback and steering advice to the MT system to reduce the drudgery. However, not many MT systems can properly use and leverage this type of feedback. Some, like the Asia Online Language Studio, are designed from the outset to utilize this type of feedback. We are seeing now that many translators do realize that MT can be an aid, much like TM, to get repetitive translation work done faster. MT offers “fuzzy matches” for each new segment that is translated through the system. Good MT systems will produce the equivalent of high quality fuzzy matches and will be much more consistent in output quality than what most of us experience with free online MT (or most DIY efforts) as shown in the second graphic above. Bad MT systems will be inconsistent, unpredictable, produce lower quality output and generally be unresponsive to any corrective feedback, especially when the practitioners are simply dumping data into an instant MT engine making portal.

The following are some characteristics of superior MT platforms:

The ability to provide some initial error pattern feedback to reduce mind-numbing correction work.
Noticeable improvements in quality with relatively small amounts of corrective feedback.
The ability to control the MT output with terminology or repetitive error pattern corrections at run time in addition to the upfront overall training, as this can greatly enhance the speed of the post-editing work.
A defined process to take small amounts of corrective feedback to improve the engine BEFORE a production run to reduce the post-editing effort.
The ability to control the overall linguistic style of the translations to requirements.

This outlines some kinds of corrections that can be run at the time of running a translation through an existing Asia Online MT engine.

Examples of correcting problematic source text to make the post-editing task easier.

Example of using preferred terminology in the event that the original training chooses other terms.

If you have a good feeling about all three items in the list above, PEMT can be just another kind of translation task and can sometimes be one that offers greater financial reward.

There have been several studies of varying quality that examine how PEMT compares with regular translation approaches and we see mixed results and often experimental bias. I just saw this study on The Efficacy of Human Post-Editing for Language Translation from Stanford that attempts to measure this in as objective a manner as possible. I like that they also summarize many previous studies. Some may find fault with this one too because they use oDesk, even though these were translators who had passed a 40 question skill/competence test. IMO the study is perhaps more objective and rigorous than most I have seen from the localization community. I think the key findings are worth noting and the study is worth a closer look by anybody interested in this issue. They ran a carefully monitored regular vs. PEMT comparison test for 3 languages (English to Arabic, French, and German) and found the following:

Most translators found the MT (Google Translate) useful and preferred it to not having a suggestion
PEMT reduces the time taken to get the task done
Across languages they found that the suggested translations improve final quality
Across languages, users provided the following ranking of basic parts of speech in order of decreasing translation difficulty: Adverb, Verb, Adjective, Other, Noun.

“Our results clarify the value of post-editing: it decreases time and, surprisingly, improves quality for each language pair. Our results strongly favor the presence of machine suggestions in terms of both translation time and final quality. If translators benefit from a barebones post-editing interface, then we suspect that more interaction between the UI and MT backend could produce additional benefits.”

I would love to hear what other translators may have to share about their PEMT experiences, both positive and negative, and any suggestions they might have to improve the process. I would like even more to hear what they think about an ideal post-editing environment or workbench and any recommendations they would have.

If you are interested in my slides from the MiTiN presentation you can find them here.

37 comments:

Brian Flack, November 9, 2013 at 10:51 AM
In my experience, PEMT is not much different to the process of verifying a translation performed by a non-native. You compare the translated text with the original and assess whether or not the two documents convey the same message accurately.
The main criticism I have of MT is that it has the potential to make more disastrous translation errors than is likely with a human translator. The consequence of this is that the correction of the erroneous translation can take as much time as if you had performed the whole translation yourself.
I have not had the opportunity to compare different MT processes as I only receive the occasional surprise request to verify a document that troubles one of my clients. Having handled several such tasks, I can only say that the standard has been similar in all. Maybe they are all using the same MT provider.
By Brian Flack
Shai, November 10, 2013 at 10:21 AM
I'm among those who don't plan on doing PEMT, unless I will be able someday to set up and train my own local MT engine and incorporate it as part of *my* workflow, although I'm not sure how beneficial even this type of solution will be compared to referencing quality self-made TMs, glossaries, and well, the subject specific expertise for which a professional translator is hired.

MT is indeed an aid, but in its current form it is not a professional aid; it is an aid for the brokers and for generalist, often questionable-quality translators working in the low market segments (as evidenced by the study cited above); the former abuse it and the latter use it as a hand holding aid. The truth is that currently MT is not being used by merit, but for pure financial benefit of the brokers that operate in the shallow, muddy water of the pond they call the "industry" (i.e. an artificial group of "service providers" that leech off the translation profession just because they manage to make some money). It is a smoke and mirrors game in which the brokers and the technology developers try to exploit both clients and translators in a desperate attempt to survive in the toxic environment that they helped create.

I thank you for writing this article and for touching the subject of fair compensation that is commensurate with the effort (and its abuse), as well as pointing out that most brokers out there have no clue about MT and they just use it arbitrarily to reduce costs (for them) and increase margins (again, for them) with no consideration to ethics and professionalism. The mere talk about "controlled language" (i.e. reducing the quality level of the source to better prepare it for MT) is a testament to the true and probably only motive (financial) of using MT. In the long run those abusive and unprofessional practices will backfire on all the entities involved, only by then who knows how many true professionals, as opposed to brainwashed cogs who are trained to do PEMT according to some "metric" and "standard" or whatever, while having no clue about what translation is really about, will be left in this business.

Also, I don't quite see the connection between translation and PEMT. Translation requires a specific set of skills, while PEMT is a (mind-numbing) process that requires a completely different set of skills. It is like programming and debugging: they seem superficially similar, but are completely different things in practice.

One technology that is almost never mentioned in the MT as a productivity aid debate is dictation (speech-to-text), with clear evidence of it matching (on a relatively bad day) and/or exceeding the best MT scenarios in terms of productivity, not to mention the quality. If half the budget and effort put into MT and other snake-oil profit-driven (not for the translators, of course) "solutions" development were invested in optimizing and developing dictation technology, I suspect things would be different.
William Cassemiro, November 10, 2013 at 2:37 PM
Great article. As a freelancer, my experience until now with PEMT is not related to companies, but to my workflow for some materials I translate. I use ProMT in a workflow where a small chunk of text is translated, reviewed and then I create glossary entries to improve quality, send this back to ProMT and then translate another chunk, repeating this process until the end of the text. Quality gets better and better and time used in preparing glossary and all other steps becomes irrelevant when you see how fast and precise you can finish the whole text. Of course, I'm talking about a very small amount of text if compared to a company volume, but I do believe companies which use a feedback system to improve MT quality are able to provide great material to be post-edited and I surely would evaluate the possibility to partner with them. It would really be a win-win.
Tomas Mosler, MITI, November 11, 2013 at 12:48 PM
I have summarized my two main points why I'm cautious about MT in my recent blog post here:

http://www.englishczechtranslator.com/blog/why-i-dont-think-supporting-machine-translation-systems-is-a-good-idea/

Kirti: "no reason to consider a PEMT job differently from any job that requires you to use available TM"

I dare to disagree, not only for reasons outlined in my blog. (Of course, those reasons may not be the case with every project, but anyway.) Technical aspects aside, TM contains human translation. Professionally, I find it somewhat derogatory to reprocess/recreate something "soulless", without a human touch (at least as long as the difference is noticeable) just because someone wants to save money or time.

Up to everybody, but IMHO words are not bricks - I have the impression that PEMT adds or supports the routine/robotic approach to translation (TM might be sometimes perceived as having similar effect, but it is not the same thing) which compromises the genuine creativity and joyful work.
Valerij Tomarenko (@En_De_Ru), November 12, 2013 at 8:21 AM
I already said it elsewhere: machine translation is like a machine gun – fast, destructive and devastating for your client’s reputation. In case of PEMT, PE is similarly devastating - for a (former) translator’s writing skills (and mind).
Valerij Tomarenko (@En_De_Ru), November 13, 2013 at 12:30 PM
I would also feel unsafe operating a motor vehicle if the road looked like it does here - have a look to the right and to the left of our posts. (Wondering about the telescope above...)
Aurora Humarán, November 14, 2013 at 3:12 AM
«As long as PEMT can deliver final translations that are close to (or sometimes even better) »

Hi, Kirti,
I would like to see statistical evidence of this (not by Common Sense Advisory).
Needless to say, I am totally aligned with Valerij and Kevin's comments.
Thank you very much,
Aurora Humarán, November 14, 2013 at 2:01 PM
«The Stanford study cites several sources in http://vis.stanford.edu/files/2013-PostEditing-CHI.pdf.»

Thank you, Mr. Vashee.
I'll read it to understand the grounds to say that MpT can sometimes be better than real translation. In fact, even when comparing two real translations an expert in translation studies would find it difficult to define which one is "better." I'll read it. Thank you.

«If you look at the writings of Sharon O'Brien and Ana Guberof»

I disagree with most of Ms. Guberof's ideas, but I guess this does not surprise you.

«Asia Online has presented several case studies where this has been true for some specific cases -- you can find these by searching on PEMT in my blog.»

If you could give me more specific directions on MpT being better than real translation in your blog, I'd appreciate it.

Valerij Tomarenko (@En_De_Ru), November 15, 2013 at 8:27 AM
Kirti and Aurora,

You will find a lot of statistical evidence e.g. here http://www.mt-archive.info/conf/MTS-2013-TOC.htm as well as in many other sources listed in this archive (http://www.mt-archive.info/), to the CONTRARY effect.

Unfortunately, all the evidence that PEMT CANNOT “deliver final translations that are close to (or sometimes even better)“ will be interpreted as “examples of bad or incompetent USE of MT“ (as Kirti just demonstrated) by those who have an intrinsic interest to promote PEMT and sell MT systems.

Similarly, if you’d ask for statistical evidence that a bicycle’s performance is close to that of a motor car, you might hear that yes, but it “requires special skills, expertise and experience and needs to be tailored...“.

Granted, it is not an appropriate example, but the blurred road on both sides of this column is sort of inspiring. Also, no matter how adequate a simile can be, chances are it will be interpreted as a “tirade“.
Ana G., November 18, 2013 at 1:10 AM
Thank you, Kirti, for the mention. This is a link to my thesis http://www.tdx.cat/handle/10803/90247 and in the Bibliography section there are plenty of references to very significant work: O'Brien, Plitt, Tatsumi, just to name a few. In all this work, we see an increase in productivity and quality with MT but this is achieved always under certain circumstances related to, of course, the subject matter, but more importantly the quality of the output for that particular language combination in that particular domain, among others. Also, it is consistent that translators behave in very different ways when translating in general with MT or with TMs or on their own. We cannot make quick statements that MT always helps productivity or quality, but in certain circumstances it can do. This could be obvious but I think the more we help to understand how MT works and its impact in actual work, with empirical data rather than opinions, the better it will be for the translators. But I guess opinions can gather more followers :)
ClaudioPorcellana, December 20, 2013 at 3:02 PM
hello there

I already asked this to you Kirti, by email some time ago ...

I told you about the crappy translation into Panasonic digital camera manuals, unreadable even in the English version, and I asked (humorously) if Asia Online could have been the culprit ...
:)

then I asked you for some samples in my language pair (English to Italian) but unluckily you weren't able to send me anything

now, I think that all readers here could have a better insight in the matter, if they were able to check some samples in their language pair ..
Kirti Vashee, December 20, 2013 at 9:23 PM
Hi Claudio

I did respond to your email -- perhaps you did not see the response. Asia Online has had no involvement with Panasonic manuals. I sent you some samples from Italian to English but I have no samples for the opposite direction.

Thanks

Kirti
