Comments on eMpTy Pages: Dispelling MT Misconceptions

-----

@"German NAATI translator" - Just as high-quality human translation requires skill and training in a specific language pair and domain, so does high-quality machine translation. The skills leveraged, the tasks performed and the technology used differ between language pairs and domains. For example, to create a high-quality engine in a Slavic language, a large amount of data manufacturing is required to handle all the inflected forms well. To translate into Japanese, syntax tools are used in a hybrid approach with SMT to deliver higher quality. Even a simpler language like Spanish has its own specific tasks. The skills also extend to domains: travel reviews have a simple grammatical structure, but when you consider the volume and scope of named entities possible in the travel domain, these too have to be processed via data generation and syntax-supported rules in order to deliver high quality. Many LSPs make the mistake of simply loading their translation memories and running an SMT solution such as Moses without any additional effort. It is efforts of this kind that I have not heard any comment on in the slides or articles that Lori Thicke has published in recent months. These skills are complex and language- and domain-specific. For this reason, we have taken exception to broad, sweeping statements such as "RBMT is better than SMT for Japanese".
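What "data manufacturing for inflected forms" can mean in practice is expanding training data so the engine sees surface forms the translation memory never contained. The toy sketch below is purely illustrative (the paradigm shown is a real but tiny slice of Russian noun inflection; actual data manufacturing pipelines are far more elaborate, and this code is not any vendor's method):

```python
# Toy "data manufacturing" for an inflected target language: expand one
# lemma into several case-marked surface forms so a statistical model
# sees each of them at training time. Illustrative only.

# A tiny slice of a feminine-noun paradigm (stem "книг-", "book"):
CASE_ENDINGS = {
    "nominative": "а",     # книга
    "accusative": "у",     # книгу
    "instrumental": "ой",  # книгой
}

def expand_forms(stem):
    """Return every surface form generated from the stem."""
    return {case: stem + ending for case, ending in CASE_ENDINGS.items()}

forms = expand_forms("книг")
# forms["accusative"] == "книгу"
```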
Dion Wiggins (2013-06-23)

-----

Good to know that MT is now a complex affair requiring many skills, expertise and understanding that are not commonplace, to enable successful deployment as a productivity-enhancing technology for business translation needs.

German NAATI translator (2013-06-20)

-----

For this reason, when Asia Online works with LSPs and other customers, we hide the complexity and engage our linguists to execute data manufacturing and optimization processes on the client's behalf. This greatly reduces the level of expertise and effort required of our customers. The Asia Online team are experts in Language Studio, our SMT-based hybrid platform, and know how to optimize it to deliver the best results. The Asia Online team claims no expertise in other SMT-based or RBMT-based products, as each has its own merits, approaches and optimal configurations.

We strongly recommend that anyone looking at any MT technology or approach, whether RBMT or SMT, look for skills that are vendor- and product-specific. These are not easy to come by, but they are the only way to deliver high quality and reduce deployment risk. Any vendor claiming expertise in SMT and RBMT without explicitly listing the specific vendors and products they work with is not going to deliver optimal results. Generic skills in SMT and RBMT approaches most certainly cannot deliver the optimal result.
Specialists in individual products, not just an understanding of an approach, are required.

Additional counter-positions to more misconceptions raised by the author in this presentation and in previous publications can be found in Kirti Vashee's blog (http://kv-emptypages.blogspot.co.uk/2013/02/dispelling-mt-misconceptions.html).

Dion Wiggins (2013-06-17)

-----

What the author has omitted from their presentation is information about the corpus used to train the SMT engines and the actual product used to support each specific claim. The author also omits how the metrics were computed, how many segments were compared, how post-editing productivity was measured, and over what time period. Too few segments and too short a time period can dramatically and incorrectly skew results. Additionally, while languages were referred to, domains were excluded; the complexity of a domain is an important factor that affects metrics and quality. By comparison, the LexisNexis case studies listed nine MT systems, the metrics performed, the data volumes used and the results.

We agree that a technology-agnostic approach to MT is very viable, but as Andrew Rufener points out in his presentation, the integration costs and skill levels required to run multiple MT platforms are significant and can often outweigh the benefits of selecting multiple MT technology solutions. Adobe, PayPal and others have successfully deployed multiple MT technologies, and some, such as Autodesk, have been very open with their metrics. However, they have made significant investments in skills, time, data acquisition and data optimization, as well as software development.
They are also focused on their own narrow domains, not the broad range of domains in multiple languages that an LSP must cover. Thus they have their own existing language assets and do not need to perform as much data management as an LSP would when receiving TMs from multiple sources such as TAUS and other LSP partners. Trying to make out-of-domain data fit a new purpose is very difficult and unlikely to deliver optimal quality. This mixed-data approach is commonly referred to as "dirty data SMT", which is very different from the focused-domain "clean data SMT" approach that Asia Online takes in Language Studio.

We encourage the measurement and publication of metrics, together with all the relevant information about how the metrics were computed and with what data. However, evaluating only a small number of products, excluding many of the leading commercial products from multiple vendors, and performing no data optimization or data manufacturing means that the results, irrespective of product, will be biased, skewed and limited to what the raw corpus can provide. Modern commercial SMT systems go well beyond the capabilities of open-source Moses and Microsoft Translator.

What many LSPs seem to underestimate is the complexity of delivering high-quality MT. Being able to install Moses and train your own engine does not mean you will get high quality, any more than owning a sewing machine and cloth makes one an expert tailor. As the LexisNexis presentation shows, a significant investment in skills is needed. Andrew Rufener notes in his presentation the significant effort they put into learning SMT approaches and optimizing data. It is our position that no single individual or organization has the necessary skills to be an expert in every technology and approach. Much like healthcare, it is too complex for one individual to be an expert in all fields.
For this reason, specialist medical professionals are needed for cancer, brain and other treatments. While a general practitioner can deal with common low-level issues, more complex issues are referred to a specialist. Machine translation is complex. There are few true specialists globally, and even fewer with solid experience in multiple technologies and approaches. Finding an expert in optimizing any one of these technologies is difficult. Finding an expert who can deliver the optimal approach and quality from all, or even multiple, SMT and RBMT vendors' products is not realistic.

Dion Wiggins (2013-06-17)

-----

5. RBMT is better suited to documentation and software. This is incorrect. There are many published case studies to the contrary. As an example, the Omnilingua case study on the Asia Online website shows that 52% of raw MT required zero edits for their technical automotive documentation. There are many other examples from Asia Online and other vendors.

6. RBMT is better suited to post-editing. This is incorrect. As in the Omnilingua case study above, SMT-based engines can deliver near-perfect quality. An engine's quality largely comes down to pre- and post-processing technologies and the amount of suitable data available for the SMT customization process. With less data, or low-quality data, translation quality will be poor and editing will be difficult. With more high-quality, in-domain data, and because the engine has learned from the client's own translation memories, editing will be significantly reduced.

7. SMT is not effective with a limited training corpus. This is incorrect.
Advances in data manufacturing technologies, such as those available in Language Studio, mean that even when no data at all is available, an engine can still be customized to a high level of quality. The case study on Kirti Vashee's blog (http://kv-emptypages.blogspot.co.uk/2013/04/pemt-case-study-advanced-language.html) shows how Advanced Language Translation was able to customize engines for their clients with no data at all, using only the data manufacturing technologies within Language Studio.

8. SMT is not as good at languages like Russian, Japanese and German. This is incorrect. The quality of an SMT engine depends greatly on the quality of the training data. If the author is getting poor results, this may be due to an insufficient corpus, low-quality data, or insufficient skills to prepare and process the data in a manner that delivers high-quality output (see the comment on skills below). There are many SMT-based engines whose quality exceeds that of RBMT. Andrew Rufener presented "Implementing large scale Machine Translation in Patent Information" (http://dotsub.com/view/159ce97c-dbd4-4d6a-90c2-427a3a3e755f), in which he shows metrics from many RBMT and SMT systems. He took a technology-agnostic approach and performed detailed metrics before selecting Language Studio.

It appears from the article that the author has not managed data well when creating SMT systems and has not used any data manufacturing or optimization technologies, as they are never mentioned. This is evidenced by the author's incorrect assumptions that SMT-based systems cannot have managed terminology and are unpredictable.
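Since much of this thread turns on how "metrics" were performed, a concrete note may help readers: quality comparisons of the kind cited here typically report corpus-level BLEU. The following is a minimal, self-contained sketch of that score (whitespace tokenization, one reference per segment, no smoothing); it illustrates the metric itself and is not any vendor's actual evaluation pipeline:

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """Multiset of n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def corpus_bleu(hypotheses, references, max_n=4):
    """Corpus-level BLEU with a single reference per segment, no smoothing."""
    clipped = [0] * max_n   # clipped n-gram matches
    totals = [0] * max_n    # n-grams proposed by the hypotheses
    hyp_len = ref_len = 0
    for hyp, ref in zip(hypotheses, references):
        h, r = hyp.split(), ref.split()
        hyp_len += len(h)
        ref_len += len(r)
        for n in range(1, max_n + 1):
            hc, rc = ngrams(h, n), ngrams(r, n)
            clipped[n - 1] += sum(min(c, rc[g]) for g, c in hc.items())
            totals[n - 1] += max(len(h) - n + 1, 0)
    if min(clipped) == 0:   # any empty precision zeroes the geometric mean
        return 0.0
    log_prec = sum(math.log(c / t) for c, t in zip(clipped, totals)) / max_n
    brevity = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return brevity * math.exp(log_prec)
```

On a handful of segments such a score is extremely noisy, which is exactly why the number of segments compared and the evaluation data itself need to be published alongside any result.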
The author considers only the hybrid approach of RBMT plus SMT-based smoothing available in Systran, and does not consider other vendors' hybrid approaches, such as the hybrid of SMT guided by rules and syntax offered in Language Studio.

We strongly recommend that the author expand beyond the three MT products listed and undertake to learn about data management and data manufacturing for SMT approaches. In Language Studio, Asia Online undertakes these complex tasks for its customers, so that the customer can focus on providing the right data for SMT to learn from, without needing the skills, or an understanding of the complexity, of data optimization. Once the initial optimization and data manufacturing are complete, control is handed to the end customer to add and further refine terminology and other linguistic features.

Dion Wiggins (2013-06-17)

-----

In response to the slide deck on SlideShare (http://www.slideshare.net/TAUS/12-june-2013-taus-mt-showcase-moses-in-the-mix-a-technology-agnostic-approach-to-a-winning-mt-strategy-lori-thicke-lex-works):

This information is reposted from SlideShare, as the formatting was lost in the SlideShare comments display.

The reason for this detailed post is to correct misinformation so that potential MT users can make informed, fact-based decisions about all MT products.
This is not an attack on the author; rather, it is a series of proof points where we and others have disagreed with the author's perspective, with links and references to third-party information supporting the counter-positions presented.

At the end of this presentation, four different organizations raised issues with the content presented. SMT and RBMT are approaches to machine translation, not products. The author refers to SMT and RBMT as if they were individual products rather than technology approaches. There are many products based on either or both approaches, with different features.

The author seems to have considered only the features available in Moses, Microsoft Translator and Systran, while ignoring the features of many other commercial MT products that have already resolved many of the issues raised. The author also appears not to have performed data management and optimization of the training data when creating statistical models. As a result, based on experience with a limited number of products and a subset of the features that commercial SMT-based products can offer, sweeping statements are made as if there were just one product. Each vendor's products, whether SMT- or RBMT-based, have a range of different features that the author does not recognize.

Many of the assumptions presented may have been true several years ago. However, many of the issues raised with SMT in particular were recognized and addressed some time ago by commercial MT vendors, and as such are no longer true. If Microsoft Translator and Moses do not support a feature, it does not mean that all SMT-based products lack it.

The author makes many sweeping statements in the presentation that are factually incorrect and can readily be verified as such.
Multiple individuals have pointed out these discrepancies, but the author has chosen to ignore these proof points and continues to disseminate misleading information. As noted above, several of these issues were raised directly at the end of the presentation where these slides were delivered.

Examples include:

1. SMT cannot handle software tags properly. This is incorrect. Moses cannot handle software tags, but many commercial SMT-based MT platforms, such as Asia Online's Language Studio, handle tags very well.

2. SMT does not retain corrections to terminology. This is incorrect. If the data is managed properly, terminology management becomes very easy. Moses and Microsoft Translator do not provide terminology management tools and processes, but products such as Language Studio provide tools to manage and normalize terminology, both when preparing data for training and at translation runtime.

3. SMT does not have a rapid customization cycle. This is incorrect. In Andrew Rufener's presentation (link below), he notes clearly that the Asia Online system improved dramatically over three days, and that as they added further data, they had control and the system improved quickly.

4. SMT output is not predictable. This is incorrect. If the data is managed properly and supported with data manufacturing, such as within Language Studio, then the output can be very predictable.

Dion Wiggins (2013-06-17)

-----

@Lori - I am sitting in your session at Localization World now, listening to you say the same things that you said in your Multilingual magazine article.
These are the same things that Kirti addressed as misconceptions in his blog post responding to your Multilingual magazine article.

I am responding in a direct manner this time, as it seems the points Kirti made in his previous post have fallen on deaf ears, and this is confusing potential users of MT. It is time to update your knowledge of the advances in SMT in recent years. Many things you say in your presentations and published articles were correct several years ago, but are no longer correct, as the issues have been addressed and resolved. It is perplexing that you continue to disseminate incorrect information that can be verified as incorrect with many proof points. This unnecessarily confuses a market that has already accumulated enough misconceptions over the last 50+ years.

Simple claims like "SMT does not handle formatting tags, while RBMT does" are complete nonsense. Perhaps if you use raw SMT from Moses, but not if you use a commercial SMT product such as Language Studio.

To say universally that RBMT is better for post-editing is again nonsense. Asia Online has systems with published case studies, such as the one from Omnilingua, where 52% of the output required zero edits and the client told Omnilingua that the final quality with MT plus post-editing was higher than with human translation alone.

Productivity is key, and this is reflected in the post-editing experience. See the Sajan case study, which achieved 328% productivity gains, with 62% of the raw MT requiring zero edits on post-editing review.

Likewise, large volumes of in-domain data are no longer a requirement. We have case studies where no client data was provided, because data manufacturing technologies were deployed in the process.
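For readers unfamiliar with figures like "328% productivity" or "62% zero edits", here is one plausible way such numbers are derived (my own illustration; the cited case studies do not publish their exact formulas, and the throughput numbers below are hypothetical):

```python
def zero_edit_rate(mt_segments, post_edited):
    """Fraction of raw MT segments the post-editor left completely unchanged."""
    assert len(mt_segments) == len(post_edited)
    unchanged = sum(mt.strip() == pe.strip()
                    for mt, pe in zip(mt_segments, post_edited))
    return unchanged / len(mt_segments)

def productivity_percent(pe_words_per_day, ht_words_per_day):
    """Post-editing throughput as a percentage of human-translation throughput."""
    return 100.0 * pe_words_per_day / ht_words_per_day

# Hypothetical example: a translator averaging 2,500 words/day from scratch
# and 8,200 words/day post-editing would show 328% productivity.
```

In practice a zero-edit count would be taken from reviewer decisions rather than exact string matches, but the arithmetic is the same.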
See Kirti's recent blog post on Advanced Language Translation as an example.

You mention the lack of control and the unpredictability of terminology in SMT, yet with a clean-data approach such as that used with Language Studio there is complete control: runtime normalization of the source as it passes through, normalization of the target language after translation, and even control over writing style and target audience.

While I have given our own examples using Language Studio and our customers above, there are many MT providers with both RBMT and SMT technologies. Each is different, with different features. To summarize them as you have gives the perception that all MT, or all SMT, is the same. Systran is different from PROMT; Asia Online Language Studio is different from SDL BeGlobal and Moses. Wide, sweeping statements (e.g. "SMT does not support tags") that reduce each product to the lowest common denominator, ignoring their features, are not helpful to anyone.

Dion Wiggins (2013-06-12)

-----
Lori,

The primary focus of my comments was directed at your conclusions about what is possible with SMT. I dispute your conclusions about SMT, not your experience with Bentley. The "years" is in quotes because you state in a comment that you have been working with SMT for that long. No disrespect is intended. I am suggesting (respectfully, I think) that your conclusions about what is possible with SMT are based on a relatively shallow knowledge base, and that those with deeper knowledge of the inner workings of SMT can handle pretty much all the SMT shortcomings you point out in your article. Asia Online is not the only company that disputed your claims; others with deep SMT expertise did too.

You also take a very Systran-centric viewpoint when you use the word hybrid, and perhaps I should point out that "hybrid" has a broader meaning than the one used by Systran. Many more advanced (non-Moses) SMT initiatives use a combination of linguistics and data, not just the data-only approach of most basic Moses setups. Hybrid SMT can involve any combination of the following: linguistic rules development, POS parsing and adjustments, and syntactic and morpho-syntactic approaches that Moses users are unlikely to try. Today, most people who have been working with SMT for a few years are ALL doing hybrids in the broader sense of the word.

I am also suggesting that no two SMT attempts are the same, unless two users feed exactly the same data into exactly the same training system.
As the NIST evaluations used to show, SMT can yield very different results depending on the expertise and skill of the practitioners, especially in a language like Japanese, where some linguistic work has to be done to get better results.

I agree that in the end the only things that really count are the output quality and the quality of the post-editing experience, and it does not matter how you get there. Productivity can only come from good systems that editors feel are responsive to feedback, and where compensation is related to the difficulty of the effort.

Kirti Vashee (2013-03-25)

-----

PS Kirti, I haven't read all the subsequent comments, but I will when I get a chance. I just see that you have put Lexcelera's years of experience as "years" of experience. I don't understand why you would do that.

You have personally met with a longtime MT customer, Bentley Systems, with whom we started working in 2007.

I guess this is as good a time as any to announce that that customer has just joined us as our new CEO. John Papaioannou was Director of Release Services at Bentley Systems, for whom we have done many millions of words of MT in French, English, Spanish, German, Dutch, Italian and Japanese. I guess he liked the results, because he is now Lexcelera's new CEO.

Could we stick to the facts, folks?

Lori Thicke (2013-03-16)

-----
Kirti,

Controversial statements are not a bad thing, because they give us all a chance to share our experience. In that spirit, I welcome the chance to respond to some of your statements with which I do not agree.

Your comment:

"It is my sense that MT is still complex enough that meaningful expertise can only be built around one methodology i.e. RbMT or SMT and that anybody who tells you that they can do both should be viewed with some skepticism. It is almost certain that they cannot do both well, and also quite likely they cannot do either well if they claim expertise in both, since very different kinds of skills are required."

Our answer (thanks to Laurence Roguet from our Paris office):

The only thing true here is that the skills are very different, but that does not prevent an LSP from integrating both profiles and skill sets internally. In fact, this is necessary to achieve even better results via a best-of-breed approach. LSPs are not attached to one tool (or at least they shouldn't be): they are attached to their customers, and to giving them the best results.

Your comment:

"I think the Rules-based MT vs. Statistical MT arguments are largely irrelevant, even though I think it is increasingly evident that SMT is becoming the preferred approach, especially as more linguistics are added to the data-driven approach. To a great extent most systems out there except for raw Moses systems are all hybrids of some sort. Recently MT technology has evolved to a point where SMT and RBMT concepts are being merged into a single 'hybrid' approach."

Our answer:

It's false to say that both technologies are now achieving the same results through hybridization.
You need only assess the results, including a sentiment analysis of final users and post-editors, to see that from one language pair to another and from one content type to another, the results are very different, depending on the engine.

Furthermore, the hybrid approach is not always the best one. For example, when we work in Japanese we get the best results from a pure rules-based approach. But we're not relying on hearsay: before starting any major new project we train three engines for testing (RBMT, SMT and hybrid) and use only the engine that delivers the best results. (Quality MT output is something we feel we owe our post-editors.)

Your comment:

"Have you considered the possibility that others with much deeper expertise than you (Lexcelera) have, could get very different, i.e. much better, results with Japanese or really any project where you may run something through a basic SMT setup and conclude that it will not work for you?"

Our answer:

(Laurence clearly felt passionate about her response, because she wrote it in French. Translated:)

Those who believe in MT, whatever the technology, invest in it. It is not about amusing yourself with a few one-shots, which can only end in losing a client, something no LSP can afford. Lexcelera has been investing since 2007 to serve its clients better. These are mainly human investments, since acquiring skills and knowledge cannot be done without the specific competencies of the various players involved.

As its expertise has grown, Lexcelera has built the teams needed for the sound and intelligent integration of new tools and the processes tied to them. The R&D hours are indeed not negligible when determining "which process, with which tool, for which language, best meets the challenge my client has entrusted to me."
It is from this work, and from the many production deployments already carried out, that certain conclusions could be drawn. If these assertions do not serve or please everyone, that is ultimately a problem only for those who are displeased!

PS: The confrontation of the different profiles and skills we have integrated in-house makes us even more effective, and certainly more "brainstormed" and "challenged", than any company attached to one single technology…

Lori Thicke (2013-03-16)

-----
Tom,

I'm not sure that we have reached circular logic. Perhaps we continue to maintain a predictable and steadfast perspective that is a product of our respective business missions.

I do think, though, that you are missing the implications of the conclusions drawn by Lori (and thus Lexcelera) in her referenced article. IMO she is drawing many erroneous conclusions about what SMT can do based on her "years" of experience, which rather undermines your point that people will somehow learn by playing with Moses for 2+ years. The Multilingual article is a very clear example of how you might NOT learn. To clarify, the reports I cite from TAUS are all Moses case studies presented in 2012.

Additionally:
- Pretty much every SMT deficiency pointed out by Lori in her article, based on her Moses experience, is easily addressed by Asia Online and others with deeper knowledge and expertise.
- Most LSPs don't want or need to learn the intricate details of SMT. Just like building a house: you could do it yourself, but most people hire an architect, builder, plumber, electrician and so on, because even handymen realize some things are better left to people who do them all the time.
- Most LSPs don't want to wait 2+ years (as you described) to get returns; they need them now if MT is going to be worth the investment at all. Hiring an expert means the engine is available sooner and the risk is lowered substantially.
- The corpora developed over many years by Asia Online and other experts cannot be matched by an LSP or even by research institutions, so a do-it-yourself starting point is far lower.
- The claim that the cloud is dangerous and a security risk is simply not true in 2013.

However, I do admit that a few might learn through experience, e.g.
Autodesk.

Kirti Vashee (2013-03-14)

-----

I think we've reached a point of circular logic. We agreed that one can't "avoid the learning process which is to build many engines... with many different kinds of data sets." Isn't that the inherent answer to the question, "So why bother building engines that editors know produce inferior output to what they can do for free elsewhere?"

Learning happens when users build, fail, analyze, adjust and try again. "Expertise" is the skill of an expert gained through experience. We "bother building engines" because we cannot know what produces inferior or superior output until after the system is built. The experience in this corrective feedback cycle develops expertise.

Outsourcing is a valid option for those who do not wish to develop their own expertise. SYSTRAN and AO hire Dr. Koehn for his expertise; in so doing, both have extended their own areas of expertise. Lexworks has worked for years developing its own expertise. The list of experts grows longer each day. As for the TAUS reports you cited: when were those studies done? Four or five years ago?

Tom Hoar (2013-03-14)

-----
Lori,

Have you considered the possibility that others with much deeper expertise than you (Lexcelera) have could get very different, i.e. much better, results with Japanese, or really with any project where you may run something through a basic SMT setup and conclude that it will not work for you?

Kirti Vashee (2013-03-07)

-----
Tom,

My original blog post focused on "Dispelling MT Misconceptions", and the basic thrust of the post challenged and questioned several assertions made in this article in Multilingual: http://lexworks.com/cms/wp-content/uploads/2013/03/PostEdShrtgMT-Thicke.pdf

I felt they were especially misleading and incorrect with regard to what is possible with SMT, and thus I provided contrary evidence on specific points made in the Multilingual article. I suspect that many of the conclusions were reached because the author had the limited and shallow experience with SMT that many DIYers typically have. The issues she highlights have been solved by others with deeper expertise and a broader skill base in SMT-related technology. The author is an LSP who claims a few "years" of experience with SMT (Moses and Microsoft Translator Hub), and her conclusions about SMT were based on this experience. As she says in a comment on the blog post: "But for years we have been working in SMT as well, and in fact are part of a group working on SMT daily under a large EU grant." I think the comments made in the article suggest that many LSPs will get stuck at basic hurdles that can easily be managed by specialists who can afford to invest more in the process and in building the critical skills to solve the highlighted problems.

My point is that few LSPs have the upfront skills and knowledge, or the urge to invest in developing the expertise necessary, to resolve the many problems and technical challenges involved in taking Moses or other technologies up to viable professional-use quality levels.
Several TAUS Moses case studies show this as well: many participants reported spending much more money and time than anticipated developing engines on their own, yet had still not reached quality levels that matched free MT.<br /><br />I think it does matter that LSP MT engines produce better output than free MT. Every freelancer/post-editor ALWAYS has the option of using free MT instead of poorer-quality MT output from an LSP. If a post-editor determines that free MT is going to be easier to work with and edit, then I expect many will use it and prefer it over low-quality output from a DIY Moses effort. So why bother building engines that editors know produce output inferior to what they can get for free elsewhere?<br /><br />I agree with you that, for many, using free MT will be a better choice than undertaking a low-quality DIY effort where one may or may not reach competitive quality levels in 2+ years.<br /><br />Developing good MT systems in 2013 is to some extent like building houses: one can try to do it oneself and hopefully learn along the way over 2+ years (or not, as the Multilingual article shows), or one can hire contractors who have built hundreds of houses and focus instead on the finishing work and on developing skills closer to one's core competence.
While some LSPs may indeed learn the various skills needed to build their own systems, I think most will find that doing so is often more expensive and results in low-quality systems once all the learning and time-opportunity costs are factored in.<br /><br />I am biased, as you know, since I work for a company that provides expert services (contractors) and is required to prove that the systems we develop are better than anything the customer could do with free MT, or often with DIY solutions too.<br /><br />Finally, I think RbMT- and SMT-based approaches can both work, but both require long-term investment to reach any kind of distinctive expertise, and I remain skeptical of those who think they can do it all, especially at a level of competence that adds business value.<br />By Kirti Vashee, 2013-03-07
Hi Lori,<br /><br />Of course you are allowed to correct, clarify, or comment on anything I have said.<br />By Kirti Vashee, 2013-03-05<br /><br />Re: "it is difficult to produce MT engines that even reach the output quality levels of Google and MSFT." Let's not forget that SMT has many variables. Google and MSFT often achieve excellent quality in some language pairs for some subject domains. So it's not surprising that those quality levels are difficult to reach.<br /><br />* I know that as of mid-2012, Agoda.com in Bangkok was paying for translations via Google's API because they found Google's results in their travel and tourism domain were quite adequate for their post-editing teams, the price was right, and confidentiality wasn't a requirement.<br /><br />* Likewise, a few years ago a travel/tourism project with one LSP found that their customized engines struggled to reach, much less exceed, Google's quality results, which were the defined benchmarks for the project. For that project, the LSP had professional help.<br /><br />* As its first project, one of our customers converted parallel PDF travel descriptions to text, aligned them, and trained an engine. The training data from the PDFs consisted of artistic, flowery, creative writing styles. This was on their own and without asking our advice. The resulting SMT translations were horrible. They learned from the experience, bounced back, and today their results are astounding.<br /><br />So, not exceeding Google and MSFT is not necessarily a bad thing. On the other hand, getting it right the first time could be "beginner's luck," and that's not necessarily a good thing.
If you're in this for the long haul, it pays to learn, bounce back, and create lasting assets that compound returns.<br />By Tom Hoar, 2013-03-05<br /><br />I agree (S)MT is complex. It takes time to learn (a) that it does work and (b) how it works.<br /><br />Where else would you have a beginner start, other than with a first engine in one language pair? Edison didn't invent the lightbulb on his first try. Problem solving is part of the learning process; without problems, there's no learning. The SMT experts refer to building SMT models as "experiments" for a reason. So I go back to my original questions, which remain unanswered.<br /><br />If it's not good enough to have 2+ years of learning across hundreds of users culminate in the users' own declarations that they are satisfied with and using their results, then what will it take to move the action forward? If the users' satisfaction isn't good enough, is it desirable to make (S)MT less complex in the first place?<br />By Tom Hoar, 2013-03-05<br /><br />Oh my goodness, I was just reading your post, Kirti, and I see you are quoting me. Is that from Multilingual Computing? Am I allowed to comment?<br /><br />We're not a technology vendor, so we have no axe to grind. We came from a rules-based environment, absolutely true. But for years we have been working in SMT as well, and in fact are part of a group working on SMT daily under a large EU grant.<br /><br />I guess the most important thing to say is that before any large project we test extensively, comparing performance among the three approaches: RBMT, SMT, and Hybrid.
(Which RBMT or SMT engine we choose to train and test with depends on various factors; for SMT it's not always Moses. For RTT, for example, we may test with Microsoft Translator Hub.)<br /><br />I cannot speak for every SMT engine under the sun, but in our experience Japanese does not respond well at all to the statistical approach. When testing, we even find we have to throw out what the Hybrid does with that language as well. Our tests always point to higher quality in Japanese with a straight rules approach, and our post-editors confirm this.<br />By Lori (http://www.lexworks.com), 2013-03-04
Tom,<br /><br />I maintain that it is useful and even necessary to understand HOW it works and what might cause problems before you build your first engines, if any long-term and sustainable value is to accrue, even for one language.<br /><br />As many of the TAUS case studies show, it is difficult to produce MT engines that even reach the output quality levels of Google and MSFT. Remember that any translator can get that level of output on their own, and will have little incentive to use a low-quality LSP engine if the post-editing work is in fact harder.<br /><br />I think many underestimate the skills needed to steer these MT engines into something that delivers measurable productivity. Deep MT knowledge and skills are for the most part not available at most LSPs, mostly because they would be hard to cost-justify.<br />By Kirti Vashee, 2013-03-04<br /><br />Of course, it requires more than an overview and humor to achieve results that contribute to the user's bottom line. I agree that one can't "avoid the learning process which is to build many engines" and "with many different kinds of data sets." The number of language combinations, however, depends solely on what languages they choose to support.<br /><br />In our experience, our customers focus on 3 or 4 languages in various combinations where they have internal expertise. They evolve their own definitions of "results that contribute to the user's bottom line" to meet their needs. They acknowledge that their results do not typically reach maximum optimization, but their costs stay within their budgets, and their increased productivity yields savings that still drive profits up.
<br /><br />Since they are still new in the field, they reinvest their savings into new user education and training. With each iteration of model building, they acquire internal expertise by building many SMT models in the few language combinations that are important to them.<br /><br />This investment in human capital shows up as an ROI multiplier in later years. Localization engineers at our earliest customers (2-3 years in) now build new SMT models with only a few minutes of their attention before spawning the training/tuning process, which typically runs overnight or over a couple of days. Why? Like you said, they have worked with the system, they've grown to understand the impacts of different data, and they have organized their data to suit their needs.<br />By Tom Hoar, 2013-03-04<br /><br />I think there is a lot happening in terms of making the mechanics of building an SMT engine easier, but addressing the other part is much more difficult.<br /><br />At this point in the technology's evolution, I do not think one can avoid the learning process, which is to build many engines across many language combinations with many different kinds of data sets, in order to start to understand HOW it works and what can be done on a systematic basis to ensure optimal results.<br />By Kirti Vashee, 2013-03-01<br /><br />I think the opening sentence of Kirti's blog says it all and encapsulates a very accurate picture: "MT in 2013 is still a complex affair requiring many skills, expertise and understanding that are not commonplace, to enable successful deployment as a productivity
enhancing technology for business translation needs."<br /><br />Two important questions come to mind:<br /><br />Is it desirable to make MT a less complex affair?<br /><br />Is it desirable to make the associated skills, expertise, and understanding more commonplace?<br /><br />If the answers to both are yes, then what are we willing to do to move the action forward?<br />By Tom Hoar, 2013-03-01