To facilitate the discussion I have created my own very rough evaluation and analysis model, which is an adaptation of the CSA Localization Maturity Model which is an adaptation of the software industry’s capability maturity model (CMM). Basically, it is a way of assessing if the technology user understands the technology, and is also using it in an efficient and effective manner by properly linking it to other organizational processes.
Thus, companies that are at a higher stage of the CSA LMM model referenced above, will tend to have much more responsive and adaptive localization practices, that enable them to be much more nimble and efficient and effective in international market initiatives. These maturity models define very specific “stages” to characterize the efficiency and effectiveness of the business process as shown below. Thus an LSP at a higher stage is probably much better to work with, and an Enterprise at a higher level is also much more nimble and international market savvy and superior localization practices.
While this is interesting, it is somewhat theoretical, and I am going to attempt to make it less so in my own analysis of MT technology. This will make my comments less academically robust, but since all I do here is just speak my opinions on things it does not really matter. I do NOT represent any corporate opinion here, and these comments are purely personal observations though hopefully still valid professional assessments. My intention is to highlight best practices and point out what I think are interesting trends.
MT (machine translation) or Automated Language Translation (ALT) or Machine Pseudo Translation (MpT) use can also be described to some extent using these maturity model stages, however, I am going to try and simplify this further, as I only use the maturity model perspective to better structure my comments and analysis, of how this might apply to the business use of MT technology to further international initiatives.
So while there are still many naysayers who will never see any value to professional business use of MT, there is a growing community of users that are working through the vagaries and complexity of MT with varying degrees of success. The Gartner Hype Cycle curve below provides a useful graphic to describe the stages from a very common expectations cycle, that so many, if not most users of this technology seem to go through. I will describe the various maturity stages in more detail in upcoming posts though my analysis may not be quite as tightly defined as the CSA analysis.
It may be useful at the outset, to list some of the most common pitfalls, which are the opposite of best practices, as the continual recurrence of these factors seem to plague many business use cases of MT technology. They are:
- Looking for cheap, fast and “easy” approaches like Moses and Instant Do-it-yourself solutions.
- Expecting to do better than the really decent generic engines provided by Microsoft and Google without investments in time and core skill building.
- Looking to use MT as a wholesale replacement for human translation.
- Using MT for one-off projects and/or for small volume requirements (LT 500,000 words). MT makes most sense for very large projects that involve millions of words, especially those that would not make sense to do using only human translators for time and cost reasons.
Emerging MT Trends
While the “translation industry” only focuses on the older SMT approaches most often based around Moses, we are seeing an increasing and building momentum around newer approaches to building MT engines. These involve techniques like deep learning, neural nets and artificial intelligence to improve on the results of the current approaches. These new approaches are apparently yielding much better results than ideas like morpho-syntactic approaches to SMT which have also had small imporvements.
The presentation here presents some of the new perspectives in AI and Neural MT versus the traditional SMT NLP views in a relatively understandable way.
I have also noted that Microsoft appears to have seriously stepped up their MT technology of late. They are attempting to solve the most difficult automated translation challenges in the world today, specifically:
- Facebook comments (so I now understand what Clio Schils, Renato and my Russian Facebook friends are talking about)
- Skype based multilingual voice conferences
- Customization with limited sets of training data using AI and allowing four levels of customization.
I found out that Prince died today and while I was never a real fan, I think his guitar solo here is quite exceptional, and should leave no doubt to his amazing musical ability. Theatrics aside, it is the notes and varied textures that he plays that make it so special, and wins the respect and admiration of the other band members. It is worth a listen. May he rest in peace.