It has been sometime since I first wrote a blog
post about MT humor primarily because I really have not been able to find anything worth the
mention, until now, and except for some really lame examples about how MT mistranslates (sic) I have not seen much to laugh heartily at. It seems a group of
people on the web have discovered the humorous possibilities of MT in translating song
lyrics which might be difficult even for good human translators. (It really seems
strange to be saying “human translator”.)
I should point out that in all these recent cases one does have to work at degrading the translation quality by running the same text through a whole sequence of preferably not closely related languages.
It has often surprised me that there are some in the MT industry who use “back translation” as a way to check MT quality, as from my vantage point it is an exercise that can only result in proving the obvious. MT back translation by definition should result in deterioration since to a very great extent MT will almost always be something less than a perfect translation. This point seems to evade many who advocate this method of evaluation, so let me clarify with some mathematics as math is one of the few conceptual frameworks available to man where proof is absolute or pretty damned certain at least.
If one has a perfect MT system then the Source and Target segments should be very close if not exactly the same. So mathematically we could state this as:
But in real life where humans play on the internet
and you have DIY MT systems being used to determine what MT can produce, the
results are less likely to equal 1 which is perfect as shown in the example
above.
So lets say you and I do a somewhat serious evaluation of the output of various MT systems (each language direction should be considered a separate system) and find that the following table is true for our samples by running 5,000 sentences through various MT conversions and scoring each MT translation (conversion) as a percentage “correct” in terms of linguistic accuracy and precision.
So if we took 1,000 new sentences and translate them with MT we should expect that the percentage shown above would be “correct” (whatever that means). But if we now chain the results by making the output of one, the input of the other, we will find that results are different and and get continually smaller e.g.
So with MT we should expect that every back test
will result in a lower or degraded results as we are multiplying the effect of two different
systems. Since computers don’t really speak the language one cannot assume that
they have equal knowledge going each way and if you provide a bad source from system A to system B you
should expect a bad target as computers like some people, are very
literal.
So now if we take our example and run it through multiple iterations we should see a very definite degradation of the output as we can see below.
So if you are trying to make MT look silly you have to run it through multiple iterations to get silly results. It would help further if you chose language combinations like EN to Japanese to Hindi to Arabic as this would cause more rapid degradation to the original English source. Try it and share your results in the comments.
So here we have a very nicely done example and you should realize it takes great skill for the lead vocalist to mouth the MT words as if they were real lyrics and still maintain melodic and rhythmic integrity so be generous in your appreciation of their efforts.
This video shows very effectively how using multiple languages very quickly can degrade the original source as you can see when they go to 64 languages. Somehow words get lost and really strange.
And here is one from a vlogger who really enjoys the effect of multiple rounds of MT on a songs lyrics. She is a good singer and is able to maintain the basic melody without breaking into a smile so I found it quite enjoyable and I would not be surprised that some might believe that these were indeed the lyrics of the song. She has a whole collection of recordings and has what I consider are high production values for this kind of stuff.
And she produces wonderful results on this Disney classic "When you paint the colors of your air can" which used to be a favorite of my daughter. I actually think the song from the Little Mermaid is much funnier and was done by just running it only through four iterations in Google Translate, but since I could not embed it here directly I have given the link.
Here is another person who has decided that 14 iterations is enough to get to generally funny with this or any pop song. I'm not sure how funny this really is since I don't know the original song.
So it appears that we are going to see a whole class of songs that are re-interpreted by Google Translate and it is possible to get millions of views as MKR has, and probably even make a living doing this. So here you see one more job created by MT.
So anyway if somebody suggests doing a back test with MT you should know the cards are clearly stacked against the MT monster and the results are pretty close to meaningless. A human assessment of a targeted sample set of sentences is a much better way to understand your MT engine.
Hope you all had a good Thanksgiving vacation and are not feeling compelled to shop too fervently now.
In this time of strife and distrust in Ferguson it is good to see spontaneous goodwill and instant musical camaraderie between these amateur musicians.
My previous posts on MT humor for those who care are:
I should point out that in all these recent cases one does have to work at degrading the translation quality by running the same text through a whole sequence of preferably not closely related languages.
It has often surprised me that there are some in the MT industry who use “back translation” as a way to check MT quality, as from my vantage point it is an exercise that can only result in proving the obvious. MT back translation by definition should result in deterioration since to a very great extent MT will almost always be something less than a perfect translation. This point seems to evade many who advocate this method of evaluation, so let me clarify with some mathematics as math is one of the few conceptual frameworks available to man where proof is absolute or pretty damned certain at least.
If one has a perfect MT system then the Source and Target segments should be very close if not exactly the same. So mathematically we could state this as:
Source (1) x Target (1) = 1
since in this case we
know our MT system is perfect ;-)
So lets say you and I do a somewhat serious evaluation of the output of various MT systems (each language direction should be considered a separate system) and find that the following table is true for our samples by running 5,000 sentences through various MT conversions and scoring each MT translation (conversion) as a percentage “correct” in terms of linguistic accuracy and precision.
Language Combination | Percentage Correct |
English to Spanish | 0.8 or 80% |
Spanish to English | 0.85 or 85% |
English to German | 0.7 or 70% |
German to English | 0.75 or 75% |
So if we took 1,000 new sentences and translate them with MT we should expect that the percentage shown above would be “correct” (whatever that means). But if we now chain the results by making the output of one, the input of the other, we will find that results are different and and get continually smaller e.g.
EN > ES > EN = .8 x .85
= 0.68 or 68% correct
EN > DE > EN = .7 x .75
= 0.525 or 52.5% correct
So now if we take our example and run it through multiple iterations we should see a very definite degradation of the output as we can see below.
EN > ES > EN(from MT) > DE > EN = .8
x .85 x .7 x .75 = 0.357 or 35.7%
So if you are trying to make MT look silly you have to run it through multiple iterations to get silly results. It would help further if you chose language combinations like EN to Japanese to Hindi to Arabic as this would cause more rapid degradation to the original English source. Try it and share your results in the comments.
So here we have a very nicely done example and you should realize it takes great skill for the lead vocalist to mouth the MT words as if they were real lyrics and still maintain melodic and rhythmic integrity so be generous in your appreciation of their efforts.
This video shows very effectively how using multiple languages very quickly can degrade the original source as you can see when they go to 64 languages. Somehow words get lost and really strange.
And here is one from a vlogger who really enjoys the effect of multiple rounds of MT on a songs lyrics. She is a good singer and is able to maintain the basic melody without breaking into a smile so I found it quite enjoyable and I would not be surprised that some might believe that these were indeed the lyrics of the song. She has a whole collection of recordings and has what I consider are high production values for this kind of stuff.
And she produces wonderful results on this Disney classic "When you paint the colors of your air can" which used to be a favorite of my daughter. I actually think the song from the Little Mermaid is much funnier and was done by just running it only through four iterations in Google Translate, but since I could not embed it here directly I have given the link.
Here is another person who has decided that 14 iterations is enough to get to generally funny with this or any pop song. I'm not sure how funny this really is since I don't know the original song.
So it appears that we are going to see a whole class of songs that are re-interpreted by Google Translate and it is possible to get millions of views as MKR has, and probably even make a living doing this. So here you see one more job created by MT.
So anyway if somebody suggests doing a back test with MT you should know the cards are clearly stacked against the MT monster and the results are pretty close to meaningless. A human assessment of a targeted sample set of sentences is a much better way to understand your MT engine.
Hope you all had a good Thanksgiving vacation and are not feeling compelled to shop too fervently now.
In this time of strife and distrust in Ferguson it is good to see spontaneous goodwill and instant musical camaraderie between these amateur musicians.
My previous posts on MT humor for those who care are: