Decouple the actions
Decouple the actions
Posted Aug 20, 2024 14:54 UTC (Tue) by khim (subscriber, #9252)In reply to: Decouple the actions by yeltsin
Parent article: FreeBSD considers Rust in the base system
> It's still a helpful tool (or they wouldn't be using it) in the sense that it's better than nothing, bit it has a looooooong way to get to get to the level of an *average* translator who can at least produce something coherent that gets the point across, even if it's not written in Shakespearean style.
Are you sure? I remember how documentation that was received from Chinese companies looked like before advent of LLMs and that description was true for the majority of it:
> the original "translation" is often impossible to understand for someone not trained in deciphering it.I just had to accept the fact that this is what average human translator produces.
P.S. I have no idea what causes that effect, BTW: I know that documentation translated from Russian or German was never that bad (in both directions), both with humans and LLMs, but something in Chineese just makes it impossible to translate adequately without someone who fully understands what these sentences actually mean, regular translators without knowledge of specific technology always produced garbage and LLMs still do that.
Posted Aug 20, 2024 20:33 UTC (Tue)
by Wol (subscriber, #4433)
[Link] (9 responses)
It's presumably because English, German and Russian are all Indo-European languages, and as such are all descended from a fairly recent ancestor. I remember reading something about "four waves of languages" and Indo-European belongs to wave 4. I believe Hungarian and Finnish are wave 2 languages, and while their vocabulary is completely different, they share a similar grammatical structure.
I think Gaelic, Basque, Catalan might be wave 3. Where Chinese, Japanese etc fit I don't have a clue.
But the point is that the structure of European languages is similar, so it's mostly a case of translate the individual words, be aware of idiom, and converting a crude translation to a good one isn't that much work. Start translating into a language from a completely different group, and many concepts may become completely untranslateable, Language shapes your view of the world just as much as your view of the world shapes language. A good example is "Borgeois, Burgerlich, Middle-class". Three words, three languages, the same basic concept, but each word is unique to its language, and while a naive translator might think they are the same, they all three mean something rather different one from the other. Indeed, I don't even know that any of those have an exact translation into any of the other languages.
Imagine that in three closely related languages. Now extend that to massively more different languages ...
(I see that - slightly differently - all the time at work. I have Polish, a Chinese, and an Indian colleague. Because these languages all differ in the sounds they use, I have difficulty hearing them clearly, and they have difficulty hearing me clearly.)
Cheers,
Posted Aug 20, 2024 22:26 UTC (Tue)
by NYKevin (subscriber, #129325)
[Link] (1 responses)
(For the curious: I found https://en.wikipedia.org/wiki/Chinese_grammar a helpful starting point, but I must admit that I don't know Chinese myself, so I have no idea how accurate it is.)
Posted Aug 21, 2024 0:21 UTC (Wed)
by Cyberax (✭ supporter ✭, #52523)
[Link]
Another thing that especially trips documentation writers is passive voice. It's rarely used in Chinese, unless talking about something serious ("he was hit by a car" type serious). A sentence like "once a job is processed" is difficult to translate word-for-word.
Posted Aug 21, 2024 6:46 UTC (Wed)
by viro (subscriber, #7872)
[Link] (3 responses)
Posted Aug 21, 2024 10:46 UTC (Wed)
by Wol (subscriber, #4433)
[Link] (2 responses)
And actually - from what you say - I suspect the language of Rabbie Burns may be *further* from BBC English than Catalan from Spanish (I was under the impression that Catalan might have been a pre-existing language swallowed up into Spain. Bit like Welsh and English). Rabbie spoke Scots (aka "the language of the Angles"), while BBC is English (aka "the language of the Saxons").
And where does Basque fit into all this?
Cheers,
Posted Aug 21, 2024 15:24 UTC (Wed)
by anselm (subscriber, #2796)
[Link]
Robert Burns is actually not a great example of Scots because in many cases he would tone down his Scots to be rather more like English (he had an audience to consider, after all). The Scots language, as distinct from other varieties of English in Britain, became its own thing only by the 15th century, when “Angles vs. Saxons” hadn't been an issue for close to a millennium or so.
Basque is really an outlier because it is the only surviving language in Europe that is not somehow related to some other language. The general thinking is that early Basque developed before Indo-European languages (such as Celtic or Romance languages) reached the area. Basque has now assimilated various words from its neighbours but the grammar is still considerably different from Indo-European languages.
Posted Aug 21, 2024 16:38 UTC (Wed)
by viro (subscriber, #7872)
[Link]
Posted Aug 22, 2024 3:43 UTC (Thu)
by kenmoffat (guest, #4807)
[Link] (2 responses)
But your point that all of these have similar sentence structures whereas Chinese dialects/languages (the choice of which they are is a political choice) are completely different is very true. As is the difficulty of hearing correctly - e.g. in Han or (ex-Han languages such as Korean) sounds which we English-speakers can distinguish such as l,n,r are hard for speakers of those languages to differentiate, and I'm certain the corollary is true for certain of their sounds which we did not learn as children.
Posted Aug 22, 2024 3:48 UTC (Thu)
by kenmoffat (guest, #4807)
[Link] (1 responses)
Posted Aug 23, 2024 10:05 UTC (Fri)
by Wol (subscriber, #4433)
[Link]
Cheers,
Decouple the actions
Wol
Decouple the actions
Decouple the actions
Decouple the actions
Decouple the actions
Wol
Decouple the actions
Rabbie spoke Scots (aka "the language of the Angles"), while BBC is English (aka "the language of the Saxons").
And where does Basque fit into all this?
Decouple the actions
Decouple the actions - languages
Decouple the actions - languages
Decouple the actions - languages
Wol
