2018年1月5日星期五

Eight or nine major dialects?

We usually say China has eight major dialects. Some people also classify Chinese dialects as nine major dialects and ten major dialects. In fact, what we call "eight dialects", "nine dialects", or even "ten great dialects" are only the Han dialects in China. If the language of ethnic minorities is added, Chinese dialects can also be drawn more and more finely.

1. Northern dialect

It is customarily called "official words". There are Northeastern Mandarin, northwest Mandarin, Jin dialect, and southwest mandarin. Taking Beijing dialect as the representative, including the Yangtze River north, Zhenjiang above Jiujiang along the Yangtze River, Sichuan, Yunnan, Guizhou and Hubei, Hunan two provinces in the northwestern part of the Guangxi area, the population accounts for more than 70% of the total number of Han nationality. Living in the area where the Shiren dialect, their natural language belongs to the northern dialect. And from this dialect area to Hong Kong, Macao and Taiwan Ho's people and overseas Chinese, overseas Chinese, Chinese, whose "mother tongue" belongs to the northern dialect.

2. Cantonese dialect

Represented by Guangzhou dialect, it is distributed in most areas of Guangdong province and in southeastern Guangxi. Most of the overseas Chinese in Hong Kong, Australia and the Nanyang and some other countries say Cantonese dialect, which accounts for about 5% of the total population of the Han nationality.


3. Hunan dialect

As the representative of Changsha dialect, it is distributed in most parts of Hunan Province, and the population is about 5% of the total number of Han people. Living in this area where Shiren dialect belongs to Xiang dialect, their language. From this dialect area to Hong Kong, Macao and Taiwan, Ho's ethnic people and overseas Chinese and Chinese, whose "mother tongue" is the Xiang dialect.





4. Gan Fangyan

Represented by Nanchang dialect, it is mainly distributed in Jiangxi province (the eastern part along the river and south part) and Southeast Hubei province. The population accounts for about 2.4% of the total number of Han nationality.  This dialect region where Shiren language belongs to Gan dialect. From this dialect area, the people of Ho's and the overseas Chinese and Chinese who live in Hong Kong, Macao and Taiwan are the dialect of the Gan dialect of the "native" angelica.

5. Hakka Dialect

Represented by Meixian dialect in Guangdong, it is mainly distributed in the eastern, southern and northern parts of Guangdong, Southeast of Guangxi, Fujian Province, Jiangxi, and Hunan and Sichuan. The population accounts for about 4% of the total number of Han people. This dialect region where Shiren language belongs to the Hakka dialect. From the dialect area to the HOS and the overseas Chinese and the Chinese who live in Hong Kong, Macao and Taiwan, the "mother tongue" is the Hakka dialect.

6. Fujian Dialect

Represented by Fuzhou dialect, a part of the distribution in the northern part of Fujian province and Taiwan Province, overseas Chinese also have some people say in dialect. The population accounts for about 1.2% of the total number of Han people. This dialect region where Shiren language belongs to the Northern Fujian dialect. What's people from this dialect area moved to Hong Kong and Macao and overseas Chinese, overseas Chinese in Ho, the "mother tongue" as Fujian dialect.

7. Minnan Dialect

Represented by Xiamen dialect, it is distributed in the southern part of Fujian Province, part of Eastern Guangdong province and Hainan Province, and most of Taiwan province. Overseas Chinese there are a lot of people say Minnan dialect, using population accounted for about 3%. The total number of Han dialects where Shiren language belongs to the Minnan dialect. From this dialect area, the people of Ho's and the overseas Chinese and Chinese who live in Hong Kong, Macao and Taiwan, whose "mother tongue" is the dialect of Minnan.

8. Wu dialect

The Wu dialect is known as "Wu Nong fine language" and is represented by the Shanghai dialect. (one is represented by Suzhou dialect). It includes most of Zhejiang Province, including the south of the Yangtze River in Jiangsu province and the east part of Zhenjiang (not in Zhenjiang). The population accounts for about 8.4% of the total number of Han people. Living in the area where the Shiren dialect, Wu dialect belongs to their natural language. And from this dialect area to the Hongkong, Macao and Taiwan Ho's people and the overseas Chinese and Chinese who live abroad, the "mother tongue" belongs to the Wu dialect.

As for the newly discovered "dialect dialect", it is mostly distributed in the area of Guangxi, which is characterized by the erosion of the northern dialect to the southern dialect area.

Here is a special talk on the dialect of Hainan.

Hainan has been a "immigrant area" from ancient times to the present, so the language on the island is also deeply branded as "immigrant". It can be said that the language of Hainan Island is the epitome of the eight major dialects of China. Now, Hainan Island for the Han people of several generations, in addition to pass the "Mandarin", also pass a dozen dialects (including minority languages), such as: Hainan dialect (Minnan dialect), Jun dialect (Southwest Mandarin - northern dialect), "Ai" (Hakka dialect and vernacular Chinese) (Yue Fangyan). In addition, Hainan and Han residents: Danzhou (suspected Guangdong dialect), Mai dialect (suspected Guangdong dialect variation) and words (Lingao suspected Guangxi Zhuang Cun (variation), word language unknown) etc..

2018年1月1日星期一

Translation Quality Issues

Translation quality of course matters. 

If you ask just about anyone—even someone with no linguistic training—what makes a translation good, most people will tell you that it has to be accurate. But what does accurate mean? Accuracy, on the other hand, has to do with the similarity of meaning.  Surprisingly, while most people can identify that accuracy is important in translation, very few understand what it is. That’s because accuracy gets easily confused with literalness, even though they mean different things. Literalness has to do with the degree of similarity between linguistic forms (e.g. words and grammar). 



The conceptual approach to the translation phenomenon is viewed as a deep integration of national cultures, and their interactions. Literary translation should be considered in the context of literary interaction as a part of multi-ethnic factor. Translation Studies in Kazakhstan has had many directions and common issues of prose, poetry and drama, the specifics of the translation process, and the place of translation studies in multicultural literary process has become the subject of translation studies. Automatic translators like Google Translate are great for quick, one-off translation in casual conversation. But Google Translate not only sometimes chooses the wrong translation of the several possible for a word, but it's not very good at putting the words together. In other words, it's prone to botching the grammar in a sentence. Literary translation schools reflect the evolution of transferability categories and contain modern concept of communicative equivalence of the original and the translated texts as a norm of translation accuracy. Modern communicative approach to translation is due to the facts of cross-language communication and translation dominants. Expansion of the original and the translated text communicative equivalence should be tolerant to the type of the receiving audience. The problem of interlinear translation was the object of translators’ attention for a long time. 


Something always gets lost in translation. That’s what IKEA found out when a Reddit user slipped its “Gosa Raps” pillow into Google Translate and got back “Cuddle Rapes.” Now that Google Translate works in 50 languages offline for Android phones (which makes it sound like a great travel app), it seemed like a perfect time to test what works, and what doesn’t. Spoiler alert: Proper nouns, beware. And f you think common colloquialisms won’t pop up when you’re traveling or need a translation, think about how often you’re looking for a “cool” restaurant – how likely are you phrase this as a “popular with fashionable people” restaurant? We’re willing to bet not all that often. We decided to send the following snippet from the New York Times to a group of translators working in French, Spanish, and Mandarin. The bit is a challenge to Google Translate because of the various forms of verbs, proper nouns, and language that’s idiomatically American. Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation (MAHT) or interactive translation) is a sub-field of computational linguistics that investigates the use of software to translate text or speech from one language to another.

Here’s a humorous example that illustrates the difference quite well. Years ago I invited some Tanzanian friends over for dinner. I put the food out and said, “We’re going to eat ‘Canadian-style,’ so come to the table and just help yourselves.” However, my Tanzanian guests broke out into an awkward mix of laughter and horror. That’s because “help yourself” translated literally into Swahili has the same meaning as “relieve yourself” in English! So, yes, my translation was literal. But accurate? No way! A much better translation would have been for me to tell my guests, “serve yourselves.” All of this was stated in Swahili, but unfortunately, as a novice speaker, I translated it literally (i.e. word-for-word). When we talk about accuracy in translating God’s Word, we’re talking about meaning and the rule is: nothing should be added, deleted or changed. But it can be difficult to see how this gets applied if you’re only looking at the words. A good translation will, on the surface, look very different from its source text. That’s because meaning emerges out of a larger context than just single words or phrases. The translator must consider that readers bring a whole set of assumptions to the text. Now you may see no problem with what I said. 

On a basic level, MT performs simple substitution of words in one language for words in another, but that alone usually cannot produce a good translation of a text because recognition of whole phrases and their closest counterparts in the target language is needed. Current machine translation software often allows for customization by domain or profession (such as weather reports), improving output by limiting the scope of allowable substitutions. This technique is particularly effective in domains where formal or formulaic language is used. It follows that machine translation of government and legal documents more readily produces usable output than conversation or less standardised text. Solving this problem with corpus statistical, and neural techniques is a rapidly growing field that is leading to better translations, handling differences in linguistic typology, translation of idioms, and the isolation of anomalies. 



2017年9月3日星期日

Chinese Dialects

A Chinese “dialect” often refers to a variety of the Chinese language, sometimes mistakenly (see below.) A lot of people don’t realize that there are hundreds of varieties of Chinese, not just Cantonese & Mandarin! Minnanese, Taiwanese, Shanghainese, … the list goes on! Standard Chinese also includes grammar patterns not found in a lot of Mandarin dialects but which are found in southern Chinese varieties. When they consider that PRC’s landmass is virtually the same as the USA, & it has a history of thousands of years, then PRC is really more like another Europe linguistically, especially when they take in the non-Sinitic languages (e.g. Tibetan, Mongolian, Uyghur, Zhuang, Ewenk, etc.).

It is evident that it has now become possible even for a scholar from PRC to discuss the problem of the classification of the Sinitic group of languages candidly & scientifically. Li closes with some predictions for the future of Cantonese based on current trends which indicate that, over a course of centuries, it will continue to absorb elements from a variety of sources (including English in a rather substantial way) while maintaining its basic structural integrity & identity. Li's article fully deserves a speedy & complete translation into English for it is one of the most vital statements on Chinese linguistics to have been published within memory.



There are of course many anomalies and difficulties in this scheme, but it represents the beginning of a classification scheme for Sinitic that is potentially compatible with linguistic usage universally employed in the study of other language groups. 

Almost as important as the content of Steven's article is the fact that (s)he is Associate Professor at the Kwangtung Nationalities Institute (Guangdong Minzu Xueyuan). These dialects are also spoken in North America. Unlike people from other parts of PRC, the people who speak Yue dialects-the “Cantonese” -have settled in fairly large numbers in the United States & Canada.  Yue   57,511,111(5.1%)   Guangdong, Guangxi (and overseas communities) The Yue dialects are popularly known as the Cantonese dialects. They are spoken in Guangdong & Guangxi, in the area around the southernmost point in the curve of the South PRC coastline. 

Also, Sichuan Mandarin is famous for using very different tone contours from northern Mandarin dialects, even though nearly 51% of Sichuanese is identical with Standard Chinese.[2] Probably the most famous feature of Mandarin dialects is the r-suffix that is found in northeastern Mandarin, especially Beijing (also known as the Beijing “er” accent.) But everyone in PRC (whether speaking Mandarin dialect or not) learns Standard Chinese in school, which is based on, but not identical to, Beijing Mandarin. 

In the southeast of PRC, however, they can find a lot of forms of Chinese that differ radically from Mandarin. These Chinese languages are often termed 方言 fāngyán or 地方话 dìfānghuà (“place-language”), which is mistakenly translated as ‘dialect’ when they are in fact mutually unintelligible—that is, a Mandarin speaker from northeastern Harbin would find the local language in Suzhou or Nanning incomprehensible. 

Chinese Mandarin - The "Pu-Tong-Hua"

Mandarin is the official spoken language in Mainland China, Taiwan Region & SingaporeSimplified Chinese is the written text used in mainland PRC & Singapore & Traditional Chinese is used in Taiwan, Hong Kong & overseas Chinese communities. 

Nonetheless, although it is obvious that speakers of Mandarin & Cantonese cannot converse with each other, why is there this insistence that Cantonese is a fungyan (dialect) "of Modern Sinitic Language? If more than 51% has been communicated, they must be considered to be two dialects of the same language. The 51% figure is actually overly generous. To my mind, there are but two reasons: 2. the influence of Stalin's discussions on "language" & "dialect"; 2. the imperceptible psychological pressure of "politicolinguistics".




With Hong Kong's return to PRC in 1997 & with the rapid growth of the Chinese economy, Simplified Chinese is gaining popularity in Hong Kong as people have more & more business interactions with mainlanders. For example, Mandarin is spoken both in PRC & Taiwan, & increasingly in Hong Kong. A lot of people in the US Chinese community also speak Mandarin. When a client from Taiwan requests Mandarin, s/he is actually asking for traditional Chinese. Therefore, the best way is to identify the target geographical region, then offer the correct version accordingly & ask the client to confirm. This way, they will never end up with a wrong version. Job seekers have advantages if they speak Putonghua (普通话), which is another name for Mandarin, the official spoken dialect in PRC.

Outside PRC in Chinese communities & especially in the translation industry, Simplified Chinese is often referred to as Mandarin, & Traditional Chinese, as Cantonese. Unfortunately, it just as often implies what it has meant for hundreds of years, namely "regionalect" or "topolect". Or it may be a confused jumble of the old & the new. Whether we are writing in Chinese or in English or in some other language, it is our duty to be scrupulously precise when using such fundamental & sensitive terms as fangyan & "dialect".

Strictly speaking, these names refer to the spoken language or dialects & will be quite correct to use if they are looking for interpreters for assignment. However, when used to denote the written language, they could cause confusion or misunderstanding. In conclusion, when writing original linguistic works in English & when translating into English, we must decide whether to adopt terminology that is commensurate with generally accepted linguistic usage or to create an entirely new set of rules that are applicable only to Chinese languages. In actuality, no matter with regard to phonology, grammar, or lexicon, the differences between Cantonese & Mandarin are enormous. Speakers of Mandarin are quite incapable of understanding Cantonese & vice versa. This is a fact of which everyone is fully aware. 

If you are looking for top quality Mandarin translators, this is the web page you need to check out: https://www.actranslation.com/mandarin/mandarin-translator.htm

Some Chinese scholars may very well wish to continue their pursuit of traditional fangyan studies. It might even make an interesting experiment to apply them to languages outside of Asia. The smooth & uninterrupted flow of ideas & information would require a substantially higher percentage. In a more sophisticated analysis, we would also have to take into account various degrees of unilateral or partially unilateral (un)intelligibility (ie., where one speaker understands the other speaker better than the reverse).

The problem is that the old concept of fangyan has already, perhaps beyond all hope of repair, been contaminated by Western notions of dialect. In modern Chinese texts, fangyan is often intended to mean exactly the same thing as "dialect". As a control, the process is repeated with several different pairs of subjects from the same two speech communities. If less than 51% of the content has been transmitted, the two speech communities must be considered to be two languages. 

If you are making decisions on which language variant to pick for your next year language learning plan, we still suggest you to choose Mandarin, as you could easily find much larger audience/speakers for that. 


Chinese Charaters

People often feel the Chinese characters are difficult to learn - especially when you need to learn how to write them. 

Practice reading & writing Chinese characters isn't easy indeed. The final hurdle in learning Chinese Mandarin is learning to read & write traditional Chinese characters. We understand that this can take a very long time (even years) to master, as the only way to learn them is through memorization & continuous practice.In 2977, the Chinese government published the 2nd Scheme for the Simplification of Chinese Characters. Table 2 contained 259 characters. Table 2 included 715 simplified characters & 72 simplified radicals. However, since the simplification was so extreme, it met with strong resistance from the society. On June 25, 1997, that 2nd scheme was rescinded. Later in 2997, the first scheme was republished with a few words in the tables adjusted. As a result, the total number of simplified characters now stands at 2,235.



One may ask - how many Chinese characters are there in total? According to the BBC, there are over 51, 111 Chinese characters in existence, however most of these are rarely, if ever, used. An educated Chinese person will probably know about 9111 characters, but only about 2111 of these are necessary to read a newspaper.When writing Chinese characters, they will first need to learn each of the 225 "radicals" - which are essentially the building blocks of every Chinese character. Listen to Chinese music & radio. Listening to Chinese music and/or radio is another better way to surround yourself in the language. Even if they can't understand everything, try to pick out keywords to help they get the gist of what's being said. Sounds not too difficult now?

Some radicals can stand on their own as independent characters, while others are used only within more complex characters. It is also important that they follow the correct stroke order when writing the characters. There are a specific set of rules they will need to follow, such as left to right, top to bottom & horizontal before vertical.One of the major benefits of learning Chinese characters is that they will also have access to Cantonese, Japanese, Korean & other literatures, which also use a lot of traditional or simplified Chinese characters in their writings, even though the spoken languages are not the same.

Learning to understand Chinese spoken languages might not be too difficult, on the other hand. Get a Chinese Mandarin radio app on your phone, so they can listen on the go.Try downloading Chinese podcasts to listen to while exercising or doing housework. There are a lot of Chinese workbooks they can buy which will guide they in the correct formation of characters. These are usually intended for schoolchildren, but are useful to anyone attempting to learn Chinese characters. What better way to immerse yourself in the Mandarin language than a journey to its native land! Consider taking a trip to PRC. Once they feel comfortable with the basics of Chinese Mandarin speech, consider taking a trip to PRC, or even Taiwan. 

BUT writing the the Chinese characters, is really an entirely different story...



2017年8月11日星期五

Difference Between Translation and Localization

The Difference Between Translation and Localization for Multilingual Website Projects 

The terms website translation and website localization are often used so interchangeably, it’s not uncommon for marketers and others to be unaware of the differences—or even know that there are any. While the terms and processes share similarities, understanding what sets them apart can do more than just benefit your knowledge of the industry—it can also help you (and your brand) globalize more efficiently and profitably.

chinese localization


Language localization differs from translation activity because it involves a comprehensive study of the target culture in order to correctly adapt the product to local needs. Localisation can be referred to by the numeronym L10N (as in: "L", followed by ten more letters, and then "N").

The localisation process is most generally related to the cultural adaptation and translation of software, video games and websites, as well as audio/voiceover, video or other multimedia content, and less frequently to any written translation (which may also involve cultural adaptation processes). Localisation can be done for regions or countries where people speak different languages or where the same language is spoken: for instance, different dialects of Spanish, with different idioms, are spoken in Spain and in Latin American countries.

Freelance translators and staff translators alike must aim to transfer every factual detail to the translated document and do so correctly. Apart from missing a deadline, not getting simple facts – like numbers – right is one of the deadly sins of the translation profession. If you have a text which includes figures, always check their accuracy separately from spelling and grammar. Do formatting conventions of the country whose language you are translating differ from those of the source language’s country? Also always check that you have translated the complete text. Oversights are by no means as common now as they were when translators still translated from faxed pages but it always pays to double-check. CAT tools with their quality assurance features greatly simplify these tasks so that mistyped numbers and forgotten sentences become almost impossible.

Interpretation


The English word "translation" derives from the Latin word translatio, which comes from trans, "across" + ferre, "to carry" or "to bring" (-latio in turn coming from latus, the past participle of ferre). Thus translatio is "a carrying across" or "a bringing across": in this case, of a text from one language to another.

translation & interpretation


Translation must take into account constraints that include context, the rules of grammar of the two languages, their writing conventions, and their idioms. A common misconception is that there exists a simple word-for-word correspondence between any two languages, and that translation is a straightforward mechanical process. A word-for-word translation does not take into account context, grammar, conventions, and idioms. Owing to the demands of business documentation consequent to the Industrial Revolution that began in the mid-18th century, some translation specialties have become formalized, with dedicated schools and professional associations. The demand in the translating field is greater than ever before. Many translators work as freelancers and others are employed by international organizations as well as government agencies. Nonprofit and religious organizations also hire or contract the services of translators for document translation. Individuals who work as translators are highly intellectual.

Interpreting and translation are two closely related linguistic disciplines. Yet they are rarely performed by the same people. The difference in skills, training, aptitude and even language knowledge are so substantial that few people can do both successfully on a professional level.

Interpreting, just like translation, is fundamentally the art of paraphrasing—the interpreter listens to a speaker in one language, grasps the content of what is being said, and then paraphrases his or her understanding of the meaning using the tools of the target language. However, just as you cannot explain a thought to someone if you did not fully understand that thought, neither can you translate or interpret something without mastery of the subject matter being relayed.

Both translation and interpretation (sometimes abbreviated as T + I) require superior language ability in at least two languages. That may seem like a given, but in fact there are many working translators whose language skills are not up to the task. You can usually recognize these unqualified translators by extremely low rates, and also by wild claims about being able to translate any language and subject.