CZ:Romanization: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>J. Noel Chiappa
(Improve lead para (lost a much better version because I didn't want to hit "Save" too often - %R@*$%R@&!@*&^$%!!!!!)
imported>J. Noel Chiappa
(Transfer a lot of good content from CZ:Proposals/Romanization)
Line 1: Line 1:
The Citizendium will sometimes require the use of words (for example, names of people, and placenames) which are in languages which are not written in the Latin (Roman) alphabet used for written English. These words should appear in articles in a form that readers unfamiliar with the original languages will understand. To do this, these words must be 'transliterated' from one script into the Latin (Roman) alphabet; this process is known as 'romanization'.
{{editintro}}


However, in many cases there are several competing standards for writing the language in romanized form; for example, there are two major ways of romanizing Japanese. Even within prescribed romanization methods, there may be variations in spelling and style. To take just one example, the Japanese word for a small police station (交番 or こうばん) could be written kouban, kōban, kooban or koban - four versions in the same 'Hepburn' romanization system. Other languages such as Chinese Hanzi script are even more complicated with dozens of competing romanization methods covering a large variety of dialects of Chinese.
The Citizendium will sometimes require the use of words (for example, names of people, and place-names) which are in languages which are not written in the Latin (Roman) alphabet used for written English. These words should appear in articles in a form that readers unfamiliar with the original languages will understand. To do this, these words must be 'transliterated' from their native script into the Latin (Roman) alphabet; this process is known as 'romanization'.


If individual authors are allowed to choose the form of Romanization independantly then we will quickly end up with a mess. If one article talks about Guizhou (a province of China), another article mentions Kweichow, and a thrid article discusses Kwei-chow, then the reader will be confused and left wondering if these are all different places or really the same place. A common standard for of romanization is required throughout all Citizendium articles.
The point is not to convert names and words into English; rather, it is to convert names and words normally written in non-Roman scripts into Roman script. Very few English-speaking people can read 小泉 純一郎 and know that the former Prime Minister of Japan is being referred to. However, it is not a settled matter that his name should be rendered as "Junichiro Koizumi" or "Koizumi Jun'ichirō", or any of the other possible options. Each language will have its own specific Romanization policy on Citizendium, to guide this transliteration process here, on words and names from that language.
 
We should eschew creating our own romanization systems from scratch, as this will impede relatively painless transition from other print media to Citizendium. Innovation in romanization will cost more than it gains. For most languages written in non-Roman script, there are generally already one (or more!) romanization systems, and we should prefer those, precisely for commonality with other written material (which generally will use them). Some of them have taken over Roman symbols for sounds which don't occur in the language in question, and reused them for related sounds in that language which don't have a Roman character; e.g. the 'Q' in the Chinese system [[Hanyu Pinyin]]. However, these formalisms will be known to people with any familiarity with that area, and we can of course provide guides as well.
 
However, as noted, in many cases there are several competing standards for writing the language in romanized form. For example, there are two major ways of romanizing Japanese. Even within prescribed romanization methods, there may be variations in spelling and style. To take just one example, the Japanese word for a small police station (交番 or こうばん) could be written kouban, kōban, kooban or koban - four versions in the same 'Hepburn' romanization system. Other languages such as Chinese Hanzi script are even more complicated, with dozens of competing romanization methods covering a large variety of dialects of Chinese.
 
If individual authors are allowed to choose the form of Romanization independantly, then confusion will quickly result. If one article talks about Guizhou (a province of China), another article mentions Kweichow (the same name in a different romanization system), and a thirdd article discusses Kwei-chow, then the reader will be confused, and left wondering if these are all different places, or really the same place. A common standard for romanization of each language is required throughout all Citizendium articles.


This page provides a defined process to be used in each language to create standards to use when romanizing foreign words on the Citizendium; it also provides links to the policies (and discussion of the policies, which should take place on the appropriate Talk: page) for varous languages. Once agreement is reached, those decisions will form policy.
This page provides a defined process to be used in each language to create standards to use when romanizing foreign words on the Citizendium; it also provides links to the policies (and discussion of the policies, which should take place on the appropriate Talk: page) for varous languages. Once agreement is reached, those decisions will form policy.


{{TOC-right}}
{{TOC-right}}
==Issues==
Discussion should aim at striking a balance between what is culturally and linguistically accurate, versus what is easiest for a reader to understand, where the reader is not a user of the language in question. Some romanization systems favour native speakers' understanding of the distinct sounds of their language, while others distort the rendering of the 'phonemes' to show equivalences with the sounds of English. (For example, one form of Japanese romanization includes chi because the sound is similar, but not the same as English ch sound; however, to native speakers, the sound is more like ti, and is written as such in another system.) Whether to render words strictly accurately, or allow potentially misleading spellings (if these make it easier for English users) is one of the factors to take into account.
===IPA===
The International Phonetic Alphabet, while useful for its intended purpose, is not very much use for this transliteration, because it requires phonetics knowledge which most of our readers do not have in order to be able to understand it. It is not directed toward the same goal as romanization, either: for example, the IPA is used to show the pronunciation of English words as well. The IPA would be a good system if we were looking to represent the phonology of a given non-English word, but that is not the chief goal of romanization.
This is not to say that we should ignore the pronunciation of different terms; indeed, if CZ is to ultimately become a vital resource, help with correct pronunciation would be really useful. While not perfect, IPA is the best thing out there for this, and we are in no way trying to discourage the use of IPA in Citizendium articles; indeed, its use in limited circumstances (such as at the beginning of an article, to show the correct pronunciation of the subject) is desirable.
==Modern Languages==
==Modern Languages==
''Please add titles and language links below as necessary and start discussion on the new pages.''
''Please add titles and language links below as necessary and start discussion on the new pages.''
Words from languages other than English which are written in the Latin alphabet (with or without diacritical marks) should generally be rendered as-is, with pronunciation guides. (Just because an English-speaker might not know how to pronounce "Schlumberger" or "Bad Tölz-Wolfratshausen", we should not rewrite the name phonetically.) Only languages which do not fall within this group need romanizaation policies.


===Middle Eastern Languages===
===Middle Eastern Languages===
Line 23: Line 41:
*[[CZ:Romanization/Sanskrit|Sanskrit]]
*[[CZ:Romanization/Sanskrit|Sanskrit]]


==Also see==
==See also==
*[[CZ:Pronunciation guides]]
*[[CZ:Pronunciation guides]]
*[[CZ:Translation]]
*[[CZ:Translation]]
*[[CZ:Proposals/Romanization]]


{{Content Policy}}
{{Content Policy}}

Revision as of 21:14, 4 May 2008

[edit intro]

The Citizendium will sometimes require the use of words (for example, names of people, and place-names) which are in languages which are not written in the Latin (Roman) alphabet used for written English. These words should appear in articles in a form that readers unfamiliar with the original languages will understand. To do this, these words must be 'transliterated' from their native script into the Latin (Roman) alphabet; this process is known as 'romanization'.

The point is not to convert names and words into English; rather, it is to convert names and words normally written in non-Roman scripts into Roman script. Very few English-speaking people can read 小泉 純一郎 and know that the former Prime Minister of Japan is being referred to. However, it is not a settled matter that his name should be rendered as "Junichiro Koizumi" or "Koizumi Jun'ichirō", or any of the other possible options. Each language will have its own specific Romanization policy on Citizendium, to guide this transliteration process here, on words and names from that language.

We should eschew creating our own romanization systems from scratch, as this will impede relatively painless transition from other print media to Citizendium. Innovation in romanization will cost more than it gains. For most languages written in non-Roman script, there are generally already one (or more!) romanization systems, and we should prefer those, precisely for commonality with other written material (which generally will use them). Some of them have taken over Roman symbols for sounds which don't occur in the language in question, and reused them for related sounds in that language which don't have a Roman character; e.g. the 'Q' in the Chinese system Hanyu Pinyin. However, these formalisms will be known to people with any familiarity with that area, and we can of course provide guides as well.

However, as noted, in many cases there are several competing standards for writing the language in romanized form. For example, there are two major ways of romanizing Japanese. Even within prescribed romanization methods, there may be variations in spelling and style. To take just one example, the Japanese word for a small police station (交番 or こうばん) could be written kouban, kōban, kooban or koban - four versions in the same 'Hepburn' romanization system. Other languages such as Chinese Hanzi script are even more complicated, with dozens of competing romanization methods covering a large variety of dialects of Chinese.

If individual authors are allowed to choose the form of Romanization independantly, then confusion will quickly result. If one article talks about Guizhou (a province of China), another article mentions Kweichow (the same name in a different romanization system), and a thirdd article discusses Kwei-chow, then the reader will be confused, and left wondering if these are all different places, or really the same place. A common standard for romanization of each language is required throughout all Citizendium articles.

This page provides a defined process to be used in each language to create standards to use when romanizing foreign words on the Citizendium; it also provides links to the policies (and discussion of the policies, which should take place on the appropriate Talk: page) for varous languages. Once agreement is reached, those decisions will form policy.

Template:TOC-right

Issues

Discussion should aim at striking a balance between what is culturally and linguistically accurate, versus what is easiest for a reader to understand, where the reader is not a user of the language in question. Some romanization systems favour native speakers' understanding of the distinct sounds of their language, while others distort the rendering of the 'phonemes' to show equivalences with the sounds of English. (For example, one form of Japanese romanization includes chi because the sound is similar, but not the same as English ch sound; however, to native speakers, the sound is more like ti, and is written as such in another system.) Whether to render words strictly accurately, or allow potentially misleading spellings (if these make it easier for English users) is one of the factors to take into account.

IPA

The International Phonetic Alphabet, while useful for its intended purpose, is not very much use for this transliteration, because it requires phonetics knowledge which most of our readers do not have in order to be able to understand it. It is not directed toward the same goal as romanization, either: for example, the IPA is used to show the pronunciation of English words as well. The IPA would be a good system if we were looking to represent the phonology of a given non-English word, but that is not the chief goal of romanization.

This is not to say that we should ignore the pronunciation of different terms; indeed, if CZ is to ultimately become a vital resource, help with correct pronunciation would be really useful. While not perfect, IPA is the best thing out there for this, and we are in no way trying to discourage the use of IPA in Citizendium articles; indeed, its use in limited circumstances (such as at the beginning of an article, to show the correct pronunciation of the subject) is desirable.

Modern Languages

Please add titles and language links below as necessary and start discussion on the new pages.

Words from languages other than English which are written in the Latin alphabet (with or without diacritical marks) should generally be rendered as-is, with pronunciation guides. (Just because an English-speaker might not know how to pronounce "Schlumberger" or "Bad Tölz-Wolfratshausen", we should not rewrite the name phonetically.) Only languages which do not fall within this group need romanizaation policies.

Middle Eastern Languages

East Asian languages

Ancient Languages

See also


Citizendium Content Policy
Approval Standards | Article Mechanics | Subpages | Importing material from other sources | Citable articles
How to Edit
Getting Started Organization Technical Help
Policies Content Policy
Welcome Page