Khmer character picker

ក ខ គ ឃ ង
ច ឆ ជ ឈ ញ
ដ ឋ ឌ ឍ ណ
ត ថ ទ ធ ន
ប ផ ព ភ ម
យ ល ឡ រ អ
ស ហ វ    


៌ ៉ ៊



្
្ក ្ខ ្គ ្ឃ ្ង
្ច ្ឆ ្ជ ្ឈ ្ញ
្ដ ្ឋ ្ឌ ្ឍ ្ណ
្ត ្ថ ្ទ ្ធ ្ន
្ប ្ផ ្ព ្ភ ្ម
្យ ្ល ្ឡ ្រ ្អ
្ស ្ហ ្វ    
ឥឦឧឩឪឯឰឱឲឳឫឬឭឮ
្ឧ្ឯ្ឫ្ឬ
ាាំិីឹឺុុំុះូួេេះែៃោោះៅើឿៀំះ
ៈ់៍ ៎ ៏ ័
០១២៣៤៥៦៧៨៩៛
ៗ។ល។៚។៕៖៙
 

​
advanced
 
bbប ccច+ជ+ឆ+ឈ chcʰឆ+ឈ ddដ+ត+ឌ ffហ្វ hhហ+ស kkក+គ+ខ+ឃ khkʰខ+ឃ llល+ឡ mmម nnណ+ន ñɲញ ŋŋង ppព+ប+ប៉+ផ+ភ phpʰផ+ភ qʔអ+ក+គ+ខ+ឃ rrរ ssស ttត+ទ+ដ+ឋ+ឌ+ឍ+ថ+ធ thtʰឋ+ឍ+ថ+ធ wwវ yjយ រ+ហ
aːɔː្ ៌៉៊់៍៎៏័ៗ។ល។៚។៕៖៙
aa័+ា់+ៈ aaaːា aəaəើ aeaeែ+ឯ ahahះ amamាំ aŋaŋាំង+ំ aoaoោ+ឧ+ឱ+ឲ awawៅ+ឳ ayaj័យ+ៃ+ឰ ɑɑ់ ɑɑɑː ɑhɑhោះ+ស់ ɑmɑmំ+ម់ eeិ eeeːេ eə̆eə̆័+ា់+ៈ eə̆heə̆hះ+ាះ eə̆ŋeə̆ŋាំង ehehិះ+េះ+ែះ+ែស eieiេ
əəិ+ឹ+េ+ឥ əəəːើ əhəhើះ+ើះ əɨəɨឺ əwəwូវ+ឪ əyəjិយ+ី+ឥ+ឦ ɛɛិ ɛɛɛːែ iiិ iiiːិយ+ី iəiəា+ៀ ihihិះ+េះ ɨɨិ+ឹ+េ+ឥ ɨɨɨːឺ ɨəɨəឿ ɨwɨwូវ+ៅ ɨyɨj័យ+ៃ+េយ ooុ+ឧ oooːោ oə̆oə̆័+ា់ ohohុះ oə̆moə̆mាំ omomុំ ououូ+ឩ
ɔɔɔː ɔəɔə័រ+៌ uuុ+់+ឧ uuuːូ+ឩ uəuəួ uə̆uə̆់+់ uə̆huə̆hោះ+ស់ uhuhុះ umumម់+ុំ+ំ+ុម upupប់ rərə៌ rɨrɨឫ rɨɨrɨɨឬ lɨlɨឭ lɨɨlɨɨឮ
‍‍‌‌   ​ ** 0០1១2២3៣4៤5៥6៦7៧8៨9៩៛
b bʰ ccʰdɖ dʰ ɖʰ ʤ ʤʰ fg gʰ h ḥ ɦ jk kʰ lm ɱ nŋ ɳ ɲ p pʰ
rɽɽʰs ʂʃ tʈ ṯ tʰ ʈʰvwyy̌z aæeioɔurɨ̃
 
bbបpប+ព+ភ bppព+ប៉ chcʰឆ+ឈ ddដ+ត+ឌ dttត+ទffហ្វ gkក+គ hhហ+ស jcហ+ស kkʰខ+ឃkក+គ+ខ llល+ឡ mmម nnណ+ន ñɲញ ngŋង ppផ+ប៉+បpʰផ+ភ qʔអ+ក+គ+ខ+ឃ rrរ ssស ttʰថ+ធ+ឋឍtថ+ទ+ធ+ដ+ឋ+ឍ+តthtʰឋ+ឍ+ថ+ធ vwwវ wwវ yjយ រ+ហ
aːɔː្ ៌៉៊់៍៎៏័ៗ។ល។៚។៕៖៙
aaា់+ɑ់+័+ៈ aaaːា aeaeែ+ឯ ahahះ aiajៃ+ឰ amamាំ angaŋាំង aoaoោ+ឱ auawៅ aʉaəើ ayaj័យ eeិ+័ eeeeេeiេəjឦ ehehេះ+ិះ eiɨjៃəjឥ eyəjីɨj័យ ɛaeə̆ា់ ɛɛɛːែ ɛaheə̆hះeə̆ៈ ɛangeə̆ŋាំង
əəឹɨឹ əəəːើ əmumំ əʉɨwៅ iiិ iiiːីjយ iaiəា+ៀ ihihេះ+ិះ ikikច+ជ ooុ+u់ ooɔːouូoːោoឧ oaoə̆ា់ɔə៌oə̆័ɔə័រ oamoə̆mាំ ohohុះ omomុំ owəwឪ
ɔɔɑː+អ ɔhɑhោះ ɔmamំ uuុ uuuːូ uauəួ uahuə̆hោះ uhuhុះ umumុំ ʉʉəɨឺɨɨឺ ʉaɨəឿ rərɨឫ rʉʉrɨɨឬ ləlɨឭ lʉʉlɨɨឮ
‍‍‌‌   ​ ** 0០1១2២3៣4៤5៥6៦7៧8៨9៩៛
b bʰ ccʰdɖ dʰ ɖʰ ʤ ʤʰ fg gʰ h ḥ ɦ jk kʰ lm ɱ nŋ ɳ ɲ p pʰ
rɽɽʰs ʂʃ tʈ ṯ tʰ ʈʰvwyy̌z aæeioɔurɨ̃
shape1 shape2 shape3 shape4 shape5 shape6 shape7 shape8 shape9 shape10 shape11 shape12
កគតភឥ ពឰឭឮ៣ញញញ្ញណលសឍ៧ឈឦ្ធ្ញ ចថឋបឫឬមហឃយបាបោបៅ ឆធជផដងឯ ខឌឧឩឪឱឳន ទឡ្ឡ្ឡ រេវ៛អែៃោៅើឿៀបោបៅាីឺ៍័បា ាោៅ ំះៈ៖ ិីឹឺ់៉៍៎៏័៌៊ែៃើឿ ្ក្គ្ដ្ណ្ត្ព្ភ្ខ្ឌ្ឧុ៉៊ូ្ង្ញ្ថ្ល្ច្ទ្ធ្ន្ម ្អ្ហ្វ្ឆួ្ជ្ឋ្ផ្ឯ្ឫ្ឬ្ឡញឰឡ ្ឃ្ឈ្ឍ្ប្យ្ឡ្ស្រឡ
០៙១២៤៥៦៨៩ឲៗ។៕៚
្ ​ 
Click on characters above to create text in the box below, then copy & paste to your content.
Font list:
Custom font:
Size:
px
Rows:
Add codepoint:
Clear search results.Search for:
Normalise: NFC
Convert output to Normalization Form C. Convert output to Normalization Form D. Don't normalise output.

Notes:

Quick start
(You must have JavaScript enabled.) Choose a view (see below). Click on characters/shapes to insert text into the output field or use your keyboard for Latin characters, delete, etc. You can also add codepoints and escapes via the "Add codepoint" field (hit return to add to the output field).
Then cut & paste the result to your document, or use the tabs to get further information about the characters. You can also paste text into the output field to get information about it. Use the yellow box to set preferences or search (regular expressions allowed - for example, to find the letter GA surrounded by spaces in the name, enter \bga\b, or the short form :ga:).
About the chart
Includes all the characters in the Unicode Khmer and Khmer Symbols blocks (in the default panel).
All text is output in Unicode normalisation form NFC by default. You can change to NFD or no normalisation by clicking on the buttons in the yellow area. Note that normalization only takes place when you click on a character - text pasted into the box won't be normalised until you click on another character above, or click on a button in the yellow area. (Note: normalization is turned off for Han characters in this application.)
Alternative views
The following alternative views are available. You can start up directly in one of the views by appending the following to your URI: ?view=, followed by one of, respectively, default, shape, huffman, gilbert or fontgrid.
Default This view is likely to be most useful to people who are somewhat familiar with the alphabet and characters of Khmer. Characters are arranged to assist in input. Simple consonants are to the left in mostly alphabetic order. To their right are combining characters that follow the initial consonant, then subscript consonants, then vowels and other symbols. Independent vowels appear at the top, then combining vowel signs, then other combining marks. At the bottom are digits and the currency symbol, and various other symbols and punctuation. Clicking on the subscript characters produces a coeng sign followed by a consonant.
Click on the 'Advanced' arrow top right for rare and deprecated characters, as well as divination and lunar characters.
Shape This view is purely based around shape, and is therefore good when you don't know the script well at all, or for shapes you don't know. Characters are grouped and ordered by visual similarity, and include groups of characters that interact to form new shapes (this is not an exhaustive list of shapes in Khmer writing, but may help locate most ligatures and conjuncts you don't recognise)..
Each orange key near the top of the page represents a significant part of the shape of two or more characters; as you mouse over the keys, characters and combinations of characters that incorporate that shape are displayed below. Click on these characters to add them to the output. Within a group I attempted to put easily confusable characters close to each other.
The shapes below the grey line are a mixed bag of characters that didn't fit elsewhere.
A small orange plus sign to the right of a shape indicates that you will find similar shapes after the large plus sign to the right of the current line. These characters may cause confusion because they share elements, or because their shape may be similar, though not quite the same.
Transcription I use this for typing in text for which I have a transcription, or for creating phonetic transcriptions. Although the Khmer script is mostly phonetic if you know the rules behind it, there are quite a lot of rules. This transcription therefore aims to provide at least 80% of the work needed, and you may need to tweak the remainder.
The large characters on a grey background represent characters in two different transcription systems: one is that used by Huffman in Cambodian System of Writing, another is used by Gilbert and Hang in Cambodian for Beginners. (An attempt has been made to ensure that the phonetic transcription the produced by clicking on characters in both views is the same, but there will be small differences.) To type Khmer text starting from a transcription, click on these characters. If there is only one Khmer character corresponding to the transcription letter, it is inserted directly into the output field. If there are multiple alternatives, these are presented to you in a selection list: click on the Khmer character you need in the selection list and it is added to the output.
Each Khmer character is associated with a phonetic symbol (a Latin/IPA symbol on white background to its left in the selection lists). If there is more than one possible phonic representation you will see the selection list divided appropriately. As you select characters, the phonetic symbol to its left is stored. If you click on the Phonemes button, below the output area, these are all added to the output. This provides a quick way of generating a phonetic transcription from a Latin transcription. In some cases a Khmer character is repeated within the same selection list because it has more than one possible phonetic equivalent - in such cases, choose the right one if you want to generate this phonetic transcription.
In a small number of cases, you will need to click twice on the components that make up the sound (eg. when bantoc is used on the following consonant). These cases are indicated by a small red plus sign between two clickable shapes (one of which may be just a hyphen).
If you click on the boxes representing the inherent vowel, no Khmer character is added to the output, but the phonetic symbol is added to the buffered phoneme list.
Just above the output area there is a line of Latin characters. This represents the union of all transcription and phonetic characters, and is provided in case you wish to just type in a transcription directly.
For less common characters, switch to the Alphabetic view.
As you mouse over the Latin characters on the grey background, the corresponding SCRIPT characters are also displayed near the top of the page. This is to aid in searching, but you can also select characters from there.
Font grid Shows characters in Unicode order, using whatever font is specified in the Font list or Custom font input fields. This allows comparison of fonts (especially useful in IE, which shows if a glyph is missing from a font).
Special commands
Khmer>>IPA The transcription tool is provided as a means for me to generate phonetic transcriptions based on the rules in Franklin Huffman's Cambodian System of Writing. However, it needs some assistance from the user. This is because Khmer doesn't use spaces between words, and it is often ambiguous as to whether a consonant represents a syllable-final sound or a syllable in its own right. It also needs help to identify unstressed syllables. I don't have the means to do automatic word segmentation, so you will need to provide this information.
After the first syllable on the line, put a zero-width space or ordinary space before each consonant or independent vowel sign that begins a new syllable (not word). (Note that this may split consonant clusters. The Khmer text will look strange but still work.) You should also indicate unstressed syllables by following the syllable with a hyphen, rather than a space. For many bisyllabic words, this means putting a hyphen after the first of the two syllables. For example, converting ប្រកាន់និទៀន to ប្រ-កាន់ និ-ទៀន will produce the following transcription [prɑkannitiən]. Note that, if you don't know Khmer well enough to know when a syllable is unstressed, you can still get an approximation to the pronunciation using only spaces (zwsp or ordinary space). For instance, the previous example separated by spaces only will yield [prɑːkanniʔtiən].
If your system supports OpenType fonts, I recommend, for best results, that you install one of the following fonts for viewing the transcription: Doulos SIL, Charis SIL, Gentium. These exceptionally good, free fonts can be found by searching the Web.
Although the transcription is based on rules by Franklin Huffman in Cambodian System of Writing, some symbols are changed to be more recognizable to those familiar with IPA. While the transcription rules are quite detailed, and Khmer is largely regular, there are a few exceptions, particularly in words from Sanskrit or Pali, or ambiguities, for example in a few independent vowel signs, that cause problems for the transcription. The transcription is non-reversible. I created it to help me quickly reproduce (simple) phonetic alternatives for examples in my notes on Khmer.
Phonemes While you click on Khmer characters in the Phonic views, the picker automatically records in a buffer the associated phonemic character (ie. the nearest transciption character to the left of each character you click on). Clicking on this icon will dump those characters into the output area at the current cursor position, and clear the buffer. It is quite basic (for example, it doesn't take into account backspacing), but is offered as a way of speeding up text entry where you want to type both the Khmer characters and the phonemic transcription.
Hyphens are provided for the silent or inherent sounds (eg. 'o') to help produce these transcriptions. They produce no output in Khmer script, but the phonemic value is stored in the buffer.
Other features
For further information about features of the tool or user interface, see How to use..
Useful URIs
Downloadable TrueType and OpenType fonts: Wazu Japan, Alan Wood
Khmer script notes (my rough notes)
Khmer script description in Wikipedia
Khmer block in UniView
Other pickers
If something is missing
... let me know.
Copyright © 2006-2009, Richard Ishida. Last modified: 2010-01-09 16:07