<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>&#62;&#62; blog &#187; writings</title>
	<atom:link href="http://rishida.net/blog/?feed=rss2&#038;cat=3" rel="self" type="application/rss+xml" />
	<link>http://rishida.net/blog</link>
	<description>News of changes to my main site, and W3C related posts.</description>
	<lastBuildDate>Mon, 08 Feb 2010 09:36:36 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Bengali script notes updated</title>
		<link>http://rishida.net/blog/?p=374</link>
		<comments>http://rishida.net/blog/?p=374#comments</comments>
		<pubDate>Sat, 06 Feb 2010 09:34:42 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[script notes]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[writings]]></category>
		<category><![CDATA[bengali]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=374</guid>
		<description><![CDATA[

If you&#8217;re interested, I just did a major overhaul of my script notes on Bengali in Unicode. There&#8217;s a new section about which characters to use when there are multiple options (eg. RRA vs. DDA+nukta), and the page provides information about more characters from the Bengali block in Unicode (including those used in Bengali&#8217;s amazingly [...]]]></description>
			<content:encoded><![CDATA[<div>
<p style="float: right; width: 260px; margin-left: 1em; margin-bottom: 1em"><img src="http://rishida.net/blog/images/bengalinotes.png" alt="Characters in the Unicode Bengali block." /></p>
<p>If you&#8217;re interested, I just did a major overhaul of my <a href="http://rishida.net/scripts/bengali/">script notes on Bengali</a> in Unicode. There&#8217;s a new section about which characters to use when there are multiple options (eg. RRA vs. DDA+nukta), and the page provides information about more characters from the Bengali block in Unicode (including those used in Bengali&#8217;s amazingly complicated currency notation prior to 1957).</p>
<p>In addition, this has all  been squeezed into the latest look and feel for script notes pages.</p>
<p>The new page is at a new location.  There is a redirect on the old page.</p>
<p>Hope it&#8217;s useful.
</p></div>
<p style="float:left; font-size: 150%"><a href="http://rishida.net/scripts/bengali/">&gt;&gt; Read it</a></p>
<p><br clear="all" /></p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=374</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Myanmar (Burmese) script notes ready</title>
		<link>http://rishida.net/blog/?p=169</link>
		<comments>http://rishida.net/blog/?p=169#comments</comments>
		<pubDate>Mon, 06 Oct 2008 10:48:15 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[script notes]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[writings]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=169</guid>
		<description><![CDATA[&#62;&#62; Read it !

I finally got to the point, after many long early morning hours, where I felt I could remove the &#8216;Draft&#8217; from the heading of my Myanmar (Burmese) script notes.
This page is the result of my explorations into how the Myanmar script is used for the Burmese language in the context of the [...]]]></description>
			<content:encoded><![CDATA[<p style="font-size: 150%;"><a href="http://rishida.net/scripts/myanmar/">&gt;&gt; Read it !</a></p>
<p style="float: right; margin: 1px solid teal; margin-left: 1em; margin-bottom: 1em;"><a href="http://rishida.net/blog/images/myanmarpicker7.png"><img src="http://rishida.net/blog/images/myanmarexcerpt.png" alt="Picture of the page in action."/></a></p>
<p>I finally got to the point, after many long early morning hours, where I felt I could remove the &#8216;Draft&#8217; from the heading of my Myanmar (Burmese) script notes.</p>
<p>This page is the result of my explorations into how the Myanmar script is used for the Burmese language in the context of the Unicode Myanmar block. It takes into account the significant changes introduced in Unicode version 5.1 in April of this year.</p>
<p>Btw, if you have JavaScript running you can get a list of characters in the examples by mousing over them.  If you don&#8217;t have JS, you can link to the same information.</p>
<p>There&#8217;s also a <a href="http://rishida.net/scripts/myanmar/myanmar.pdf">PDF version</a>, if you don&#8217;t want to install the (free) fonts pointed to for the examples.</p>
<p>Here is a summary of the script:</p>
<div style="background-color: #F0FFF0; padding: 10px; padding-bottom: 5px; margin-bottom: 10px;">
<p>Myanmar is a tonal language and is syllable-based. The script is an abugida, ie. consonants carry an inherent vowel sound that is overridden using vowel signs.</p>
<p>Spaces are used to separate phrases, rather than words. Words can be separated with ZWSP to allow for easy wrapping of text.</p>
<p>Words are composed of syllables. These start with an consonant or initial vowel. An initial consonant may be followed by a medial consonant, which adds the sound j or w. After the vowel, a syllable may end with a nasalisation of the vowel or an unreleased glottal stop, though these final sounds can be represented by various different consonant symbols.</p>
<p>At the end of a syllable a final consonant usually has an &#8216;asat&#8217; sign above it, to show that there is no inherent vowel.</p>
<p>In multisyllabic words derived from an Indian language such as Pali, where two consonants occur internally with no intervening vowel, the consonants tend to be stacked vertically, and the asat sign is not used.</p>
<p>Text runs from left to right.</p>
<p>There are a set of Myanmar numerals, which are used just like Latin digits.
</p></div>
<p>So, what next.  I&#8217;m quite keen to get to Mongolian.  That looks really complicated. But I&#8217;ve been telling myself for a while that I ought to look at Malayalam or Tamil, so I think I&#8217;ll try Malayalam.</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=169</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Moving paper</title>
		<link>http://rishida.net/blog/?p=151</link>
		<comments>http://rishida.net/blog/?p=151#comments</comments>
		<pubDate>Mon, 07 Apr 2008 17:18:36 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[writings]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=151</guid>
		<description><![CDATA[I&#8217;m sitting here watching a video of Timbl talking on a BBC news page and I suddenly realised how good this was.
The page design helps give the impression &#8211; there are no clunky boxes around the video itself &#8211; but there&#8217;s also no need to view in a different area, or switch to another tool, [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m sitting here watching a video of Timbl talking on a <a href="http://news.bbc.co.uk/1/hi/technology/7299875.stm">BBC news page</a> and I suddenly realised how good this was.</p>
<p>The page design helps give the impression &#8211; there are no clunky boxes around the video itself &#8211; but there&#8217;s also no need to view in a different area, or switch to another tool, or even wait for a download to get started &#8211; it&#8217;s just there as part of the page, but a part that moves and produces sound.  Kind of like the moving paper in Harry Potter&#8217;s world.</p>
<p>It&#8217;s great how technology marches on sometimes.</p>
<p>[Update: Since I wrote the above the video has acquired grey panels around the edges for controls, which I think is a shame.  It's still pretty good technology though. ]</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=151</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Be flexible when referencing Unicode!</title>
		<link>http://rishida.net/blog/?p=135</link>
		<comments>http://rishida.net/blog/?p=135#comments</comments>
		<pubDate>Mon, 04 Feb 2008 19:59:53 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[writings]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=135</guid>
		<description><![CDATA[This post is about the dangers of tying a specification, protocol or application to a specific version of Unicode. 
For example, I was in a discussion last week about XML, and the problems caused by the fact that XML 1.0 is currently tied to a specific version of Unicode, and a very old version at [...]]]></description>
			<content:encoded><![CDATA[<p>This post is about the dangers of tying a specification, protocol or application to a specific version of Unicode. </p>
<p>For example, I was in a discussion last week about XML, and the problems caused by the fact that XML 1.0 is currently tied to a specific version of Unicode, and a very old version at that (2.0). This affects what characters you can use for things such as element and attribute names, enumerated lists for attribute values, and ids. Note that I&#8217;m <em>not</em> talking about the content, just those names. </p>
<p>I <a href="http://www.w3.org/2005/03/02-ishida-tech-plen/">spoke about this</a> at a W3C Technical Plenary some time back in terms of how this bars   people from using certain aspects of XML applications in their own language if they use scripts that have been added to Unicode since version 2.0. This includes over 150 million people speaking languages written with Ethiopic, Canadian Syllabics, Khmer, Sinhala, Mongolian, Yi, Philippine, New  Tai Lue, Buginese, Cherokee, Syloti Nagri, N&#8217;Ko,  Tifinagh and other scripts.</p>
<p>This means, for example, that if your language is written with one of these scripts, and you write some XHTML that you want to be valid (so you can use it with AJAX or XSLT, etc.), you can&#8217;t use the same language for an <a href="#ኢሺዳ" name="ኢሺዳ" id="ኢሺዳ">id attribute value</a> as for the content of your page. (Try <a href="http://validator.w3.org/check?verbose=1&#038;uri=http%3A%2F%2Frishida.net%2Fblog%2F">validating this page</a> now. The previous link used some Ethiopic for the name and id attribute values.) </p>
<p><strong>But there&#8217;s another issue</strong> that hasn&#8217;t received so much press &#8211; and yet I think, in it&#8217;s own way, it can be just as problematic. Scripts that were supported by Unicode 2.0 have not stood still, and additional characters are being added to such scripts with every new Unicode release. In some cases these characters will see very general use. Take for example,  the Bengali character U+09CE BENGALI LETTER KHANDA TA.</p>
<p>With the release of Unicode 4.1 this character was added to the standard, with a clear admonition that it should in future be used in text, rather than the workaround people had been using previously.</p>
<p>This is not a rarely used character. It is a common part of the alphabet. Put Bengali <a href="#উতসহ" name="উতসহ" id="উতসহ">in a link</a> and you&#8217;re generally ok. Include <a href="#কুৎসিত" name="কুৎসিত" id="কুৎসিত">a khanda ta letter</a> in it, though, and you&#8217;re in trouble. It&#8217;s as if English speakers could use any word in an id, as long as it didn&#8217;t have a &#8216;q&#8217; in it. It&#8217;s a recipe for confusion and frustration.</p>
<p>Similar, but much more far reaching, changes will be introduced to the Myanmar script (used for Burmese) in the upcoming version 5.1. Unlike the khanda ta, these changes will affect almost every word. So if your application or protocol froze its Unicode support to a version between 3.0 and 5.0, like IDNA, you will suddenly be disenfranchising Burmese users who had been perfectly happy until now.</p>
<p>Here are a few more examples (provided by Ken Whistler) of characters added to Unicode after the initial script adoption that will raise eyebrows for people who speak the relevant language: </p>
<ul>
<li>01F6 LATIN SMALL LETTER N WITH GRAVE: shows up in NFC pinyin data for Chinese.</li>
<li>0219 LATIN SMALL LETTER S WITH COMMA BELOW:  Romanian data.</li>
<li>0450 CYRILLIC SMALL LETTER IE WITH GRAVE: Macedonian in  NFC.</li>
<li>0653..0655 Arabic combining maddah and hamza:  Implicated in  NFC normalization of common Arabic letters now.</li>
<li>0972 DEVANAGARI LETTER CANDRA A: Marathi.</li>
<li>097B DEVANAGARI LETTER GGA: Sindhi.</li>
<li>0B35 ORIYA LETTER VA:  Oriya.</li>
<li>0BB6 TAMIL LETTER SHA:  Needed to  spell sri.</li>
<li>0D7A..0D7F   Malayalam chillu letters:  Those will be  ubiquitous in Malayalam data, post Unicode 5.1.</li>
<li>and a bunch of Chinese additions. </li>
</ul>
<p>So the moral is this: decouple your application, protocol or specification from a specific version of the Unicode Standard. Allow new characters to be used by people as they come along, and users all around the world will thank you.</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=135</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Negotiating the Iron Curtain</title>
		<link>http://rishida.net/blog/?p=129</link>
		<comments>http://rishida.net/blog/?p=129#comments</comments>
		<pubDate>Thu, 20 Dec 2007 18:51:31 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[writings]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=129</guid>
		<description><![CDATA[I just read a post by Ivan Herman about how Hungary has joined the Schengen Agreement, and will soon be removing border controls on the EU side.  That put me in mind of the first time I tried to pass through the Iron Curtain.
I was travelling from Vienna to Budapest (probably about 25 years [...]]]></description>
			<content:encoded><![CDATA[<p>I just read a <a href="http://ivanherman.wordpress.com/2007/12/20/schengen-non%e2%80%93borders/">post by Ivan Herman</a> about how Hungary has joined the Schengen Agreement, and will soon be removing border controls on the EU side.  That put me in mind of the first time I tried to pass through the Iron Curtain.</p>
<p>I was travelling from Vienna to Budapest (probably about 25 years ago) and I had decided to go through <a href="http://maps.google.com/maps?q=sopron">Sopron</a>, rather than Hegyeshalom, so I could see something a little more off the beaten track. This was a month-long InterRail trip, so I was able to follow my whim and jump on whatever train I wanted. The train connections worked, and I found myself heading south from Vienna.</p>
<p>Eventually, the train passed into Hungary and stopped. I needed a visa, so I got off with a bunch of other (Hungarian looking) people, and traipsed over to a small outbuilding, where I found myself at the back of a queue of people jostling bags of various sizes and dressed and coiffed in what looked to me to be a very Eastern European fashion.  Looking out of the window, everything was grey. I could see rail tracks and points and small, grey buildings but also several very tall towers with machine gun nests perched on top (quite large looking machine guns). The queue moved slowly, and I was surprised at one point to see my train pulling away and disappearing.  It seemed a bit odd (and I was glad I&#8217;d brought all my stuff with me), but I figured this was probably normal, and I&#8217;d just have to catch another train.</p>
<p>I finally arrived at the desk and asked for a visa. The guy behind the desk started talking to me in a somewhat animated fashion, but I had no idea what he was saying. I hadn&#8217;t learned German yet, and Hungarian was completely incomprehensible to me.  I kept trying to explain, politely, in English, that I needed a visa.  Finally, he gave me an exasperated look and called someone out of a nearby room.  The guy who emerged was huge, bald and intimidatingly business-like. (Some time later I saw the film Midnight Express, and realised that the prison guard and he could have been the same person.) He shouted at me &#8220;Nicht visa!&#8221;.  And I tried to explain, in English, that, yes, I had no visa, but would like to obtain one, please. This didn&#8217;t appear to get across clearly, because he simply repeated &#8220;Nicht visa!!&#8221; several times, increasing in volume.</p>
<p>Finally, the tension broke and gave way to action.  He motioned for me to follow him out of the building, and we started walking away across a couple of sets of railway tracks.  I noticed, feeling slightly less at ease but still hopeful, that I was flanked by a soldier with a gun on either side. They weren&#8217;t exactly giving me encouraging looks, and as I glanced up at the machine gun towers and at the surrounding barbed wire, I began to wish I knew what was happening.</p>
<p>Soon we arrived at the end of a short train. The very last carriage of this train looked like something you&#8217;d expect to see in a Wild West film.  It had a kind of standing area at each end with a railing, a door into the carriage and steps leading down to the ground on either side.  I was ushered up one set of steps and into what turned out to be an empty carriage. The door was shut behind me, and within a minute or so, as I remember it, the train started moving off, in the same direction my earlier train had disappeared. So I wasn&#8217;t just being sent back across the border. </p>
<p>That last realisation started to trouble me a little, since I still had no visa and no idea what was happening.  It didn&#8217;t help that there was a small round window in the door at each end of the carriage, through which I could see  guards sitting on the steps at each corner, all holding machine guns at the ready.  As the towers slid away behind us, night started to fall.</p>
<p>Twenty five years has dulled the memory of some of what happened next, but eventually I got off at a small station, having reached the end of the line. The guards were gone, and the station turned out to be quite modern and clean looking. I still couldn&#8217;t understand anything anyone was saying, so I still had no idea where I was, but I was able figure out that I was somehow back in Austria.  It was much later that I was to realise that Sopron is on a peninsular that sticks into Austria, and I had come in one side and been sent out the other.</p>
<p>I slept that night on the floor of the main station building, and the next morning set off to find someone who spoke English and could tell me where I was &#8211; and just as importantly how to get into Hungary.  The town was quite small, maybe just a village.  In spite of that it took me a while, but I eventually came across a chap in a supermarket who was able to explain to me that visas are not issued on entry into Hungary by train via Sopron.  I was ahead of him there.  He also offered to drive me to the border, telling me that I would be able to get a visa at the road entry point.</p>
<p>It&#8217;s nice to think about that person whenever I relive this story.  He really went out of his way, leaving work to assist a complete stranger, with no fuss or thought for reward.  I wonder whether he remembers me.  I doubt it. Of course, these days he may even be reading this blog post&#8230;</p>
<p>So it was that, eventually, I got the stamp in my passport that I needed, and somehow found my way onto another train heading for Budapest.  Well, it wasn&#8217;t quite the end of the fun.  That continued when I tried to meet up with my father in the capital.  But that, as they say, is another story&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=129</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>No break space issue in Firefox</title>
		<link>http://rishida.net/blog/?p=123</link>
		<comments>http://rishida.net/blog/?p=123#comments</comments>
		<pubDate>Thu, 29 Nov 2007 10:03:35 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[writings]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=123</guid>
		<description><![CDATA[Tim Greenwood just pointed out to me a &#8216;bug&#8217; in my converter program, which I think is actually, in my mind, a bug in Firefox (although I imagine it was implemented by someone as a feature).
If you type A0 (the hex code for a non-breaking space) in the Hexadecimal code points field, then press Convert, [...]]]></description>
			<content:encoded><![CDATA[<p>Tim Greenwood just pointed out to me a &#8216;bug&#8217; in my <a href="http://rishida.net/scripts/uniview/conversion.php?codepoints=00A0&#038;origin=codepoint">converter program</a>, which I think is actually, in my mind, a bug in Firefox (although I imagine it was implemented by someone as a feature).</p>
<p>If you type A0 (the hex code for a non-breaking space) in the Hexadecimal code points field, then press Convert, you will get a blank space in the Characters field that should be U+00A0 NO-BREAK SPACE.  Then press Convert or View Names above this Characters field and you&#8217;ll find that what was supposed to be a NBSP has changed into an ordinary space.  IE7, Opera and Safari all continue to show the character in the field as a NBSP.  </p>
<p>(However, all four browsers substitute an ordinary space when you copy and paste the text from the Characters field into something else.)</p>
<p>I tried this with <a href="http://rishida.net/scripts/uniview/?codepoints=2002%202003%202006%202008%202009%20200B">a range of other types of space</a> , but had no such behaviour (<a href="http://rishida.net/scripts/uniview/conversion.php?codepoints=2002%202003%202006%202008%202009%20200B&#038;origin=codepoint">try it</a>).  They all remained themselves.  </p>
<p>Anyone know what this is about?</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=123</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Web Standards support day</title>
		<link>http://rishida.net/blog/?p=122</link>
		<comments>http://rishida.net/blog/?p=122#comments</comments>
		<pubDate>Mon, 26 Nov 2007 17:54:01 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[writings]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=122</guid>
		<description><![CDATA[
 
 
 
  Blue Beanie Day
  
  Originally uploaded by r12a
 

Monday, November 26, 2007 is the day thousands of Standardistas (people who support web standards) will wear a Blue Beanie to show their support for accessible, semantic, and hopefully internationalized web content.
I haven&#8217;t got a blue hat, so I cheated [...]]]></description>
			<content:encoded><![CDATA[<div style="float: right; margin-left: 10px; margin-bottom: 10px;">
 <a href="http://www.flickr.com/photos/ishida/2066406474/" title="photo sharing"><img src="http://farm3.static.flickr.com/2182/2066406474_00e6cb1290_m.jpg" alt="" style="border: solid 2px #000000;" /></a><br />
 <br />
 <span style="font-size: 0.9em; margin-top: 0px;"><br />
  <a href="http://www.flickr.com/photos/ishida/2066406474/">Blue Beanie Day</a><br />
  <br />
  Originally uploaded by <a href="http://www.flickr.com/people/ishida/">r12a</a><br />
 </span>
</div>
<p>Monday, November 26, 2007 is the day thousands of Standardistas (people who support web standards) will wear a Blue Beanie to show their support for accessible, semantic, and hopefully internationalized web content.</p>
<p>I haven&#8217;t got a blue hat, so I cheated a little by borrowing bits of the cover of Jeffrey Zeldman&#8217;s great book, &#8220;Designing with Web Standards&#8221;. That&#8217;s me under the hat though.</p>
<p>(If you&#8217;re wondering, the text on the left says the same as the text top right, in Arabic, Urdu, Inuktitut, Simplified Chinese, Traditional Chinese, Khazakh, Greek, Dzonkha, Ethiopian, Hebrew, Hindi, Nepali, Japanese, Korean, Hungarian, Punjabi, Thai and Venda.)</p>
<p>See the <a href="http://www.flickr.com/groups/bluebeanieday2007/pool/">Flickr pool</a>.<br />
<br clear="all" /></p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=122</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Myanmar is changing!</title>
		<link>http://rishida.net/blog/?p=120</link>
		<comments>http://rishida.net/blog/?p=120#comments</comments>
		<pubDate>Mon, 19 Nov 2007 20:48:49 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[writings]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=120</guid>
		<description><![CDATA[
The word Mandalay in Myanmar script.
I&#8217;ve been brushing up on the Myanmar script, since major changes are on the way with Unicode 5.1.  
I upgraded my myanmar picker to handle the new characters, and I edited my notes on how the script works.
The new characters will make a big difference to how you author [...]]]></description>
			<content:encoded><![CDATA[<p style="float: right; width: 280px; margin-left: 1em; margin-bottom: 1em"><a href="http://people.w3.org/rishida/blog/images/mandalay.gif"><img src="http://people.w3.org/rishida/blog/images/mandalay.gif" alt=" " /></a><br />
<span class="caption">The word Mandalay in Myanmar script.</span></p>
<p>I&#8217;ve been brushing up on the Myanmar script, since major changes are on the way with Unicode 5.1.  </p>
<p>I upgraded my <a href="http://rishida.net/scripts/pickers/myanmar/">myanmar picker</a> to handle the new characters, and I edited <a href="http://rishida.net/scripts/scriptnotes/myanmar/">my notes on how the script works</a>.</p>
<p>The new characters will make a big difference to how you author text in Unicode, and people will need to update currently existing pages to bring them in line with the new approach.  The changes should make it <em>much</em> easier to create content in Burmese, in addition to addressing some niggly problems with making the script work correctly.  One reason the changes were sanctioned is that there is currently very little Burmese content out there in Unicode.</p>
<p>I&#8217;ll be updating my <a href="http://rishida.net/scripts/scriptnotes/myanmar/mymrchars">character by character notes</a> later too.</p>
<p>The only problem with all this is that existing fonts will all need to be changed to support the new world order (or myanmar order).  I found one font that is already 5.1 ready from the <a href="http://www.myanmarnlp.net.mm/opentype.htm">Myanmar Unicode &amp; NLP Research Center</a>.  So if you don&#8217;t want to download that font, you&#8217;ll need to read the <a href="http://rishida.net/scripts/scriptnotes/myanmar/myanmar.pdf">PDF version</a> of my notes on the script.  </p>
<p>That would be a pity, however, since i had some fun adding javascript to the article today, so that it displays a breakdown, character by character, of each example as you mouse over it (using images, so you see it properly).</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=120</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Who needs vowels, anyway?</title>
		<link>http://rishida.net/blog/?p=116</link>
		<comments>http://rishida.net/blog/?p=116#comments</comments>
		<pubDate>Thu, 04 Oct 2007 11:56:21 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[writings]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=116</guid>
		<description><![CDATA[I&#8217;m at the ITS face-to-face meeting in Prague, Czech Republic and I&#8217;ve been trying to learn to read Czech words.  Jirka Kosek showed me a Czech tongue-twister last night at dinner.
Strč prst skrz krk.
How amazing is that?  A whole sentence without vowels! (Means &#8220;Put your finger down your throat.&#8221; &#8211; I&#8217;m wondering whether [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m at the ITS face-to-face meeting in Prague, Czech Republic and I&#8217;ve been trying to learn to read Czech words.  Jirka Kosek showed me a Czech tongue-twister last night at dinner.</p>
<blockquote><p><span style="font-size: 200%">Strč prst skrz krk.</span></p></blockquote>
<p>How amazing is that?  A whole sentence without vowels! (Means &#8220;Put your finger down your throat.&#8221; &#8211; I&#8217;m wondering whether that has something to do with the missing vowels&#8230;)</p>
<p>See a <a href="http://rishida.net/photos/videoviewer.php?id=5301241091010446014&amp;caption=Strč%20prst%20skrz%20krk">video of Jirka</a> pronouncing it.</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=116</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Viewing non-Latin text in XMetal</title>
		<link>http://rishida.net/blog/?p=112</link>
		<comments>http://rishida.net/blog/?p=112#comments</comments>
		<pubDate>Fri, 24 Aug 2007 10:07:47 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[writings]]></category>

		<guid isPermaLink="false">http://people.w3.org/rishida/blog/?p=112</guid>
		<description><![CDATA[
Multiple scripts in XMetal&#8217;s tags-on view (click to enlarge).
I received a query from someone asking:
I try to edit lao and thai text with XMetal 5.0, but nothing is displayed but squares. In fact, Unicode characters seems to be correctly saved in the XML file and displayed in Firefox (for example), but i can&#8217;t get a [...]]]></description>
			<content:encoded><![CDATA[<p style="float: right; width: 260px; margin-left: 1em; margin-bottom: 1em"><a href="http://people.w3.org/rishida/blog/images/xmetal-fonts.gif"><img src="http://people.w3.org/rishida/blog/images/xmetal-fonts-small.gif" alt=" " /></a><br />
<span class="caption">Multiple scripts in XMetal&#8217;s tags-on view (click to enlarge).</span></p>
<p>I received a query from someone asking:</p>
<blockquote><p>I try to edit lao and thai text with XMetal 5.0, but nothing is displayed but squares. In fact, Unicode characters seems to be correctly saved in the XML file and displayed in Firefox (for example), but i can&#8217;t get a correct display in XMetal. Is it a font problem ?</p></blockquote>
<p>There are two places this needs to be addressed:</p>
<ol>
<li>in the plain text view</li>
<li>in the tags-on view</li>
</ol>
<p>For the plain text view, it is a question of setting a font that shows Lao and Thai (or whatever other language/script you need) in Tools&gt;Options&gt;Plain Text View&gt;Font.  You can only set one font at a time, so a wide ranging Unicode font like Arial Unicode MS or Code2000 may be useful for Windows users.</p>
<p>For the tags-on view (which is the view I use most of the time) you need to edit the CSS file that sets the editor&#8217;s styling for the DOCTYPE you are working with.  This may be in one of a number of places.  The one I edit is C:\Program Files\Blast Radius\XMetaL 4.6\Author\Display\xhtml1-transitional.css.</p>
<p>I added the following to mine.  I chose fonts I have on my PC and sets font sizes relative to the size I set for my body element.  You should, of course, choose your own fonts and sizes.</p>
<blockquote>
<pre>
[lang="am"] { font-family: "Code2000", serif; font-size: 120%; }
[lang="ar"] {font-family: "Traditional Arabic", sans-serif; font-size: 200%; }
[lang="bn"] {font-family: SolaimanLipi, sans-serif; font-size: 200%; }
[lang="dz"] { font-family: "Tibetan Machine Uni", serif; font-size: 140%; }
[lang="he"] {font-family: "Arial Unicode MS", sans-serif; font-size: 120%;}
[lang="hi"] {font-family: Mangal, sans-serif;  font-size: 120%;}
[lang="kk"] {font-family: "Arial Unicode MS", sans-serif;  }
[lang="iu"] {font-family: Pigiarniq, Uqammaq, sans-serif; font-size: 120%; }
[lang="ko"] { font-family: Batang, sans-serif; font-size: 120%;}
[lang="ne"] {font-family: Mangal, sans-serif;  font-size: 120%; }
[lang="pa"] { font-family: Raavi, sans-serif; font-size: 120%;}
[lang="te"] {font-family: Gautami, sans-serif; font-size: 140%;}
[lang="my"] {font-family: Myanmar1, sans-serif; font-size: 200%;}
[lang="th"] {font-family: "Cordia New", sans-serif; font-size: 200%; }
[lang="ur"] { font-family: "Nafees Nastaleeq", serif; font-size: 130%;}
[lang="ve"] { font-family: "Arial Unicode MS", sans-serif; }
[lang="zh-Hans"] { font-family: "Simsun", sans-serif; font-size: 140%; }
[lang="zh-Hant"] { font-family: "Mingliu", sans-serif; font-size: 140%; }</pre>
</blockquote>
<p>Note that I would have preferred to say :lang(am) { font-family&#8230; } etc, but XMetal 4.6 seems to require you to specify the attribute value as shown above.  (You also have to specify class selectors as [class="myclass"] {&#8230;} rather than .myclass {&#8230;}.)</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=112</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
