<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>&#62;&#62; blog &#187; Search Results  &#187;  uniview</title>
	<atom:link href="http://rishida.net/blog/?s=uniview&#038;feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://rishida.net/blog</link>
	<description>News of changes to my main site, and W3C related posts.</description>
	<lastBuildDate>Sun, 22 Aug 2010 09:53:41 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>String analyser and conversion apps updated</title>
		<link>http://rishida.net/blog/?p=460</link>
		<comments>http://rishida.net/blog/?p=460#comments</comments>
		<pubDate>Sun, 22 Aug 2010 09:23:24 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[converter tool]]></category>
		<category><![CDATA[string analyser]]></category>
		<category><![CDATA[unicode]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=460</guid>
		<description><![CDATA[Analyser: http://rishida.net/tools/analysestring/
Converter: http://rishida.net/tools/conversion/
The string analyser tool provides information about the characters in a string. One difference in this version is a new section &#8220;Data input as graphics&#8221;, where you see a horizontal sequence of graphics for each of the characters in the string you are analysing. This can be useful to get a screen snap [...]]]></description>
			<content:encoded><![CDATA[<p>Analyser: <a href="http://rishida.net/tools/analysestring/">http://rishida.net/tools/analysestring/</a></p>
<p>Converter: <a href="http://rishida.net/tools/conversion/">http://rishida.net/tools/conversion/</a></p>
<p>The <b class"noticeme">string analyser tool</b> provides information about the characters in a string. One difference in this version is a new section &#8220;Data input as graphics&#8221;, where you see a horizontal sequence of graphics for each of the characters in the string you are analysing. This can be useful to get a screen snap of the characters.  Of course, there is no combining or ligaturing behaviour involved &#8211; just a graphic per character.</p>
<p>You can reverse the character order for right-to-left scripts.</p>
<p>Another difference is that you can explode example text in the notes.  Take <a href="http://rishida.net/tools/analysestring/index.php?list=%D8%A2">this example</a>: if you click on the Arabic word for Koran (red word near the bottom of the notes), you&#8217;ll see a pop-up window in the bottom right corner of the window that lists the characters in that word.</p>
<p>The other change is that the former &#8220;Related links&#8221; section in the sidebar is now called &#8220;Do more&#8221;, and the links carry the string you are analysing to the Converter or UniView apps.</p>
<p>Oh, and the page now remembers the options you set between refreshes, which makes life much easier.</p>
<p>The <b class="noticeme">converter</b> tool converts between characters and various escaped character formats.  It was changed so that the &#8220;View names&#8221; button sends the characters to the string analyser tool.  This means that you&#8217;ll now see graphics for the characters, and that, once on the string analyser page, you can change the amount of information displayed for each character (including showing font-based characters, if you need to).</p>
<p>I also fixed a bug related to the UTF-8 and UTF-16 input.  Including spaces after the code values no longer fires off a bug.</p>
<p><i class="footnote">PS: The string analyser tool has graphics for all new Unicode 6.0 characters, however I haven&#8217;t updated the data for those characters yet.  I was planning to do so with the next release of UniView, which should be in October, when the final Unicode database is available.</i></p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=460</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>UniView lite available and an update to the full version</title>
		<link>http://rishida.net/blog/?p=419</link>
		<comments>http://rishida.net/blog/?p=419#comments</comments>
		<pubDate>Tue, 18 May 2010 06:13:10 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[character]]></category>
		<category><![CDATA[unicode]]></category>
		<category><![CDATA[uniview]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=419</guid>
		<description><![CDATA[

&#62;&#62; Use UniView lite
&#62;&#62; Use UniView
About the tool: Look up and see characters (using graphics or fonts) and property information, view whole character blocks or custom ranges, select characters to paste into your document, paste in and discover unknown characters, search for characters, do hex/dec/ncr conversions, highlight character types, etc. etc. Supports Unicode 5.2 and [...]]]></description>
			<content:encoded><![CDATA[<div>
<p style="float: right; width: 310px; margin-left: 1em; margin-bottom: 1em"><a href="http://rishida.net/blog/images/univiewlite-large.jpg"><img src="http://rishida.net/blog/images/univiewlite-small.jpg" alt="Picture of the page in action." /></a></p>
<p style="font-size: 150%"><a href="http://rishida.net/scripts/uniview/lite">&gt;&gt; Use UniView lite</a></p>
<p style="font-size: 150%"><a href="http://rishida.net/scripts/uniview/">&gt;&gt; Use UniView</a></p>
<p><strong>About the tool:</strong> Look up and see characters (using graphics or fonts) and property information, view whole character blocks or custom ranges, select characters to paste into your document, paste in and discover unknown characters, search for characters, do hex/dec/ncr conversions, highlight character types, etc. etc. Supports Unicode 5.2 and written with Web Standards to work on a variety of browsers. No need to install anything.</p>
<p><strong>Latest changes:</strong> The major change in this update is the addition of an alternative UniView lite interface for the tool that makes it easier to use UniView in restricted screen sizes, such as on mobile devices. The lite interface offers a subset of the functionality provided in the full version, rearranges the user interface and sets up some different defaults (eg. list view is the default, rather than the matrix view). However, the underlying code is the same &#8211; only the initial markup and the CSS are different.</p>
<p>Another significant change is that when you click on a character in a list or matrix that character is either added to the text area or detailed information for that character is displayed, but not now both at the same time. You switch between the two possibilities by clicking on the <img src="/rishida/scripts/uniview/images/pickeroff-long.png" title="Toggle to copy characters to the character area or not"/> icon. When the background is white (default) details are shown for the character. When the background is orange <img src="/rishida/scripts/uniview/images/pickeron-long.png" title="Toggle to copy characters to the character area or not"/> the character will be added to the text area (like a character map or picker).</p>
<p>Information from my character database is now shown by default when you are shown detailed information for a character. The switch to disable this has been moved to the <span class="onscreen">Options</span> panel.</p>
<p style="">Text highlighted in red in information from the character database contains examples. In case you don&#8217;t have a font for viewing such examples, or in case you just want to better understand the component characters, you can now click on these and the component characters will be listed in a new window (using the <a href="/rishida/tools/analysestring/">String Analyzer</a> tool).</p>
<p>Access to <span class="onscreen">Settings</span> panel has been moved slightly downwards and renamed <span class="onscreen">Options</span> in the full version.</p>
<p>The default order for items in lists is now &lt;character&gt;&lt;codepoint&gt;&lt;name&gt;, rather than the previous &lt;codepoint&gt;&lt;character&gt;&lt;name&gt;. This can still be changed in the <span class="onscreen">Options</span> panel, or by setting query parameters.</p>
<p>I changed the <span class="onscreen">Next</span> and <span class="onscreen">Previous</span> functions in the character detail pane so that it moves one codepoint at a time through the Unicode encoding space. The controls are now buttons rather than images.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=419</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>UniView updated: Show age, context-sensitive help, highlighting, etc.</title>
		<link>http://rishida.net/blog/?p=385</link>
		<comments>http://rishida.net/blog/?p=385#comments</comments>
		<pubDate>Mon, 08 Feb 2010 09:33:10 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[unicode]]></category>
		<category><![CDATA[uniview]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=385</guid>
		<description><![CDATA[

About the tool: Look up and see characters (using graphics or fonts) and property information, view whole character blocks or custom ranges, select characters to paste into your document, paste in and discover unknown characters, search for characters, do hex/dec/ncr conversions, highlight character types, etc. etc. Supports Unicode 5.2 and written with Web Standards to [...]]]></description>
			<content:encoded><![CDATA[<div>
<p style="float: right; width: 260px; margin-left: 1em; margin-bottom: 1em"><a href="http://rishida.net/blog/images/uniview520a.png"><img src="http://rishida.net/blog/images/uniview520a-small.png" alt="Picture of the page in action." /></a></p>
<p><strong>About the tool:</strong> Look up and see characters (using graphics or fonts) and property information, view whole character blocks or custom ranges, select characters to paste into your document, paste in and discover unknown characters, search for characters, do hex/dec/ncr conversions, highlight character types, etc. etc. Supports Unicode 5.2 and written with Web Standards to work on a variety of browsers. No need to install anything.</p>
<p><strong>Latest changes:</strong> The major change in this update is the addition of a function, <span class="onscreen">Show age</span>, to show the version of Unicode where a character was added (after version 1.1). The same information is also listed in the details given for a character in the lower right panel.</p>
<p>The trigger for context-sensitive help was reduced to the first character of a command name, rather than the whole command name. This improves behaviour for commands under <span class="onscreen">More actions</span> by allowing you to click on the command name rather than just the icon alongside to activate the command. </p>
<p>Some &#8216;quick start&#8217; instructions were also added to the initial display to orient people new to the tool, and this help text was updated in various areas.</p>
<p style="">The highlighting mechanism was changed. Rather than highlight characters using a coloured border (which is typically not very visible), highlighting now works by greying out characters that are not highlighted. This also makes it clearer when nothing is highlighted.</p>
<p style="">In the recent past, when you converted a matrix to a list in the lower left panel, greyed-out rows would be added for non-characters. These are no longer displayed. Consequently, the command to remove such rows from the list (previously under <span class="onscreen">More actions</span>) has been removed.</p>
<p>A lot of invisible work went into replacing style attributes in the code with class names. This produces better source code, but doesn&#8217;t affect the user experience.</p>
</div>
<p style="float:left; font-size: 150%"><a href="http://rishida.net/scripts/uniview/">&gt;&gt; Use it</a></p>
<p><br clear="all" /></p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=385</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New web app: String analyser</title>
		<link>http://rishida.net/blog/?p=343</link>
		<comments>http://rishida.net/blog/?p=343#comments</comments>
		<pubDate>Mon, 04 Jan 2010 19:50:51 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=343</guid>
		<description><![CDATA[

About the tool: This tool shows you what characters are in a string of Unicode characters, and gives you informaiton about each one.  Either type/paste the string into the box on the right of the page, or send it in the URL.  It&#8217;s especially useful if you have no font for the text, [...]]]></description>
			<content:encoded><![CDATA[<div>
<p style="float: right; width: 260px; margin-left: 1em; margin-bottom: 1em"><a href="http://rishida.net/blog/images/stringanalyser.png"><img src="http://rishida.net/blog/images/stringanalyser-small.png" alt="Picture of the page in action." /></a></p>
<p><strong>About the tool:</strong> This tool shows you what characters are in a string of Unicode characters, and gives you informaiton about each one.  Either type/paste the string into the box on the right of the page, or send it in the URL.  It&#8217;s especially useful if you have no font for the text, or you are trying to unravel a sequence of characters in a complex script, but also allows you to just dig out information about one or more characters.</p>
<p>Here&#8217;s <a href="http://rishida.net/tools/analysestring/index.php?list=%E0%B8%B3%D7%9A%DB%93%E1%80%94%E1%AC%92%E1%AC%84%F0%90%8C%B0%F0%93%80%80%EA%AA%90">an example</a></p>
<p>By default you see a large graphic image of each character, the Unicode code point number and name, the Unicode script block in which it occurs, any annotations in the Unicode Standard, and any notes for that character in my character database (which I also updated today with information about Hebrew, Malayalam, Lisu and other scripts).</p>
<p>However, the result can be tailored in terms of the level of information and various aspects of the presentation.  Simply click on the options to the right of the page, or (again) include the relevant info in the URI.</p>
<p>For example, you can <a href="http://rishida.net/tools/analysestring/index.php?list=%E0%B8%B3%D7%9A%DB%93%E1%80%94%E1%AC%92%E1%AC%84%F0%90%8C%B0%F0%93%80%80%EA%AA%90&#038;nographics=on&amp;compact=on&amp;noblock=on&amp;nounicode=on&amp;nonotes=on">remove any of these items of information individually</a> (except the codepoint and name), or <a href="http://rishida.net/tools/analysestring/index.php?list=%E0%B8%B3%D7%9A%DB%93%E1%80%94%E1%AC%92%E1%AC%84%F0%90%8C%B0%F0%93%80%80%EA%AA%90&amp;nographics=on&amp;compact=on&amp;character=on&amp;noblock=on&amp;nounicode=on&amp;nonotes=on">add a text version of the character</a>.  You can also <a href="http://rishida.net/tools/analysestring/index.php?list=%E0%B8%B3%D7%9A%DB%93%E1%80%94%E1%AC%92%E1%AC%84%F0%90%8C%B0%F0%93%80%80%EA%AA%90&amp;smallgraphics=on&amp;compact=on&amp;nonotes=on">choose a smaller graphic</a>.</p>
<p>In addition, notes from my character database contain examples (coloured red).  By clicking on these examples you can list the characters in the example text without leaving the page. The list of characters shows up in the right margin.</p>
<p>Oh, and you can click on links to see a character in UniView (to explore its Unicode properties) or to show the whole block in which the character lives.</p>
<p>You&#8217;ll shortly see my other applications such as pickers, UniView, etc, linking to this app.</p>
<p>Hope it&#8217;s useful.
</p></div>
<p style="float:left; font-size: 150%"><a href="http://rishida.net/tools/analysestring/">&gt;&gt; Use it</a></p>
<p><br clear="all" /></p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=343</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>UniView 5.2.0 available</title>
		<link>http://rishida.net/blog/?p=326</link>
		<comments>http://rishida.net/blog/?p=326#comments</comments>
		<pubDate>Sat, 05 Dec 2009 11:31:28 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=326</guid>
		<description><![CDATA[&#62;&#62; See what it can do
&#62;&#62; Use it

It took me a while to find the time, but I have finally upgraded UniView to suport the final 5.2 release of Unicode, plus a few extra features.
The order of blocks listed in the top left pulldown menu was changed to ressemble the order in the Unicode Charts [...]]]></description>
			<content:encoded><![CDATA[<p style="font-size: 110%"><a href="http://rishida.net/scripts/uniview/help">&gt;&gt; See what it can do</a></p>
<p style="font-size: 150%"><a href="http://rishida.net/scripts/uniview/">&gt;&gt; Use it</a></p>
<p style="float: right; width: 290px; margin-left: 1em; margin-bottom: 1em"><a href="http://rishida.net/blog/images/uniview520.png"><img src="http://rishida.net/blog/images/uniview520-small.png" alt="Picture of a part of the page." /></a></p>
<p>It took me a while to find the time, but I have finally upgraded UniView to suport the final 5.2 release of Unicode, plus a few extra features.</p>
<p>The order of blocks listed in the top left pulldown menu was changed to ressemble the order in the <a href="http://www.unicode.org/charts">Unicode Charts page</a>. Several sub-block selections were also added to the list (as in the Unicode page), and are displayed in italics.</p>
<p>When you display details of a character in the right panel, the heading <code>Script group</code> has now been used to indicate the sub-block-level headings in the block listings of the Unicode Standard. The link to the Unicode block now follows the heading <code>Unicode block</code>. These sub-block-level headings are also shown when you display a range as a list (as opposed to a matrix).</p>
<p>When you mouse over characters displayed in a matrix, the codepoint and name information for that character now appear just above the matrix. This makes it much easier to locate characters you are looking for.</p>
<p>Finally, but by no means least, small and large graphics are now available for all 1071 Egyptian Hieroglyph characters. This was the last block for which graphics were completely unavailable.</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=326</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Further developments to Language Tag Lookup tool</title>
		<link>http://rishida.net/blog/?p=304</link>
		<comments>http://rishida.net/blog/?p=304#comments</comments>
		<pubDate>Wed, 19 Aug 2009 14:45:38 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=304</guid>
		<description><![CDATA[&#62;&#62; Use it

I have added a bunch of additional new features to my lookup tool to help with choosing language tags. There is additional information available when you look up subtags (such as what to use if the subtag is deprecated, and what subtags macrolanguages enclose, etc.), and more tests on well-formedness with clearer explanations [...]]]></description>
			<content:encoded><![CDATA[<p style="font-size: 150%"><a href="http://rishida.net/utils/subtags/">&gt;&gt; Use it</a></p>
<p style="float: right; width: 260px; margin-left: 1em; margin-bottom: 1em"><a href="http://rishida.net/blog/images/subtags-0908b.png"><img src="http://rishida.net/blog/images/subtags-0908b-small.png" alt="Picture of the page in action." /></a></p>
<p>I have added a bunch of additional new features to my lookup tool to help with choosing language tags. There is additional information available when you look up subtags (such as what to use if the subtag is deprecated, and what subtags macrolanguages enclose, etc.), and more tests on well-formedness with clearer explanations of the problem.  <a href="http://rishida.net/utils/subtags/index.php?lookup=mg+valencia+avst+in&#038;submit=Look+up">Example</a>.</p>
<p>This should make it a lot more useful to people who haven&#8217;t read BCP 47 and want to create language tags.  Hopefully, in a short while, I&#8217;ll also write and link to an article that describes how to use subtags from the ground up in a procedural way, that will complement the tool.</p>
<p>For further assistance, you can now link from a language subtag result to the SIL Ethnologue, to make it easier to check whether that subtag really does refer to the language you were thinking of.</p>
<p>In addition, script subtag results link to Unicode blocks in UniView.</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=304</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>UniView 5.2(beta)a: Graphics as default</title>
		<link>http://rishida.net/blog/?p=293</link>
		<comments>http://rishida.net/blog/?p=293#comments</comments>
		<pubDate>Fri, 31 Jul 2009 06:33:19 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=293</guid>
		<description><![CDATA[&#62;&#62; See what it can do
&#62;&#62; Use it

Following hot on the heels of the last release come some further significant changes to UniView aimed at making it easier to use as Unicode grows.
The big change is that UniView now starts up in graphics mode by default. This means that pages load more slowly, but (especially [...]]]></description>
			<content:encoded><![CDATA[<p style="font-size: 110%"><a href="http://rishida.net/scripts/uniview/help">&gt;&gt; See what it can do</a></p>
<p style="font-size: 150%"><a href="http://rishida.net/scripts/uniview/">&gt;&gt; Use it</a></p>
<p style="float: right; width: 260px; margin-left: 1em; margin-bottom: 1em"><a href="http://rishida.net/blog/images/uniview52betaa.png"><img src="http://rishida.net/blog/images/uniview52betaa-small.png" alt="Picture of the page in action." /></a></p>
<p>Following hot on the heels of the last release come some further significant changes to UniView aimed at making it easier to use as Unicode grows.</p>
<p>The big change is that UniView now starts up in graphics mode by default. This means that pages load more slowly, but (especially with the continuing growth of Unicode) also means that you are more likely to be able to see the characters you are looking for. It&#8217;s easy to switch between modes at any point, using the &#8220;<span style="color:brown">Use graphics</span>&#8221; checkbox.  (And if you preferred font glyphs as a default, you just need to <a href="http://rishida.net/scripts/uniview/help.html#defaults">change the URI</a> in your bookmarked link slightly, and you can continue to work that way.)</p>
<p>To facilitate this change, I created my own graphics for a number of blocks which are not yet covered by decodeunicode, or which are no longer fully covered by decodeunicode. The blocks for which I provided graphics are <strong>Latin Extended-C, Latin Extended-D, Latin Extended Additional, Cyrillic Supplement, Cyrillic Extended-B, Modifier Tone Letters, Tibetan, Malayalam, Saurashtra, Ol Chiki, Myanmar, Kayah Li, Cham, Rejang, Vai, Supplemental Punctuation</strong>, and <strong>Miscellaneous Symbols and Arrows.</strong></p>
<p>There are still many characters for which there are no graphics (especially the new characters in Unicode 5.2), but coverage is much better than it was. As I find more fonts, I will be able to create graphics for the remaining characters.</p>
<p>I also put a grey box around the characters in tables. This is particularly useful if there are no graphics or font glyphs for a block or range of characters, as it makes it easier to locate the character you are looking for.</p>
<p>I also fixed a bug that was preventing Chrome and Safari and IE from displaying the first two Latin blocks. I think the bug was actually in the Unicode data file.</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=293</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>UniView 5.2beta available</title>
		<link>http://rishida.net/blog/?p=289</link>
		<comments>http://rishida.net/blog/?p=289#comments</comments>
		<pubDate>Mon, 27 Jul 2009 12:46:19 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=289</guid>
		<description><![CDATA[&#62;&#62; See what it can do
&#62;&#62; Use it

With the family now in Japan, I had some extra time to spare this weekend, so I upgraded UniView to handle all the proposed characters for Unicode 5.2.  
While the properties for new and modified characters are still in beta they are not officially stable, however the [...]]]></description>
			<content:encoded><![CDATA[<p style="font-size: 110%"><a href="http://rishida.net/scripts/uniview/help">&gt;&gt; See what it can do</a></p>
<p style="font-size: 150%"><a href="http://rishida.net/scripts/uniview/">&gt;&gt; Use it</a></p>
<p style="float: right; width: 260px; margin-left: 1em; margin-bottom: 1em"><a href="http://rishida.net/blog/images/uniview52beta.png"><img src="http://rishida.net/blog/images/uniview52beta-small.png" alt="Picture of the page in action." /></a></p>
<p>With the family now in Japan, I had some extra time to spare this weekend, so I upgraded UniView to handle all the proposed characters for Unicode 5.2.  </p>
<p>While the properties for new and modified characters are still in beta they are not officially stable, however the character allocations should be stable at this point. UniView therefore alerts you if you are looking at a new character. </p>
<p>If the Unicode database information has changed for a given character you are also warned, and provided with a link that points to the previous information for that character. These warnings will be removed from UniView when Unicode 5.2 is released.</p>
<p>Of course, you are unlikely to be able to actually see the new characters themselves, unless you are lucky enough to have a very new font to hand.  The graphic alternatives are not available yet for these characters.  I&#8217;m wondering whether it&#8217;s possible for me to do something about that, but that will take a little longer. In the meantime, you might find it more useful to view blocks in list view. (Click on &#8216;Show range as list&#8217;).</p>
<p>This release also fixes a few small bugs in the HTML and JavaScript code.</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=289</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>UniView 5.1.0d: Normalisation, New interface, Decimal code points, etc</title>
		<link>http://rishida.net/blog/?p=242</link>
		<comments>http://rishida.net/blog/?p=242#comments</comments>
		<pubDate>Sat, 14 Feb 2009 07:02:21 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=242</guid>
		<description><![CDATA[&#62;&#62; See what it can do !
&#62;&#62; Use it !

The major changes in this version include a new feature to normalise text as NFC or NFD, the ability to accept decimal code point values, and an overhaul of top part of the user interface.
Added buttons to the Text area to allow conversion of the text [...]]]></description>
			<content:encoded><![CDATA[<p style="font-size: 110%"><a href="http://rishida.net/scripts/uniview/help">&gt;&gt; See what it can do !</a></p>
<p style="font-size: 150%"><a href="http://rishida.net/scripts/uniview/">&gt;&gt; Use it !</a></p>
<p style="float: right; width: 260px; margin-left: 1em; margin-bottom: 1em"><a href="http://rishida.net/blog/images/uniview510d.png"><img src="http://rishida.net/blog/images/uniview510d-small.png" alt="Picture of the page in action." /></a></p>
<p>The major changes in this version include a new feature to normalise text as NFC or NFD, the ability to accept decimal code point values, and an overhaul of top part of the user interface.</p>
<p>Added buttons to the <span class="onscreen">Text area</span> to allow conversion of the text to NFC or NFD normalization forms. (You may not notice the change until you list the characters.)</p>
<p>The control panel was also substantially rearranged again to hopefully make it easier for newcomers to see what they can do.</p>
<p>The <span class="onscreen">Code point</span> conversion feature was upgraded to handle decimal code point values.</p>
<p>A single character in the codepoints area or text area is now listed in the lower left panel when you click on <img src="/rishida/scripts/uniview/images/apply.gif" alt=" " align="bottom" border="0"/>, rather than in the right-hand properties panel.  This is to improve consistency and avoid surprises.</p>
<p>Added a link to the CLDR property demo from the right panel to give access to additional properties.</p>
<p>Improved the parsing of codepoints when surrounded by text in the <span class="onscreen">Code point</span> input field, so that it now works with &amp;#x&#8230;;  and \u&#8230; and \U&#8230; escapes.</p>
<p>Jettisoned some unneeded code to reduce download by around 40-50K bytes. Implemented the NFC/NFD feature using AJAX, to avoid putting the download size back up.</p>
<p>When you delete the contents of the text area or the code point area, the associated input field is given focus, so you are ready for input.</p>
<p>A couple more minor bug fixes.</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=242</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Normalization code</title>
		<link>http://rishida.net/blog/?p=222</link>
		<comments>http://rishida.net/blog/?p=222#comments</comments>
		<pubDate>Wed, 04 Feb 2009 07:52:12 +0000</pubDate>
		<dc:creator>r12a</dc:creator>
				<category><![CDATA[code notes]]></category>
		<category><![CDATA[general]]></category>
		<category><![CDATA[i18n]]></category>
		<category><![CDATA[utilities]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://rishida.net/blog/?p=222</guid>
		<description><![CDATA[I was asked to make available the code for my normalization functions in JavaScript and PHP.  The links are below. I&#8217;m making the code available under a Creative Commons Attribution-Noncommercial-Share Alike licence. 
Disclaimers Note that I make no claim to have produced polished, compact or well-optimised code!  The code does what I need, [...]]]></description>
			<content:encoded><![CDATA[<p>I was asked to make available the code for my normalization functions in JavaScript and PHP.  The links are below. I&#8217;m making the code available under a <a href="http://creativecommons.org/licenses/by-nc-sa/3.0/">Creative Commons Attribution-Noncommercial-Share Alike</a> licence. </p>
<p><strong>Disclaimers</strong> Note that I make no claim to have produced polished, compact or well-optimised code!  The code does what I need, and I&#8217;m happy with that.  You are welcome to suggest improvements, and I&#8217;m sure there are many that could be made.</p>
<p>As they say, this code is made available in the hope that it will be useful, but without any warranty; without even the implied warranty of merchantability or fitness for a particular purpose.</p>
<p>The code is a little more convoluted that it ought to be, to get around the fact that JavaScript doesn&#8217;t understand supplementary characters, and PHP just doesn&#8217;t naturally understand Unicode. (How I long for PHP6.)</p>
<p><strong>Update: [[</strong>I meant to mention that there is a way of <a href="http://us.php.net/manual/en/class.normalizer.php">doing normalization in PHP</a> already.  I made this code available just because I had it.  I created it as a learning exercise. It may be useful, however, if you are unable to load the ICU and intl packages onto your server.<strong>]]</strong></p>
<p><strong>To use the code</strong>, simply call <code>nfc('your-text-string')</code> or <code>nfd('your-text-string')</code> from your code and capture the result.</p>
<p><strong>For PHP</strong> you&#8217;ll need <a href="http://rishida.net/code/showsource.php?source=normalization/n11n.php">these routines</a> and <a href="http://rishida.net/code/showsource.php?source=normalization/n11ndata.php">this data</a>.</p>
<p><strong>For JavaScript</strong> look at <a href="http://rishida.net/code/showsource.php?source=normalization/js/n11n.js">these routines</a> and <a href="http://rishida.net/code/showsource.php?source=normalization/js/n11ndata.js">this data</a>.  There is also a <a href="http://rishida.net/code/showsource.php?source=normalization/js/n11ndata-lite.js">lite version</a> of the data file that doesn&#8217;t include Han characters.  I use this sometimes for bandwidth savings (about 14K less).</p>
<p><strong>Test files</strong> I also created some test files for <a href="http://rishida.net/code/showsource.php?source=normalization/tests/n11ntestphp.php">PHP</a> and for <a href="http://rishida.net/code/showsource.php?source=normalization/tests/n11ntestjs.php">JavaScript</a>.<br />
Both of these expect to find a copy of <a href="http://www.unicode.org/Public/UNIDATA/NormalizationTest.txt">http://www.unicode.org/Public/UNIDATA/NormalizationTest.txt</a> in the local directory.  These files run 71,076 tests.</p>
<p><strong>Cautions</strong> Be careful about the editor you use for the data files.  I spent several hours fruitlessly debugging the routines, only to find that Notepad++ was displaying certain supplementary characters ok, but corrupting them on save.  I switched to Notepad and the problem evaporated. And I probably don&#8217;t need to add that editing the data files in something like DreamWeaver is a bad idea because it will probably normalize the data before saving.</p>
<p>Another point: you may see Unicode replacement characters at a couple of points in the PHP source.  These represent the first and last characters in the high surrogate range.</p>
<p><strong>Experimenting</strong> If you want to play with something that uses this you could try my <a href="http://rishida.net/scripts/pickers/tlicho/">Tłįchǫ (Dogrib) character picker</a>, or my <a href="http://rishida.net/tools/normalizer/">Normalizer</a> tool.  I will slowly fit this to all the pickers and to <a href="http://rishida.net/scripts/uniview/">UniView</a>.  I have a local version of UniView waiting in the wings that uses the PHP files via AJAX, to reduce download size.  For that you need a file that returns the result as plain text across the wire, such as <a href="http://rishida.net/code/showsource.php?source=normalization/getn11n.php">this</a>.</p>
<p>Well, I hope that that may be of use to someone, somewhere.  I hope I haven&#8217;t forgotten anything.</p>
]]></content:encoded>
			<wfw:commentRss>http://rishida.net/blog/?feed=rss2&amp;p=222</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
