Joanna About this site

About

How this site is organized and what it's for

Weblog start page

The start page contains the most recent 15 articles.

Home page
The main home page of my website, not my weblog. Currently not used.
------------------
Articles by month
Click here to get all the articles for a particular month.
This month's articles (if any)
Current month
Today's articles (if any)
Articles dated 2008/11/23 only

------------------
Subtopics

------------------
My email address
Site map
Search my weblog
Search for text on this site
You may have to use search
if I move files around!
Listing of all articles by date
Moving man
Flavours
There's more than one way to view this weblog; these links display the current page in other formats.
External links
These are a few of my favourite sites.
T E S T
Slashdot yesterday

Copyright © 2003-2007 Alternate Worlds Publishing, Boston MA USA


powered by blosxom -- www.blosxom.com
Wenhua dageming de zhongyao jiaoxun shi bixu fandui geren mixin
If I have been able to see further, it is because I am surrounded by midgets.
Never ascribe to stupidity that which can adequately be explained by malice.
"Your argument's repugnant and intriguing." "That's kinda my thing."

Danny's Weblog

This section is for articles which relate mainly to the Cambodian language, often referred to as Khmer. As with the rest of my site, the articles are presented in *reverse* chronological order. Also, they tend to represent things which I have discovered or speculated about which *supplement* the standard materials: this is not intended to present a free teach-yourself-Khmer course.

In particular, note that I originally focused on using the "Limon-type" fonts for Cambodian, as they were far more commonly used than Unicode. Although I believe Limon is still much more common, support for Unicode is so much better these days that more recent posts focus on Unicode. To get a balanced picture, you should read the entire folder.

You may also be interested in articles which refer to Asian languages in general: Asia/Language-misc

2004 Mar 02 [ Tue ]

Yet more info on using Khmer fonts

The following link is interesting, but I could not find one of the links on the page which led to the keyboard layout which seems to be actually *used* in PP.

www.angkor.com [http://www.angkor.com/fonts.shtml]

The layout I mean has the following characters available from the figure one key:

Unshifted: Khmer figure one

Shifted: Arabic (Western) figure one

Ctrl-alt: Exclamation point

Ctrl-alt-shift: Funny character like Khmer "r" with a bar through it (I just asked somebody and it means "riel", the Cambodian currency).

Incidentally, that link above mentions something about "zero-space fonts". The concept seems to be that by manually embedding non-printing characters between the characters of printable words, applications can automagically handle line breaks at the ends of words. I had certainly wondered before how this is administered, in English as much as Khmer. My impression is that there must be a flag on every character in a Truetype font which tells an application whether a character is an in-word character or a word separator character, but every time I tried to understand available info on Truetype fonts my head went round and round.

In the link you need to download some sort of special stuff, but actually I did notice when I was going through the table of alt-codes that alt-0173 puzzlingly did absolutely nothing. Maybe it's actually an intentional "zero-space" character. Hmmm. I just tried it, and alt-0173 has an interesting effect: it is indeed normally invisible, but if Wordpad needs to break a long "word" it indeed will prefer to break at an alt-0173 character. However, when it does so, it puts a hyphen at the end of the line, which seems suboptimal although still useful.

I just checked with somebody again (long-suffering chap) and although he had forgotten about the feature when I started to explain it, when I showed him how it worked he said he did remember it being mentioned in his computer classes, but apparently people don't actually use it. Wow. He said that people often put actual spaces between words when entering text on a computer, or alternatively are used to seeing line breaks inside words.

I just checked the Cambodian government page where you can download the Limon fonts. Here's a link to the page in Cambodian: www.cambodia.gov.kh [http://www.cambodia.gov.kh/unisql2/egov/khmer/home.view.html]

Of course, you need to have the Limon fonts installed already to get much out of this...

It's a little hard to see this on the original page, but if you cut and paste the text into Wordpad and of course set the selection to a Limon font at 36 pt, you can see a large number of (ordinary) spaces between the words. I can't read Khmer well enough to be sure, but it certainly seems to support what I'd been told.

I think I should try asking the same question at a print/design shop too. They probably put more effort into typographical issues.

Incidentally, I just discovered that alt-0160 is a non-breaking space, though that doesn't help problems in Khmer much.

Here is a link to a site which offers the "zero-space" system, as well as an interesting utility to convert Word documents using one font system into another: www.forum.org.kh [http://www.forum.org.kh/eng/download.htm]



I hope this information was useful. There may be a great deal more information on this site that is relevant to what you need. Take a look at the "site map" display at left; you can click on a topic to see many recent items on that topic.

This page is available for searching.

Debug: hittotal: 39 startban: 0 dancookie: endbandate: banned: 0 tempdate: tert: jse: jsno jsh: 39